123 Main Street, New York, NY 10001

O-RAN Radio Unit (O-RU) Architecture & Key IC Blocks

← Back to: Telecom & Networking Equipment

O-RAN Radio Unit (O-RU) is the fronthaul-facing RF endpoint that must keep EVM/ACLR/SEM stable by controlling the full RU loop—RF chain + DPD/PA/LNA biasing + JESD converter links + clock/jitter + PTP/SyncE timing—while proving it in the field with counters, telemetry, and logs.

This page shows what a “good RU” looks like, where failures typically originate, and how to validate and troubleshoot with measurable evidence instead of guesswork.

H2-1 · Scope, assumptions & what “good” looks like for an O-RU

This section locks the page boundary (O-RU only) and defines a measurable acceptance language for RF quality, link determinism, timing integrity, and thermal stability.

Why What this page covers (and what it does not)

An O-RU is responsible for converting the air-interface RF chain into deterministic digital sample streams and transporting them over fronthaul, while keeping linearity, synchronization, and field evidence under control.

  • O-RU responsibilities (in-scope): RF front-end partitioning, PA/LNA bias behavior, DPD/observation loop integration, data-converter interfaces (JESD204B/C), fronthaul PHY readiness (eCPRI over Ethernet), and timing inputs/distribution (PTP/SyncE/1PPS/ToD).
  • Explicitly out-of-scope: DU/CU baseband compute and scheduling, FEC engines, PCIe switching inside compute nodes, and transport-network internals (OTN/DWDM/ROADM).
  • Goal of this boundary: every metric and every troubleshooting step in this page must be provable from RU-side measurements, counters, or logs.
Engineering shorthand: If a claim cannot be validated at the O-RU boundary (spectrum, time error, JESD counters, fronthaul counters, thermal/telemetry logs), it does not belong in this page.

Define “Good RU” = four acceptance domains

“Good” is not a single number. A deployable O-RU must simultaneously meet RF emission quality, sampling determinism, time/frequency coherence, and stability across temperature and aging.

RF quality: EVM / ACLR / SEM / spurs Sampling determinism: JESD lane health / latency stability Timing: PTP time error / SyncE lock / holdover drift Stability: thermal derating point / calibration retention
Domain What “good” must demonstrate Primary RU-side evidence
RF emission quality Stable EVM/ACLR/SEM across output power steps, temperature corners, and configured carriers; spurs under control; predictable DPD gain. Spectrum/ACLR plots vs Pout; EVM vs integrated jitter; spur map; DPD on/off delta.
Sampling determinism JESD links maintain lane alignment and deterministic latency; error counters remain quiet under thermal and supply noise stress. JESD lane error counters; alignment status; deterministic-latency stability checks (LMFC/SYSREF alignment evidence).
Timing integrity Time/frequency remains coherent under SyncE/PTP lock, and degrades predictably in holdover with clear alarms and state transitions. PTP time error statistics; SyncE lock/holdover state; timebase switch logs; holdover drift trend.
Thermal & retention RF quality does not collapse at realistic site temperatures; protection and derating are orderly, observable, and repeatable. Thermal sensors + derating states; PA current/voltage; VSWR/reflection alarms; event logs with timestamps.

Verify RU acceptance checklist (what to measure, not just what to claim)

The checklist below is written as “measurement → evidence artifact.” Values are implementation-specific; the critical point is producing comparable evidence across operating modes and environmental corners.

Item How to measure (RU-side) Pass evidence artifact
EVM Capture EVM across power steps and temperature bins; isolate clock/jitter sensitivity by controlled clock conditions. EVM vs Pout and vs temperature plots; “DPD on/off” delta plots for the same carrier configuration.
ACLR / SEM Spectrum measurement at representative Pout and bandwidth; repeat with DPD enabled/disabled and after warm-up. ACLR/SEM tables + screenshots; spur map snapshots tied to timebase state and PA bias state.
PTP time error Log time error statistics under lock and during holdover; verify source switch and alarm behavior. Time error trend and histograms; “lock/holdover/source” state timeline with timestamps.
JESD lane health Monitor lane errors/alignment status during temperature ramps and supply-noise injections. Lane error counters staying flat; alignment status stable; deterministic-latency evidence around SYSREF/LMFC events.
Thermal derating point Drive RU into thermal stress; observe orderly derate states and RF quality degradation profile. Derate state ladder (Normal→Derate→Shutdown) with triggers; RF quality plotted against temperature and derate state.
Figure F1 — O-RU boundary and responsibility map (RF ↔ samples ↔ fronthaul ↔ timing)
O-RU boundary diagram Block diagram showing antenna and RF front-end, RF transceiver with ADC/DAC, DPD with observation receiver, fronthaul eCPRI Ethernet PHY, and timing inputs feeding the RU clock tree. O-RU Boundary (RU-side evidence) RF quality · Deterministic sampling · Timing integrity · Thermal stability Antenna Air interface Filters Duplexer / BPF PA / LNA Bias + protection RF Transceiver LO / mixers / DUC/DDC ADC / DAC interface Fronthaul eCPRI / Eth PHY DPD / CFR Adaptation + clipping ObsRx feedback Coupler Timing Inputs PTP / SyncE 1PPS / ToD Clock Tree Jitter cleaner + PLLs Distribution LO / ADC / DAC PHY ref / timestamps Telemetry & Protection Temp · currents · VSWR · fault logs Derating states

H2-2 · RF signal chain partitioning: where transceiver ends and PA/LNA/filters begin

This section makes responsibility explicit: each block “owns” a dominant set of impairments and has specific test taps that prove or falsify root causes.

Partition RU RF chain = blocks with accountable metrics

A useful partition is not “RF vs digital.” A useful partition answers: which block dominates which metric under realistic operating points, and which measurement tap proves it.

Block Dominant metric(s) it most often drives Fast proof tap (RU-side)
PA / driver ACLR and in-band distortion at high Pout; temperature-sensitive compression; interaction with DPD/CFR. ACLR vs Pout curve; DPD on/off delta; PA current/thermal correlation.
Filters / duplexer Out-of-band mask (SEM) and insertion-loss sensitivity; can distort emission when heated or mismatched. Spectrum shoulders vs temperature; return loss/VSWR alarms vs emission changes.
LO / synthesizer Phase-noise driven EVM degradation; spurs that appear as discrete spectral lines. Spur map; EVM vs clock condition; correlated with timebase states.
LNA / RF front-end NF and blocker tolerance; early overload signatures under strong interferers. Blocking test response; compression/overload counters; gain-state transitions.
ADC/DAC interface Sampling coherence and deterministic latency stability when using JESD; lane errors can masquerade as “RF quality” issues. JESD lane counters; alignment status; stability across thermal ramp.
Practical rule: When a metric changes with temperature and correlates with current/derating, suspect PA/bias/thermal. When it correlates with timebase state or manifests as spurs, suspect clock/LO/jitter distribution.

Tx Transmit chain (DAC → upconversion → PA → duplexer → antenna)

The Tx chain must meet emission masks and in-band modulation quality across power steps. The most valuable analysis is identifying which sub-block is limiting EVM versus limiting ACLR/SEM.

  • DAC/Tx baseband path (within RU): sets the baseline for in-band noise and images; becomes visible when PA is backed off and still EVM-limited.
  • Upconversion + LO: phase noise raises EVM floor broadly; spurs show up as discrete spectral lines or fixed-offset tones.
  • Driver/PA: dominates ACLR near compression; interacts with DPD adaptation quality and thermal headroom.
  • Duplexer/filters: enforce spectral mask; insertion loss and temperature drift can change output power and shoulder behavior.
Measurement What it proves Common failure signature
EVM vs Pout Separates “nonlinearity-limited” (worse at high Pout) from “clock/LO/jitter-limited” (floor-like behavior). EVM flat but poor across Pout → suspect clock/LO; EVM collapses near top power → suspect PA/DPD headroom.
ACLR vs Pout Quantifies PA compression and DPD effectiveness; confirms whether linearization tracks power steps. ACLR jumps at a repeatable Pout → PA bias/thermal or DPD mis-tracking.
Spur map Distinguishes discrete spur sources (LO, switching supplies, coupling) from broadband distortion. Fixed-offset tones that follow LO plan or timebase state changes.
VSWR / reflection Detects mismatch-driven behavior that can trigger protection or distort emission under real antennas/cables. Emission changes align with VSWR alarms and derating state transitions.

Rx Receive chain (antenna → filters/LNA → downconversion → ADC → digital)

The Rx chain is not “just sensitivity.” A deployable RU must remain stable under blockers and adjacent interferers, where overload and recovery behavior become the decisive factors.

  • Front-end filtering: defines how much blocker energy reaches the LNA/mixer; insertion loss directly impacts NF.
  • LNA bias and gain states: trade NF against linearity (IIP3); must avoid unstable gain toggling under real blocker dynamics.
  • Mixer/IF path: can generate intermodulation products that mimic real signals; LO leakage and IQ imbalance amplify confusion.
  • ADC dynamic range: overload recovery and clipping behavior determine whether digital compensation remains meaningful.
Measurement What it proves Common failure signature
NF / sensitivity Quantifies insertion loss + LNA bias health; establishes baseline performance when blockers are absent. Sensitivity shifts with temperature or bias rails → suspect LNA bias / front-end losses.
Blocking response Identifies the first overload point (LNA, mixer, ADC) via gain compression or recovery artifacts. Sudden desense events aligned with AGC transitions or overload counters.
IIP3 / intermod Separates linearity bottlenecks from noise bottlenecks; predicts behavior under multi-tone interferers. Phantom products at predictable offsets → intermod source in RF/IF path.
LO leakage / images Validates calibration integrity and shielding/ground coupling; explains repeatable “mystery tones.” Fixed tones that move with LO plan or appear after thermal or power state changes.

Choice Direct sampling vs IF (RU-only decision matrix)

This page treats the choice strictly as an RU integration trade-off, not an ADC tutorial. The decision is driven by filter realizability, dynamic range pressure, LO planning complexity, and calibration burden.

  • Front-end filtering feasibility: if adequate preselection is hard, direct sampling increases ADC overload risk under blockers.
  • Dynamic range & blockers: stronger blocker environment requires either more analog selectivity or more ADC headroom and careful gain distribution.
  • LO/image management: IF architectures shift complexity into LO planning and image rejection; direct sampling shifts complexity into sampling integrity and clock/jitter.
  • Calibration & retention: whichever path increases temperature-sensitive calibration, must be paired with retention evidence and field observability.
Figure F2 — Tx/Rx chain partitioning with dominant metrics + test taps
Tx/Rx chain dominant metrics per block Two parallel chains for transmit and receive, with each block labeled by its dominant metric and test tap points for quick root-cause proof. Partitioning for accountability Each block owns a dominant metric · Taps (TP) prove/falsify root causes TX path DAC / BB EVM floor Upconv + LO Spurs / PN Driver / PA ACLR Duplexer SEM Antenna VSWR TP1 TP2 TP3 TP4 TP1: baseband output · TP2: LO/spur observation · TP3: PA output/ACLR · TP4: antenna/VSWR RX path Antenna Input Filters Blocking LNA NF / IIP3 Downconv Images ADC Overload TP5 TP6 TP7 TP8 TP5: preselector output · TP6: LNA gain/overload · TP7: image/LO leakage · TP8: ADC overload & recovery Use the taps to shorten root-cause time If a symptom cannot be tied to a block via a tap/counter/log, it is not yet an actionable explanation.

H2-3 · DPD/CFR architecture: main path, observation path, and adaptation loops

In an O-RU, DPD is not a “nice-to-have algorithm.” It is a maintainable closed-loop system that must remain truthful (observation path), aligned (time/phase), and stable across temperature and power states.

Concept The three-piece DPD system (RU closed loop)

A practical RU implementation can be described as three cooperating subsystems that must be validated together—not in isolation.

Main path: waveform → PA → emission Observation path: coupler → ObsRx → samples Adaptation: learn → write back coefficients
  • Main path (truth target): the emitted spectrum and in-band modulation quality that must meet ACLR/EVM/SEM across power steps.
  • Observation path (truth source): a measurement chain that must be wide enough, linear enough, and quiet enough to represent the PA behavior.
  • Adaptation loop (maintenance): a controlled update mechanism with explicit “when to learn / freeze / relearn” rules tied to temperature and power states.
Design goal: DPD must remain effective when the RU transitions between power states and temperature bins—without requiring manual re-tuning.

Decisions Observation path: bandwidth, dynamic range, and coupler placement

The observation chain must not “lie.” If it compresses or filters away the information that DPD needs, the adaptation loop will learn the wrong model.

Decision Why it matters in a RU RU-side proof
Obs bandwidth Too narrow hides spectral regrowth and memory-related behavior; DPD appears “stable” but ACLR refuses to recover at high Pout. ACLR improvement vs bandwidth setting; residual shoulder comparison after warm-up.
Obs dynamic range If the Obs chain clips before the PA does, the learned correction is biased; convergence can be fast but wrong. Obs chain linearity check; DPD on/off delta collapses at top power → suspect Obs clipping.
Coupler location Coupling before/after filtering changes what DPD “sees” (PA only vs PA+front-end effects). Placement must match the intended correction boundary. DPD effectiveness consistency across antenna mismatch states (VSWR) and thermal corners.
Obs noise floor Observation noise becomes model noise → longer convergence and persistent residual distortion. Convergence time vs temperature; residual EVM/ACLR stability under fixed power state.

Alignment The non-negotiables: delay align + gain/phase align + retention rules

Alignment is the foundation of the closed loop. Misalignment often looks like “DPD does not work” even when PA behavior is correctable.

  • Delay alignment: the main-path and observation-path samples must represent the same waveform time reference (group delay mismatch is a common hidden failure).
  • Gain/phase alignment: amplitude and phase errors (including frequency-dependent response) must be calibrated; otherwise the learned correction is rotated or scaled incorrectly.
  • Retention policy: define when alignment is trusted, when it must be refreshed (temperature bin change), and when learning is frozen (unstable conditions).
Operational rule: When DPD performance changes abruptly with temperature while PA current and output power remain “normal,” suspect alignment retention before suspecting the PA.

CFR CFR ↔ DPD coupling: PAPR, headroom, and efficiency

CFR changes peak behavior (PAPR) and therefore shifts where the PA operates in its nonlinear region. The DPD loop must be validated with CFR enabled, not as a separate feature.

  • CFR stronger: reduces peaks and can improve thermal headroom, but adds intentional distortion that can become the EVM limiter.
  • CFR weaker: preserves waveform fidelity but pushes the PA deeper into compression, increasing ACLR pressure and reducing DPD margin.
  • Engineering order: set a CFR target → choose PA backoff/operating point → validate DPD margin across temperature bins.

Proof Evidence set: not just “DPD on/off”

DPD must be proven with reproducible artifacts that separate power-state effects, thermal effects, and observation-path integrity.

Artifact What it proves Failure signature
ACLR/EVM vs Pout Separates “PA-limited at top power” from “alignment/obs-limited across all powers.” DPD delta shrinks only at high Pout → PA deep compression or Obs chain clipping.
DPD on/off delta Quantifies correction margin and verifies the correction boundary matches coupler placement. ACLR improves but EVM worsens → CFR/DPD interaction or misalignment.
Convergence time vs temp bin Validates retention policy and observation SNR; demonstrates stable field maintenance behavior. Convergence slow in a specific temperature region → obs noise or model mismatch/retention gap.
Keep it RU-side: all proofs above are achievable without referencing DU compute pipelines, and can be tied to RU counters/logs.
Figure F3 — DPD closed-loop: main path, observation path, and alignment/calibration points
DPD closed-loop block diagram Closed-loop diagram showing main transmit path, observation receiver path with coupler, and adaptation engine writing back DPD coefficients. Alignment points for delay and gain/phase plus temperature bin retention are marked. DPD closed loop (RU-side) Solid: main path · Dashed: observation path · Arrow back: coefficient write-back Baseband Waveform DPD / CFR Predistort + clip Upconv + LO Mix / synth PA Nonlinearity Antenna Emission Coupler ObsRx Downconv + ADC BW / DR / noise Alignment Delay align Gain/phase align Adaptation Learn / update Freeze rules Coeff write-back Retention policy Temp bins · Power states Relearn triggers Evidence artifacts ACLR/EVM vs Pout · DPD on/off Convergence time vs temperature

H2-4 · PA biasing & supply strategies: average power, peak power, ET/Doherty implications

In a RU, PA bias and PA supply behavior are the root levers that shape linearity headroom, thermal margin, and the repeatability of protection and derating. This section stays RU-local: PA rails, bias control, sensing, and state behavior.

Map Bias types and why they change linearity + heat

Bias strategy is not an isolated circuit choice; it sets the PA operating point and how the PA reacts to peaks (PAPR), temperature drift, and protection events.

Bias approach What it optimizes Typical risk
Static bias Predictability and simplicity; stable behavior for a fixed operating point. Lower efficiency under varying traffic; higher thermal stress at average power.
Dynamic bias Better efficiency and thermal margin across power states and carrier loading. Bias noise / stability can leak into RF quality; requires clear state logic and validation.
Field lesson: abrupt ACLR collapse at a temperature corner often correlates with a bias or rail state transition, not with the “DPD algorithm.”

Rails PA supply strategies (RU-local): fixed, multi-rail, adjustable rail

PA supply choice determines peak capability, droop behavior, ripple coupling, and how cleanly the RU can derate while keeping emissions compliant. “Envelope tracking” and “Doherty” are mentioned only as RU integration implications—not as amplifier theory.

  • Fixed rail: simplest validation; droop and ripple must be controlled to avoid modulation of the RF envelope.
  • Multi-rail: supports operating-point selection and derating without extreme distortion; rail switching behavior must be observable and repeatable.
  • Adjustable rail: can improve efficiency; places tighter demands on loop stability and transient response (droop/recovery shows up as RF impairment).
  • ET/Doherty implication (RU view): improving efficiency changes the “shape” of PA nonlinearity and its temperature sensitivity—DPD proof must be repeated under the intended rail/bias modes.

Protect Sensing + protection + derating states (what must be logged)

A deployable RU requires protection behavior that is deterministic and auditable. Protection is not only for survival; it also preserves emission compliance by derating in a controlled manner rather than failing abruptly.

PA_V / PA_I Temp_PA / Temp_board VSWR / reflection Bias state Derate state
Trigger Expected RU behavior Log fields (minimum set)
OTP (over-temp) Stepwise derate → protect emission and hardware; clear state transitions. temp_PA, temp_board, derate_state, power_backoff%, timestamp
OCP/UV events Limit or reduce power; prevent oscillatory state flapping. PA_V, PA_I, rail_state, fault_cause, recovery_attempts
VSWR / mismatch Reduce power or mute carriers; protect PA while maintaining predictable alarms. vswr_metric, action_taken, derate_state, duration, timestamp
Evidence principle: Each protection event must be explainable from RU-local signals (rails, bias, temperature, VSWR) and must map to an intended derating state.

Proof Evidence set for bias/rails decisions

Bias and rail strategies are validated by showing the joint relationship between efficiency, emission quality, and thermal behavior—under the same modes used in deployment.

Artifact What it proves Failure signature
PAE vs ACLR Quantifies the efficiency/linearity trade-off and confirms operating-point selection is consistent. PAE improves but ACLR collapses at a specific mode → rail/bias pushing PA into deep compression.
Thermal rise vs state Demonstrates stability across temperature; verifies that derating keeps RF quality within bounds. Thermal corner produces abrupt emission failure aligned with state transition → state logic or sensor thresholds need tuning.
Protection event timeline Proves repeatable behavior and auditability; prevents “mystery” field failures. Frequent state flapping or missing logs → insufficient hysteresis / missing instrumentation.
Figure F4 — PA bias & supply control loop with sensing and derating states
PA bias and supply control loop Block diagram showing PA supply rails, current sensing, controller, bias DAC/LDO, PA, and telemetry inputs including temperature and VSWR feeding a derating state ladder. PA bias + supply (RU-local closed loop) Rails · sensing · protection · deterministic derating states PA Linearity + heat Peak stress PA Supply Fixed / multi-rail Adjustable rail Current Sense PA_I / PA_V Controller OCP / OTP / VSWR Derating logic Logs + alarms Bias DAC / LDO Static / dynamic Temp Sensors Temp_PA Temp_board VSWR Monitor Reflection Mismatch Bias State Static/Dynamic Rail mode Derating states Normal Warm (monitor) Hot (derate) Shutdown Temp/I/VSWR Temp trend OTP risk Critical State updates + logs

H2-5 · LNA biasing, gain control, and receiver linearity under blockers

A receive chain may look clean in the lab but collapse on site when strong adjacent or out-of-band signals appear. The difference is rarely “mystery RF”—it is usually gain staging, bias drift, and overload points interacting with the AGC state machine.

Goal What “good Rx” means in an O-RU (under blockers)

Receiver performance in a RU is defined by stability under realistic blocker conditions, not only by a best-case noise figure. The practical target is to keep the chain inside its linear operating region while the AGC makes repeatable decisions.

NF: sensitivity IIP3: blocker tolerance P1dB: compression margin AGC: stable state behavior
Site reality: A single strong neighbor can push the first nonlinear block into compression, then “everything after it” looks broken even when those blocks are healthy.

Bias LNA bias and gain staging: NF vs linearity is a RU decision

LNA bias sets transconductance and therefore influences both noise and linearity. In a RU, bias must be evaluated together with gain staging because too much early gain can force later stages into overload during blocker events.

Decision lever What it impacts How it fails on site
LNA bias point NF, compression margin, temperature sensitivity of gain and linearity. Hot corner: gain/linearity drift shrinks margin → sudden desense or distortion.
Front-end gain Noise contribution and headroom for mixer/IF/ADC under blockers. Lab looks fine; site blocker causes mixer/ADC overload despite “normal RSSI.”
Attenuation / step gain Protects later stages; sets the dynamic range distribution. Too coarse steps cause AGC hunting; too slow steps allow overload bursts.

AGC AGC must be deterministic: thresholds, hysteresis, and hold time

Under blockers, AGC is a control system. If thresholding and timing are not engineered, the chain can oscillate between gain states, creating intermittent EVM/BLER spikes that are difficult to reproduce without proper logs.

  • Fast protection vs slow optimization: a fast response prevents hard overload; a slower loop stabilizes quality once the chain is safe.
  • Hysteresis: prevents state flapping near a threshold; required for field stability.
  • Hold time: keeps gain stable long enough to observe quality metrics and avoid “self-inflicted” variability.
Verification focus: repeatability—two runs with the same blocker should produce the same gain state transitions and the same overload counters.

Blockers Where the chain saturates first: LNA, mixer/IF, or ADC

Strong blockers reveal the real limiting block. A RU should explicitly instrument overload signatures so “site collapse” becomes a diagnosable event.

First to saturate Typical signature RU-side proof
LNA compression Front-end distortion dominates; reducing later-stage gain does not recover emission quality. Blocking sweep shows early collapse; LNA gain/bias state correlates with event.
Mixer / IF overload Intermod products raise in-band noise-like floor; effects appear frequency-combination dependent. Adjacent-tone combinations trigger; IF gain step changes shift the failure boundary.
ADC clip / overload Hard clipping events; sudden EVM/BLER spikes with clear overload counter bursts. ADC overload counters align with throughput drops; recovery after gain step is immediate.

Cal RU-local calibration that matters under blockers

Under strong signals, small calibration errors become large performance problems. RU calibration must be treated as a retention problem across temperature bins, not a one-time factory adjustment.

DC offset IQ imbalance Temp drift Recal triggers
  • DC offset drift: shifts baselines and thresholds, which can bias AGC decisions and mask true sensitivity.
  • IQ imbalance drift: reduces image rejection and can fold blocker energy into the wanted band.
  • Retention rules: define when to refresh calibration (temperature bin change, large gain-state change, or repeated overload events).

Proof Evidence set: blocking sweeps + temperature curves + event counters

The receiver must be proven with repeatable artifacts that connect RF behavior to RU-visible counters and state transitions.

Artifact What it proves Minimum instrumentation
Blocking test matrix Identifies the first saturation block and validates AGC behavior under stress. AGC_state, gain_step, overload_cnt (by stage if available), timestamp
NF/gain vs temperature Shows margin erosion and the temperature bin where collapse becomes likely. temp_bin, LNA_bias_state, gain, NF proxy/metric
Overload event timeline Converts “site failure” into a diagnosable event with reproducible signatures. overload_cnt, relock_cnt, calibration_refresh_cnt, temp
Figure F5 — Rx AGC and overload points map (three red saturation points)
Receiver AGC and overload points map Block diagram of the RU receive chain with AGC controller and three red overload points marked at LNA compression, mixer/IF overload, and ADC clip. Rx chain under blockers (RU-side) AGC controls gain states · Red dots mark likely first saturation points Antenna Input Filter Select LNA NF / IIP3 Mixer IMD IF / VGA Gain steps ADC Clip / DR Digital Rx Metrics + logs AGC Controller Thresholds Counters Overload / relock AGC control Overload #1 LNA compression Overload #2 Mixer/IF Overload #3 ADC clip

H2-6 · JESD204B/C link planning: lanes, clocks, SYSREF, LMFC, deterministic latency

A JESD link can come up and still behave “strangely” when deterministic latency is not controlled, SYSREF capture is inconsistent, or board-level margin is eroded by routing, jitter, or supply noise. This section stays RU-local: ADC/DAC ↔ FPGA/SoC.

Objects The minimal JESD set that explains 90% of RU issues

RU design and validation can be organized around a small set of objects that directly map to bring-up behavior and timing repeatability.

Lane count Lane rate Subclass SYSREF LMFC Deterministic latency
Practical interpretation: “Link up” is necessary, but deterministic latency repeatability across resets and temperature is what makes the RU predictable.

Clocks Clock domains in RU: device clock, SerDes reference, SYSREF

JESD behavior is governed by relationships between three timing domains. The engineering focus is consistency and capture windows, not a textbook discussion of clock theory.

  • Device clock: defines the ADC/DAC sampling timebase and the internal framing rhythm.
  • SerDes reference: anchors the serial link and impacts lane margin and error behavior.
  • SYSREF: aligns LMFC phase for deterministic latency (when used); capture consistency is critical.
RU validation rule: deterministic latency must be verified across reset cycles and temperature bins, not only at room temperature after one boot.

Board Why “strange performance” happens: routing, jitter, and supply noise

When margin is thin, the link may remain up while quality degrades intermittently. RU proof requires lane-resolved counters and a bring-up state log.

Board-level factor Typical symptom RU-side evidence
Channel loss / reflections Lane errors appear on specific lanes; behavior worsens at temperature. Lane error counters by lane; BER/PRBS margin trend.
Jitter / reference instability Intermittent alignment issues; deterministic latency drifts across resets. Bring-up stage where failures occur; LMFC lock / SYSREF capture events.
Supply noise on SerDes/PLL “Good then bad” behavior under load or temperature; bursts of lane errors. Error bursts correlated with rail telemetry and temperature bin changes.
SYSREF distribution Link up but phase repeatability is inconsistent; startup-to-startup changes. SYSREF capture window violations; deterministic latency stability test fails.

Bring-up Deterministic bring-up requires a state log (RU-local)

Field reliability depends on knowing exactly where bring-up fails and whether failures are lane-specific or systemic. A minimal RU log should record both the bring-up stage and lane-resolved errors.

Log item What it explains Example fields
Bring-up stage Shows whether failures occur early (link training) or late (alignment / data phase). stage_id, fail_reason, retry_cnt
Lane alignment / deskew Identifies weak lanes and boundary conditions (temperature, load). lane_id, align_err_cnt, elastic_events
SYSREF / LMFC events Explains deterministic latency repeatability across resets. sysref_seen, capture_ok, lmfc_phase_id
BER / PRBS Separates physical margin issues from alignment/latency issues. ber, prbs_lock, time_window
Figure F6 — Simplified JESD timing: SYSREF capture window and LMFC alignment
JESD clock and SYSREF timing diagram Simplified timing diagram with device clock ticks, LMFC boundaries, SYSREF pulse, and a highlighted SYSREF capture window that sets deterministic latency alignment. JESD timing (simplified) Key idea: SYSREF must be captured consistently relative to LMFC for deterministic latency Device clock LMFC SYSREF ticks LMFC period LMFC period LMFC period LMFC boundary SYSREF Capture window Deterministic latency depends on consistent SYSREF capture relative to LMFC Validate across resets + temperature bins · log SYSREF/LMFC events + lane errors

H2-7 · Clock tree and jitter budgeting (LO, converters, JESD, Ethernet)

In an O-RU, clocks are not a shared utility—they are a performance boundary. A “working” RU can still fail EVM, create spurs, or show intermittent JESD/Ethernet errors if the clock tree is not budgeted, cleaned, and instrumented per branch.

Consumers Three clock consumer classes and what each one punishes

A single root clock typically fans out into multiple domains, but the consumers care about different noise shapes and failure signatures. Jitter budgeting starts by separating these classes and giving each a branch-specific budget.

Consumer What it is sensitive to RU-visible symptom
LO PLL / Synthesizer Phase noise and reference contamination that lifts skirts or injects deterministic spurs. Spur map changes, adjacent leakage shifts, EVM degrades without obvious power change.
ADC/DAC sampling Integrated sampling jitter that directly reduces SNR and widens error vectors. EVM worsens; SNR drops; performance changes with clock-cleaner mode or temperature bin.
SerDes / Ethernet PHY Reference instability that shrinks eye margin and raises BER/PCS error probability. CRC/PCS errors, link flaps under load, error bursts correlated with rail noise or heat.
Design intent: treat each domain as a customer with an explicit budget, not as “whatever the root provides.”

Cleaner What a jitter-cleaner PLL must do in a RU

In practice, a RU clock IC is a system function: it cleans noise, distributes a stable reference, and provides holdover behavior when external references degrade or disappear. The key is to budget additive jitter per stage and prove it at measurement points.

  • Clean: shape input noise so downstream domains meet their budget (loop bandwidth and reference quality matter).
  • Distribute: fanout and buffers add their own jitter—each branch must account for additive contributions.
  • Holdover: keep a stable output when references are lost (validated in H2-8 as a timing requirement).
Branch budget: xx fs / xx ps Cleaner additive: xx fs Buffer additive: xx fs Local coupling: xx fs

Budget A RU-friendly budgeting rule: root → cleaner → fanout → branch load

A usable budget is not a single “total jitter” number. It is a per-branch limit that includes additive noise from each distribution step. Keep the math simple in documentation: define the branches, list the contributors, then verify each measurement point.

Branch Budget decomposition (placeholders) Primary outcome
LO reference Budget (xx fs) = root + cleaner additive + fanout additive + local coupling Spur / phase-noise driven EVM
ADC/DAC sampling Budget (xx fs) = root + cleaner additive + buffer additive + board coupling SNR / EVM floor
JESD clocks Budget (xx ps) = ref source + distribution additive + supply/jitter coupling Lane margin / error bursts
Ethernet PHY ref Budget (xx ps) = ref source + distribution additive + EMI/rail coupling BER → CRC/PCS errors
Documentation pattern: each branch gets one line-item budget and one “what breaks first” symptom label.

Symptoms Bad clock symptoms: map the symptom to a measurement point

Symptoms become actionable when a RU can connect them to a branch and a measurement point. Keep the mapping short and testable.

Symptom Most likely branch Minimum evidence
EVM degrades ADC/DAC sampling clock or LO reference. EVM vs integrated jitter trend (same RF power, same temperature bin).
Spurs appear LO reference contamination or coupling into PLL loops. Spur map repeatability; changes with clock-cleaner mode or rail condition.
JESD errors JESD ref/device clock margin and rail noise coupling. Lane error bursts aligned with jitter point changes or temperature transitions.
CRC/PCS errors Ethernet PHY reference domain and local coupling. CRC/PCS error counters correlated with ref quality or rail noise events.

Proof Evidence set: phase noise/jitter points + EVM curve + spur map

RU clocking must be proven as a measurable system. The most useful artifacts are branch-level measurement points and one performance curve per sensitive domain.

  • Measurement points list: root reference, cleaner outputs, fanout outputs, and each branch endpoint.
  • EVM vs integrated jitter: demonstrate monotonic sensitivity for the converter path under controlled conditions.
  • Spur map: show repeatable spur signatures and how they move with clock/rail conditions.
Figure F7 — Clock tree with per-branch jitter budgets (placeholders)
Clock tree with jitter budgets Block diagram of RU clock tree showing root reference, jitter cleaner PLL, fanout, and four branches with jitter budget placeholders in fs/ps for LO, ADC/DAC, JESD, and Ethernet PHY. RU clock tree and jitter budgets Budget per branch · Keep placeholders here (xx fs / xx ps) for the execution refinement Root Source OSC / Ref in jitter: xx fs Jitter Cleaner / PLL clean · distribute · holdover additive: xx fs Fanout buffers add: xx fs LO PLL / Synth phase noise budget: xx fs ADC/DAC Clock EVM / SNR budget: xx fs JESD Domains margin / errors budget: xx ps Ethernet PHY BER / CRC budget: xx ps Symptom map EVM ↑ · spurs · JESD errors · CRC/PCS errors measure points log counters

H2-8 · Timing and synchronization (SyncE/PTP/1PPS/ToD): what the RU must guarantee

RU synchronization is a contract: which inputs are supported, how the timebase is selected and disciplined, what happens during reference loss, and which evidence the RU exposes (time error, lock/holdover state, and source switching history).

Inputs Sync inputs commonly present on an O-RU

RU timing interfaces typically include frequency references, time references, and packet-based timing. The key is not protocol detail, but the RU’s guarantees and observable state.

PTP (1588) over Ethernet SyncE 1PPS ToD (Time-of-Day) Local OSC
RU responsibility: expose which source is active and whether it is locked or in holdover, with a numeric time error metric (TE).

Use How the RU uses timing internally (RU-local distribution)

Timing is consumed by the RU timebase and then distributed into internal domains that require repeatable phase and timestamps. This section stays RU-local: distribution to LO reference, JESD-related domains, and timestamp units.

RU consumer Why it needs disciplined timing What to observe
Clock-tree root Defines the stable reference that all performance branches inherit (H2-7). Locked/holdover, TE, source
LO distribution Maintains repeatable phase behavior and reduces time-varying spurs/quality shifts. TE trend vs EVM/spurs
JESD domains Supports consistent timing behavior and reduces error bursts due to reference drift. Lane errors vs TE bins
Timestamp unit Ensures consistent time tagging and externally visible alignment evidence. PTP stats + TE

Holdover Reference loss behavior: controlled degradation, not silent drift

Holdover must be specified as an RU state with alarms, numeric drift evidence, and a recovery strategy. The goal is predictable behavior during sync loss and transparent proof after recovery.

State What the RU does Evidence to expose
Locked Disciplines timebase to selected reference; maintains stable distribution. sync_source, sync_state=locked, TE
Holdover Free-runs on local quality with defined drift behavior and explicit alarms. sync_state=holdover, holdover_elapsed, TE drift
Degraded / Out-of-spec Signals that timing contract is exceeded; optional service limitation is a system decision. alarm_id, TE threshold crossing, switch history
Key rule: every source loss and source switch must be timestamped and recorded with a reason code.

Evidence What the RU must report for validation and operations

RU timing must be auditable. The minimum evidence set should make “timing issues” diagnosable without guessing.

Field Purpose
sync_source Which input is currently selected (PTP / SyncE / 1PPS-ToD / Local).
sync_state locked / holdover / degraded / out_of_spec.
time_error (TE) Numeric error metric and a trend over time (curve evidence in validation).
holdover_elapsed How long the RU has been running without an external reference.
source_switch_history Count, timestamps, and reason code for every source switch.
ptp_stats (names only) Offset/delay/servo state counters as fields, without protocol exposition.

Tests Validation proof: TE curve, lock time, and holdover drift

Timing validation should be curve-driven and state-aware. A RU passes when the TE curve and state transitions are consistent across resets and temperature bins.

  • TE curve: locked → reference loss → holdover → recovery, with clear state markers.
  • Lock time: distribution of time-to-locked across restarts.
  • Holdover drift: TE growth vs time in holdover; validate threshold crossings and alarms.
Figure F8 — Sync inputs → source select → timebase discipline → distribution → alarms/logs
Synchronization block diagram with source select and state machine Block diagram showing SyncE/PTP/1PPS/ToD/local oscillator inputs into a source select MUX, a discipline/servo block feeding the clock tree root, distribution to LO/JESD/timestamp units, and an alarm/log telemetry block driven by a small lock/holdover/degraded state machine. RU timing and synchronization contract Expose TE + lock/holdover state + source switching history PTP over Ethernet SyncE frequency 1PPS / ToD pulse + time Local OSC holdover Source Select MUX sync_source Timebase Discipline servo / filtering TE metric Clock Tree Root disciplined ref LO / PLLs JESD Domains Timestamp Unit State Machine locked / holdover Locked Holdover Degraded Alarms / Logs TE · source switch TE drives state

H2-9 · eCPRI/Ethernet fronthaul hardware (PHY, timestamping, resilience, modules)

A fronthaul link being “up” does not prove it is stable. RU-side hardware must be partitioned into measurable segments, with explicit tap points and counters that make link quality, latency, and timestamp stability auditable.

Partition RU-side fronthaul datapath: what ends where

The goal is a segment-by-segment view that separates “traffic moves” from “margin exists” and “timestamps stay stable”. Keep the scope RU-local: MAC/PCS/PHY/SerDes and the external module/interface.

Segment What to verify (RU-local) Evidence hook
MAC (SoC/FPGA) Queueing and internal timing boundaries; where hardware timestamping is taken. TS tap point; latency deltas
PCS Encoding / FEC behavior and error observability under stress conditions. PCS/FEC counters
PHY Link training stability, margin sensitivity vs temperature/power events. CRC/PCS error bursts
SerDes Reference-clock cleanliness and coupling effects at interface level. BER proxy via CRC/PCS
SFP/QSFP module LOS behavior and DDM trends (temperature, Tx/Rx power drift). DDM/LOS events
Acceptance intent: prove stability by segment counters and timestamp stability—not by link-up alone.

Tap points Timestamp and delay observability: 2–3 taps that matter

Timestamp stability becomes diagnosable only when its measurement point is explicit. Use a small set of tap points that can be correlated with counters and environmental telemetry.

  • Tap-A (MAC TS): hardware timestamp boundary used for stability assessment (distribution over time and temperature).
  • Tap-B (PCS/PHY): counter boundary where CRC/PCS/FEC error bursts are visible (quality and margin proxy).
  • Tap-C (Module/I/O): LOS/DDM boundary for optical/electrical interface symptoms (drift and flap correlation).
TS stability: p50/p99 delta CRC/PCS/FEC bursts LOS/LOF events Link flap count

Resilience “Up but not stable”: failure modes and the RU-side counters that separate them

RU field issues usually look similar at first (drops, retries, missing deadlines). The RU must separate them using counters and event timing.

Observed symptom Likely RU-local root class Minimum proof
Error bursts without link down Margin sensitivity (temperature/power coupling) in PHY/SerDes domain. CRC/PCS/FEC burst timing + temperature bin
Link flaps LOS/LOF events, module interface instability, or hard threshold crossing. LOS/LOF + flap statistics + DDM snapshot
Timestamp instability TS tap jitter/drift due to reference quality at interface level. TS delta distribution vs temperature/load
Slow drift → sudden failure Module/connector degradation or thermal drift reducing margin. DDM trend + rising counters prior to flap
Scope boundary: this chapter stops at RU hardware observability; protocol and network-level timing selection are covered elsewhere.

Modules RU module choices: what to care about (without turning into an optics chapter)

Modules are treated as observable endpoints. The RU should rely on module diagnostics and link counters, not assumptions.

  • DDM/DOM completeness: temperature + Tx/Rx power + alarm flags must be readable and logged.
  • LOS behavior: verify LOS thresholds and persistence; correlate with flap events.
  • CDR/retimer (mention only): used as an engineering lever to restore margin when board/channel constraints demand it.

Validation RU fronthaul acceptance checklist (hardware-focused)

A robust checklist proves stability across environmental bins and operational stress, using only RU-visible evidence.

  • Counters: CRC / PCS / FEC, LOS / LOF, flap count (with timestamps).
  • TS stability: delta distribution (p50/p99) vs temperature bins and traffic stress.
  • Correlation: error bursts aligned with module DDM changes and RU power/thermal events.
  • Recovery behavior: flap recovery time and post-recovery counter reset/continuity policy (documented).
Figure F9 — Fronthaul datapath with counters and timestamp tap points
RU fronthaul datapath and observability taps Block diagram from MAC to PCS, PHY, SerDes and SFP/QSFP module. Shows three tap points for timestamp and counters, plus evidence labels for CRC/PCS/FEC and LOS/LOF. RU fronthaul hardware: datapath + evidence hooks MAC/PCS/PHY/SerDes + module · taps enable correlation RU internal boundary MAC SoC / FPGA PCS FEC / encode PHY link margin SerDes refclock SFP / QSFP module DDM / LOS TS Tap-A MAC TS Tap-B CRC/PCS/FEC Tap-C DDM/LOS CRC · PCS err · FEC err link flap count · LOS/LOF events TS delta distribution · p50/p99 External I/O + module evidence Evidence keys Counters (CRC/PCS/FEC) · Events (LOS/LOF/flap) · Timestamp stability (TS delta)

H2-10 · Telemetry, protection, and thermal (field stability and traceability)

Field stability is achieved by closing the loop between measured stress (thermal, power, VSWR), controlled actions (derating ladder), and auditable evidence (logs and counters) that explain any RF KPI drift.

Telemetry RU signals that directly predict RF KPI drift

Telemetry must prioritize signals that move PA operating point or trigger non-linear behavior in a repeatable way. Keep the set minimal but sufficient for correlation.

PA rail V/I PA temperature Board hot-spot temperature Fan RPM / fault VSWR / reflected power Protection state
Field intent: every RF KPI excursion should be explainable by a small set of measured stressors and a known protection state.

Protection Derating ladder: controlled degradation instead of abrupt collapse

Protection is most effective when it is staged and hysteretic. Each level has a clear trigger, an action, and a recovery condition. This avoids oscillation and prevents silent out-of-spec operation.

State Trigger examples (2 signals) Action and evidence
Normal Temp within nominal band; VSWR within nominal band. Full performance; record periodic snapshots.
Warm Rising hot-spot temp; fan approaching high duty. Increase cooling, minor bias guard; log temp bin entry.
Hot (Derate) PA temp high; PA current near limit. Reduce power; optionally disable a subset of carriers; log action_taken.
Critical VSWR/reflected power high; temperature beyond safe limit. Shutdown; raise alarm; capture fault_cause + snapshots.
Hysteresis rule: every state transition requires a stable dwell time or a recovery threshold to prevent rapid toggling.

Traceability Field logs: minimum schema that makes every incident reproducible

Logs should be field-replayable: what happened, why it happened, what action was taken, and what the RU looked like at that moment. Use a compact key set that ties thermal/power/VSWR to DPD/sync/link context.

Field Why it is non-optional
timestamp, uptime Aligns events with counters and external site alarms; supports correlation windows.
temperature_bin Explains repeatable KPI shifts across environmental conditions.
protection_state, action_taken Proves controlled degradation and prevents “silent derate” ambiguity.
fault_cause Distinguishes thermal, VSWR, rail OCP/OTP, and sensor faults.
dpd_state Separates RF drift from linearization state changes (enabled/converging/frozen/fallback).
sync_state Correlates timebase state (locked/holdover/degraded) with KPI stability and link timing evidence.
eth_error_burst, link_flap_count Captures fronthaul instability context without protocol exposition.
pa_v, pa_i, pa_temp, board_temp, vswr Provides the minimal snapshot needed to replay thresholds and validate protection correctness.

Validation Field stability proof: step-temperature + KPI hold + transition correctness

Validation is strongest when it is state-aware. Demonstrate that RF KPIs remain within limits across temperature bins, and that protection transitions occur at documented thresholds with consistent evidence.

  • Step-temperature: apply discrete plateaus and record EVM/ACLR with telemetry and protection state.
  • Transition correctness: verify trigger thresholds, hysteresis, and recovery conditions (no oscillation).
  • Event replay: any derate/shutdown event must be explainable using the minimum log schema.
Figure F10 — Thermal & protection state ladder (with triggers and actions)
Thermal and protection state ladder A four-state ladder from Normal to Warm to Hot Derate to Critical Shutdown. Each state lists two trigger signals and a short action label. Includes hysteresis note and a minimal log fields box. Thermal & protection ladder (field stability) State-driven actions · hysteresis · auditable logs Normal Triggers: Temp OK · VSWR OK Action: full power · periodic snapshots Warm Triggers: Hot-spot ↑ · Fan duty ↑ Action: cooling up · guard bias Hot (Derate) Triggers: PA Temp high · PA I near limit Action: reduce power · optional carrier cut Critical (Shutdown) Triggers: VSWR high · Temp beyond safe Action: shutdown · alarm · capture snapshot Hysteresis / recovery rule Return transitions require cool-down or threshold margin to prevent oscillation. Minimum incident log fields timestamp · uptime temperature_bin protection_state fault_cause dpd_state sync_state pa_v · pa_i · pa_temp · board_temp · vswr · eth_burst Validation evidence Step-temperature → EVM/ACLR hold · Transition correctness · Event replay EVM/ACLR TELEMETRY + STATE

H2-11 · Validation & troubleshooting playbook (symptom → root cause → proof)

This section provides a shortest-path field workflow for O-RU issues using RU-local evidence only: measurement points, counters, and structured logs. Each symptom maps to 2–3 likely root causes, required proof, a corrective action, and a retest criterion.

How to use The shortest-path rule (do not “theory dive” first)

  • Step 1 — Classify: decide whether the dominant symptom is RF KPI, Clock/Sync, JESD, or Fronthaul.
  • Step 2 — Prove: gather one hard evidence packet: spur map / jitter or TE curve / lane counters / CRC-PCS-FEC bursts.
  • Step 3 — Act: apply the smallest RU-local corrective action (alignment, clock cleanup, rail/bias/thermal guard, SI constraint) and retest.
Spur map Integrated jitter TE / holdover drift JESD lane counters CRC/PCS/FEC bursts DDM/LOS + flap stats
MPN note: part numbers below are examples used as engineering anchors (not a recommended BOM).

Symptom table Symptom → likely root causes → proof → fix → retest

Symptom (RU-side) Likely root causes (2–3) Proof required (meas/counters/logs) Corrective actions (RU-local) Example MPN anchors
ACLR/EVM worsens
or spurs jump
  • LO/clock spurs or integrated jitter increases
  • DPD/CFR mis-tracks due to alignment or thermal bin
  • PA rail ripple / bias drift shifts operating point
  • Spur map vs reference/divider relationships
  • EVM vs integrated jitter correlation across bins
  • Logs: dpd_state, temperature_bin, pa_v/pa_i, protection_state
  • Clock/PLL cleanup at interface level; re-check spur map
  • DPD re-alignment / bin calibration; compare DPD on/off
  • Rail/bias guard + thermal stabilization; rerun step test
LMX2594 Si5341A
DPD does not converge
or temperature-sensitive
  • Observation path saturates or lacks dynamic range
  • Main/Obs alignment drifts with temperature/clock
  • PA bias/rail changes too rapidly for stable adaptation
  • DPD on/off: ACLR/EVM gain disappears or reverses
  • ORx stats: clipping/near-noise-floor indicators
  • Temperature step: failure only in specific bins
  • Fix ORx operating region (avoid clipping, ensure SNR)
  • Lock alignment references; rebuild calibration points
  • Stabilize PA operating point (bias/thermal guards)
ADRV9009
JESD intermittent
lane errors
  • SYSREF/LMFC capture window is too tight
  • SerDes margin collapses under noise/thermal stress
  • SI boundary (routing/connector) triggers at temperature
  • Lane counters / alignment / bring-up state logs
  • Check SYSREF relationship at RU clocking boundary
  • Temperature sweep: error bursts cluster by bin
  • Widen deterministic window (SYSREF distribution discipline)
  • Clean converter/JESD clocks and their supply domains
  • Constrain rate/equalization; validate with A/B changes
AD9528 DS280DF810
Sync unlock
or frequent holdover
  • Input source quality/switching policy causes churn
  • Holdover parameters/limits do not match RU requirements
  • Distribution branch is more sensitive than expected
  • TE curve + locked/holdover transitions
  • Lock time + holdover drift across bins
  • Logs: sync_state, sync_source, source_switch
  • Enforce source select state machine + hysteresis
  • Tune holdover behavior; document drift bounds
  • Retest by forced source loss and controlled recovery
ZL30732A 8A34001
Eth link flap
or error bursts
  • Module/interface drift (DDM trends) reduces margin
  • PHY/SerDes refclock or supply coupling causes bursts
  • Channel/connector SI boundary triggered by environment
  • CRC/PCS/FEC bursts + flap timing
  • DDM snapshot (temp, Tx/Rx power) aligned to bursts
  • Tap correlation: MAC TS vs PCS counters vs module LOS
  • Separate by taps; isolate module vs RU-side refclock
  • Stabilize refclock/supply domains; retest error rate
  • Validate SI margin using controlled A/B changes
88X3310 DS280DF810
Retest rule: corrective action is accepted only when the original symptom disappears and the proof metric improves (e.g., spur map cleans up, TE curve stabilizes, lane errors stop, counters stop bursting, flap count stays flat).

Evidence scripts Proof as executable steps (copy into lab/field checklist)

Script S1 — RF KPI drift (EVM/ACLR/spurs)

  • Capture spur map across the problematic condition (temperature bin or output power point).
  • Measure integrated jitter / phase-noise proxy at the RU clock boundary; correlate with EVM change.
  • Record log snapshot: dpd_state, temperature_bin, pa_v, pa_i, protection_state.
  • Pass/Fail: after corrective action, spur lines reduce and EVM/ACLR return within acceptance limits at the same bin.

Script S2 — DPD convergence

  • Run DPD on/off A/B at fixed output power; compare ACLR and EVM deltas.
  • Check ORx operating region (no clipping; not near noise floor); lock the sampling reference.
  • Repeat across temperature bins to confirm convergence stability (no “single-bin collapse”).
  • Pass/Fail: convergence time is bounded and ACLR improvement is consistent across bins.

Script S3 — JESD intermittent

  • Log lane counters and bring-up state transitions around the event; identify burst timing.
  • Validate SYSREF/LMFC relationship at RU clock distribution (window not edge-clinging).
  • Temperature sweep at fixed configuration; note bin where errors rise.
  • Pass/Fail: lane errors do not occur over the full sweep and deterministic behavior remains stable.

Script S4 — Sync unlock / holdover churn

  • Collect TE curve and sync_state transitions; annotate source changes.
  • Force controlled source loss; verify deterministic fall to holdover and bounded drift.
  • Restore source; verify lock time and transition hysteresis (no rapid toggling).
  • Pass/Fail: churn disappears, drift stays within bound, and logs fully explain transitions.

Script S5 — Fronthaul error bursts / flaps

  • Read CRC/PCS/FEC counters with timestamps; mark burst windows.
  • Read module DDM/LOS snapshots aligned to bursts; correlate with temperature and optical power drift.
  • Use RU tap points: MAC TS vs PCS counters vs module indicators to isolate segment.
  • Pass/Fail: bursts stop under the same stress profile and flap count stays flat.
Figure F11 — Troubleshooting decision tree (3 layers, 2 checks per layer)
Troubleshooting decision tree for RU issues A three-layer decision tree. Layer 1 splits RF KPI vs Digital faults. Layer 2 splits Clock/Sync vs DPD on RF side, and JESD vs Fronthaul on Digital side. Layer 3 provides two checks per branch with evidence tokens. RU troubleshooting flow (symptom → proof → action) 3 layers · 2 checks per layer · evidence-first routing Start: dominant symptom? Choose the shortest proof path RF KPI issue EVM / ACLR / spurs Digital / link issue Sync / JESD / fronthaul Clock / LO / jitter spur map · jitter correlation DPD / ORx loop DPD on/off · ORx health JESD link lane errors · SYSREF Fronthaul (Eth/eCPRI) CRC/PCS/FEC · flap · DDM Check 1 spur map pattern Check 2 EVM vs jitter Check 1 DPD on/off Check 2 ORx headroom Check 1 lane counters Check 2 SYSREF window Check 1 CRC/PCS/FEC Check 2 DDM + flap Evidence tokens (keep it minimal) spur map TE / holdover lane errs CRC/PCS/FEC DDM / LOS DPD A/B

Request a Quote

Accepted Formats

pdf, csv, xls, xlsx, zip

Attachment

Drag & drop files here or use the button below.

H2-12 · FAQs (O-RU focused)

Each answer stays RU-side and ends with a “Proof hook” (what to measure/read/log) plus a quick next step.

Q1 In O-RU acceptance, what boundaries do EVM, ACLR, and SEM each represent?
EVM is the modulation-accuracy boundary (phase noise/jitter, IQ errors, nonlinear distortion that lands in-band). ACLR is the adjacent-channel leakage boundary (spectral regrowth and out-of-band energy). SEM is the compliance boundary against a spectral mask across offsets. A “good RU” must pass all three at the same operating point, not one-at-a-time.
Proof hook: record EVM/ACLR/SEM at fixed Pout + bandwidth; attach spur map and RU logs (dpd_state, sync_state, temperature_bin). Next step: repeat across temperature bins.
Q2 Why can ACLR look great while EVM is still poor at the same output power?
ACLR can be dominated by out-of-band regrowth, while EVM is often dominated by in-band impairments: integrated jitter/phase noise (CPE), IQ imbalance, residual timing skew, or supply/bias modulation that rotates constellation points without dumping much energy into adjacent channels. Start with clock/LO and converter sampling integrity before “tuning PA linearity.”
Proof hook: correlate EVM vs integrated jitter and spur map; compare DPD on/off EVM delta; check pa_v/pa_i ripple signatures. Next step: isolate clock branch (LO vs DAC clock) by A/B reference changes.
Q3 Where should the DPD observation receiver tap be placed to avoid “seeing the wrong world”?
The observation path must capture the same nonlinear behavior the antenna sees, but with stable coupling and enough headroom. A common RU practice is a directional coupler near the PA output (before the antenna interface’s worst mismatch dynamics), plus calibrated attenuation so ORx never clips and never rides the noise floor. Placement must minimize temperature-dependent coupling drift.
Proof hook: ORx headroom histogram (no clipping, adequate SNR) + stable ACLR improvement vs Pout/temperature. Next step: move from “works in lab” to bin-by-bin validation using ORx level and DPD convergence time.
Q4 If DPD fails to converge in the field, check temperature drift first or clock/alignment first?
Decide using a fast split: if ORx health is bad (clipping or too low SNR), fix the observation path first. If ORx looks healthy but convergence collapses only in certain bins, suspect alignment drift driven by clock/jitter or temperature-dependent delay. Temperature matters, but clock/alignment is usually the “binary gate” that decides whether adaptation is even meaningful.
Proof hook: DPD on/off ACLR gain, convergence time, ORx headroom flags, and EVM vs jitter correlation. Next step: lock reference and rerun a controlled temperature step to see if failures follow bins or follow clock state.
Q5 How does CFR change DPD margin and PA thermal behavior?
CFR reduces PAPR, which can lower peak stress and stabilize thermal load, but it also changes the waveform statistics that DPD learned. Aggressive clipping can increase in-band distortion (hurting EVM) even if ACLR stays acceptable. The best RU practice treats CFR and DPD as a paired configuration: clamp PAPR, then retrain/realign DPD for the new distribution and thermal point.
Proof hook: compare PAPR, EVM/ACLR, PA current, and temperature across “CFR level × DPD on/off.” Next step: choose the smallest CFR that meets peak constraints while preserving EVM at the hottest bin.
Q6 Why can PA bias/supply noise show up directly as EVM degradation?
Bias and rail noise modulate PA gain and phase (AM-AM and AM-PM), effectively imposing amplitude/phase noise onto the transmitted vector. Even if adjacent leakage is controlled, constellation points can smear or rotate, raising EVM. The risk is highest when rail ripple or bias DAC noise lands inside the modulation bandwidth or beats with LO/converter clocks, creating deterministic spurs.
Proof hook: look for EVM movement that tracks pa_v/pa_i ripple and spur lines at ripple/switching frequencies. Next step: improve rail isolation/decoupling and verify EVM recovers without changing RF chain settings.
Q7 Under strong blockers, how to tell whether LNA, mixer, or ADC saturates first in the RX chain?
LNA-first overload often shows early gain compression and NF jump; AGC codes push harder but RSSI/quality stops improving. Mixer-first overload typically produces intermod products that rise rapidly with blocker power even when ADC headroom seems available. ADC-first overload shows obvious clipping signatures (flat tops, sudden noise-floor lift, digital clip/overrange flags) and sharp BER/EVM collapse.
Proof hook: run a blocker sweep while logging AGC state, overload/clip flags, and FFT snapshots. Next step: apply attenuation or stage-by-stage gain changes to see which evidence moves—then set RU protection thresholds accordingly.
Q8 JESD204B/C can link up yet still show BER or latency drift—what are the three most common root causes?
First: SYSREF/LMFC alignment margin is too tight, so capture occurs near the edge and becomes temperature/jitter sensitive. Second: clock/power noise collapses SerDes margin, causing intermittent lane errors without an obvious “down” event. Third: SI boundary (routing, connectors) triggers only under thermal or supply stress. Link-up is not the same as deterministic behavior.
Proof hook: lane error counters + bring-up state, SYSREF window checks, and a temperature sweep to find clustered burst bins. Next step: widen the deterministic window by stabilizing SYSREF distribution and cleaning the JESD clock domain.
Q9 In an O-RU clock tree, which jitter budget is the most “expensive” and should be allocated first?
The most expensive budget is the one that directly turns into EVM/spurs: the LO/PLL path and the converter sampling clock. SerDes/Ethernet clocks are critical for BER and link stability, but they typically tolerate a different jitter profile than LO/converters. Budget should start from KPI sensitivity: LO phase noise → spurs/EVM, converter jitter → SNR/EVM, SerDes jitter → eye/BER.
Proof hook: measure integrated jitter per branch and correlate with EVM, spur map, JESD errors, and Eth CRC bursts. Next step: prioritize cleaning the branch with the strongest KPI correlation, not the easiest-to-measure node.
Q10 When PTP/SyncE is lost, how should RU holdover degrade without causing false alarms?
RU holdover should be stateful: Locked → Holdover (bounded drift, clear alarm) → Free-run (degraded service). The key is hysteresis and explicit evidence so the RU does not flap between sources or alarm states. A good policy publishes TE (time error), drift bounds, source switch events, and recovery timing; service degradation (e.g., power/carrier reduction) is tied to drift thresholds.
Proof hook: TE curve, sync_state transitions, source switch logs, and lock time under forced source loss/recovery. Next step: verify drift stays within bound at the worst temperature bin and that alarms are consistent with TE thresholds.
Q11 For intermittent eCPRI/Ethernet error bursts, how to localize quickly to MAC vs PCS vs module using counters?
Localize by aligned timestamps and counter tiers. MAC-level CRC bursts suggest frame integrity issues at/above MAC, while PCS indicators (block lock, high-ber events) point to SerDes/PCS margin. FEC corrected/uncorrected bursts often indicate optical/electrical margin erosion even if MAC CRC stays low. Module LOS/DDM trends (temperature, Rx power) help separate module drift from RU refclock/supply coupling.
Proof hook: capture CRC/PCS/FEC counters plus module LOS/DDM in the same time base. Next step: use tap points (MAC, PCS, module) to see where bursts originate, then apply the smallest A/B isolation change (module swap, refclock cleanup, SI constraint).
Q12 If field performance drifts without an immediate link-down, what telemetry/protection states should be checked before deep troubleshooting?
Start with RU evidence that explains “silent degradation”: temperature_bin and thermal derate state, PA telemetry (pa_v/pa_i), reflected power/VSWR events, DPD state (trained/frozen/fault), sync_state (locked/holdover churn), and Eth error-burst counters. Drift often comes from gradual derating, clock state changes, or observation-path margin loss rather than a single hard fault.
Proof hook: compare pre/post drift snapshots of protection_state, temperature_bin, dpd_state, sync_state, and error counters. Next step: route the case into the H2-11 decision tree (RF vs clock/sync vs JESD vs fronthaul) using the dominant evidence token.