Retimer / Redriver for High-Speed Serial Links
← Back to: USB / PCIe / HDMI / MIPI — High-Speed I/O Index
A retimer or redriver extends high-speed links by reclaiming eye margin (loss/ISI/jitter) without changing the protocol—done right, it turns “works on bench” into “stable in real cables and connectors.”
Choose a redriver when linear EQ gain is enough, and choose a retimer when CDR re-timing and jitter transfer control are needed; validate with measurable targets (BER, eye margin, jitter, retrain rate) under a fixed test contract.
One-paragraph Answer: Retimer vs Redriver
A retimer performs clock/data recovery (CDR) and re-timing, which changes jitter transfer and can rebuild eye margin on long, lossy channels—at the cost of new sensitivity to reference clock quality, power integrity, and training transparency. A redriver is a channel booster (typically CTLE/gain) that reshapes the analog channel response without re-timing; it is best when insertion loss dominates and the clock/jitter regime is already healthy.
If the limit is loss/ISI, start with a redriver; if the limit is jitter/clocking or the channel needs re-timing, start with a retimer; if the link is training-sensitive, prioritize transparent behavior and validate margin with defined metrics before scaling reach.
- When to use a retimer: long reach with jitter-sensitive margin, multi-connector/cable channels, or scenarios where re-timing + EQ provides measurable eye opening. (Validate: jitter transfer, reference clock noise, margin metrics.)
- When to use a redriver: loss-dominated channels that need CTLE/gain to recover high-frequency content while keeping architecture simple, low-latency, and lower power. (Validate: frequency response match, noise amplification, symmetry.)
- Common pitfalls: treating clock/power noise as “channel loss,” over-EQ that creates false confidence, or layout/return-path breaks that erase any benefit. (First checks: reference clock, PDN ripple, return continuity.)
- Loss signature: failures correlate with length/connector count; equalization changes margin more than clock changes.
- Jitter signature: margin collapses with reference clock/PDN changes; “good-looking” amplitude still shows timing failures.
- Training sensitivity: link comes up but flaps, retrains, or shows periodic margin collapse—indicating transparency/interaction risk.
Scope Map: What This Page Covers (and What It Does Not)
Coverage focuses on cross-protocol engineering outputs: channel loss budgeting, EQ knobs (CTLE/DFE/pre-emphasis), clock/jitter reasoning (including jitter transfer), training transparency levels, measurable validation metrics (eye/jitter/BER/margin), placement/layout rules, power/thermal guardrails, and bring-up → production gates.
- Templates: loss/jitter budget inputs → required EQ window → margin target placeholders.
- Processes: symptom triage → dominant-limit identification → knob tuning → measurable pass criteria.
- Guardrails: placement patterns, return-path continuity, PDN noise sensitivity, thermal drift and retrain loops.
- No protocol specification clauses, state machines, or compliance step-by-step procedures.
- No deep dives into protocol-specific training details (only transparency concepts and risk modes).
- No vendor part-number comparisons on this page; only selection logic and parameter definitions.
- No “re-explaining” USB/PCIe/HDMI/MIPI fundamentals; those belong to sibling pages.
Use the links below when a design question requires protocol-specific constraints, feature negotiation, or compliance workflows.
- USB Retimer / Redriver — Type-C ecosystems, orientation paths, USB-specific validation contexts.
- PCIe Retimer / Redriver — long-board/backplane assumptions, PCIe-specific training and compliance hooks.
- HDMI Retimer / Redriver — display link behaviors, bridge contexts, cable run considerations.
- MIPI Extender / Re-timing — camera/display link extensions and conversion constraints.
This page provides shared methods, metric definitions, and guardrails. Any protocol-specific mechanism is treated as a sibling-page topic and referenced, not duplicated.
Terminology & Architecture: What Changes in the Link
Terminology is treated as an engineering contract: each term maps to a concrete architectural action and an observable consequence (eye, jitter, BER, retrain behavior). This alignment prevents “metric drift” across budgeting, tuning, and validation steps.
- Does: reshapes the analog channel response (CTLE/gain; sometimes AGC/limiting) to counter insertion-loss dominated attenuation.
- Does NOT: recover clock/data or re-time; input jitter is not “cleaned,” only reshaped indirectly through amplitude/frequency response.
- Observable consequence: eye height may improve while noise may also rise; benefits track with loss signatures more than clock signatures.
- Does: performs CDR and re-drives a reconstructed signal; may include CTLE/DFE and transmitter conditioning.
- Changes: jitter transfer and timing boundary behavior; re-timing can rebuild margin when jitter/clocking becomes the dominant limiter.
- New sensitivities: reference clock quality, power integrity, thermal drift, and training transparency become first-class design constraints.
- Transparent (wire-like): behaves like a passive segment; link negotiation/training requires no special awareness.
- Adaptive (semi-transparent): applies EQ adaptation but stays out of protocol decisions; still expected to be “invisible” to the link.
- Non-transparent boundary: introduces a re-timing boundary that can change training convergence or recovery behavior.
- Protocol-aware: observes or participates in protocol-specific mechanisms (treated as a sibling-page topic; not duplicated here).
- Loss budget produces an EQ window (H2-5/H2-6).
- Clock/jitter reasoning defines what re-timing can and cannot fix (H2-7).
- Training transparency sets the risk envelope for convergence and flaps (H2-8).
- Validation metrics define pass criteria (H2-11).
Channel Reality Check: Confirm the Dominant Limiter First
“Add a retimer” is not a diagnosis. Before inserting any device or tuning EQ, the failure must be classified by the dominant limiter: SI, Clock/Jitter, Power/Thermal, or Firmware/Training. This step prevents expensive changes that hide the real root cause.
Each symptom is treated as a classifier: if correlation follows length/connector count, suspect SI; if correlation follows reference clock or PDN conditions, suspect clock/jitter or power; if correlation follows retrain/recovery patterns, suspect firmware/training.
- Waveform proxy: eye/jitter trend (before/after a controlled change).
- Loss proxy: S-parameter or equivalent insertion/return trend (or credible channel model inputs).
- Error accounting: error rate with a defined denominator (per time / per data volume / per frame count).
Without a denominator and a time window, “error counts” cannot be used to select knobs, compare builds, or gate production.
- SI bucket: go to Loss Budget (H2-5) → EQ Toolkit (H2-6) → Placement/Layout (H2-9).
- Clock/Jitter bucket: go to Clock & Jitter (H2-7).
- Power/Thermal bucket: go to Power/Thermal/Reset (H2-10).
- Firmware/Training bucket: go to Training Transparency (H2-8) + Validation Metrics (H2-11).
Loss Budget & Reach: Write the Channel Math Before Adding Silicon
A loss budget is a decision tool: it identifies what dominates (loss slope, reflections, crosstalk, or mode conversion) and turns the channel into an actionable output: needed EQ window plus margin placeholders that can be measured and gated.
- Insertion loss vs frequency: the loss slope that drives high-frequency attenuation and ISI risk.
- Return loss: reflections that create dips/ripples and timing sensitivity even when amplitude looks acceptable.
- NEXT/FEXT: aggressor-victim coupling that rises with parallelism and sharper edges.
- Mode conversion: differential asymmetry/return-path breaks that convert energy into common-mode (margin + EMI penalty).
- Trace: length ___ / stackup ___ / routing notes ___
- Connectors/adapters: count ___ / type ___ / shielding notes ___
- Cable: length ___ / construction ___ / shield bond ___
- Vias/transitions: count ___ / stubs ___ / back-drill ___
- Crosstalk: parallel length ___ / spacing ___ / aggressor activity ___
- Mode conversion: symmetry risks ___ / return-path breaks ___
- Total channel (trend): loss slope ___ / reflection features ___ / coupling risk ___
- Needed EQ window: CTLE range ___ / DFE need (Y/N) ___ / Tx FIR range ___
- Margin placeholders: loss headroom ≥ X ___ / reflection dip margin ≥ Y ___ / crosstalk delta ≤ Z ___
- Decision note: if EQ window exceeds feasible knobs → revisit topology or consider re-timing boundary.
- A “good-looking” amplitude does not cancel reflection timing sensitivity (return loss still matters).
- An insertion-loss number without a frequency trend cannot predict EQ needs.
- Ignoring crosstalk and mode conversion often produces fragile systems even when a single-lane test passes.
EQ Toolkit: CTLE, DFE, and Tx FIR as a Reproducible Knob Set
Equalization is treated as a parameterized workflow: start from the needed EQ window, change one knob at a time, and validate using defined evidence and a stable denominator. The goal is not a pretty eye snapshot, but a stable system margin.
Counters frequency-dependent insertion loss by restoring high-frequency content; expands the effective eye when loss slope dominates.
Boosts high-frequency noise and crosstalk along with signal; excessive settings can create “pretty” eyes that collapse under real system activity.
- Use controlled reach changes (length/connector count) and confirm margin changes monotonically.
- Stress with aggressor activity (throughput / adjacent switching) and verify error rate stays bounded.
- Record the denominator: per time or per data volume, with a fixed observation window.
Cancels post-cursor ISI that cannot be solved by frequency shaping alone; effective when the channel creates strong deterministic tails.
Wrong decisions feed back into the next symbols; burst noise and non-linear distortion can destabilize adaptation and create rare but catastrophic error bursts.
- Validate long-run stability (minutes to hours) across temperature and supply variations.
- Check for “rare burst” signatures rather than relying on short snapshot improvements.
- Confirm the improvement survives crosstalk stress and does not increase flaps/retrains.
Matches the transmitted spectrum to the channel response; can reduce deterministic ISI when aligned with the loss profile.
Over-emphasis can worsen EMI/crosstalk and over-stress the receiver; combined knobs can “fight” and create fragile settings.
- Use A/B settings inside the same budget window; log error rate with identical observation windows.
- Check aggressor sensitivity and EMI risk indicators; do not optimize a single-lane view only.
- Confirm the setting survives temperature and supply drift without creating flaps.
- Redriver-heavy: analog gain/CTLE, optional limiting/AGC — primary risk is noise amplification and false confidence.
- Retimer-heavy: CDR + adaptive EQ/DFE + rebuilt Tx — primary risks shift to ref clock, PDN, thermal drift, and transparency envelope.
- Start from the needed EQ window (H2-5) and pick a baseline with minimal knobs enabled.
- Change one knob family at a time (CTLE → Tx FIR → DFE) and log the exact setting.
- Validate with a stable denominator and a fixed observation window (time/data volume).
- Stop and re-check bucketization if improvements are non-monotonic or produce new flaps.
Clock & Jitter: When Retiming Cleans Up — and When It Bites Back
Retiming can improve link margin by reshaping the jitter spectrum, but it is not a universal “jitter eraser.” The output can become dominated by reference quality, power noise coupling, and thermal drift. This section defines practical jitter accounting, the meaning of jitter transfer, and a measurement approach that avoids false confidence.
- Bandwidth: defines what the CDR tracks vs what it rejects/cleans up.
- Peaking: certain regions can be amplified, creating “clean in one view, fragile in another” outcomes.
- Low-frequency wander: slow drift can be tracked into the output; retiming does not guarantee removal of slow variations.
- Practical implication: margin risk can shift from channel loss to reference/PDN/thermal stability.
- “Retimer = jitter cleaner” → retimer reshapes the jitter spectrum; some regions are tracked, some are cleaned, some can peak.
- “Bigger eye = stable system” → stability requires long-run evidence under stress (temperature, throughput, PDN variation).
- “More aggressive device fixes everything” → reference, PDN, and thermal gating often determine the outcome.
- Fix the denominator: error counts per time or per data volume; keep the observation window constant.
- Fix the trigger: document throughput state, temperature state, and PDN mode.
- Use two evidence types: waveform evidence (eye/jitter) + long-run error evidence (BER/CRC/counters).
- Run A/B perturbations: change reach, change reference, and inject PDN stress; confirm conclusions remain consistent.
Training Transparency: Pick the Right “Visibility Level” for Link Training
“Training transparency” becomes actionable when expressed as levels. The ladder below turns marketing words into a decision structure: prefer the lowest transparency impact that meets reach, then escalate only when the budgeted EQ window cannot be met safely.
- Visibility: appears as a passive path.
- Control: minimal configuration.
- Stability: best default when reach permits.
- Visibility: mostly wire-like, with adaptive shaping.
- Control: profiles and limits may be required.
- Stability: drift sensitivity rises with environment changes.
- Visibility: re-timing boundary can change training behavior.
- Control: mode management and fallback become important.
- Stability: reference/PDN/thermal gating is critical.
- Visibility: higher “presence” in training and recovery behavior.
- Control: integration complexity rises (configuration, observability, rollback).
- Stability: failure modes can cascade into retrain storms.
- Default: prefer the lowest transparency impact that meets reach.
- Escalate when: the needed EQ window exceeds feasible knob ranges and non-channel buckets (reference/PDN/thermal) are already gated.
- De-risk: keep bypass/fallback paths and a locked profile strategy for recovery.
- Symptom: repeated link attempts or intermittent “works after reboot.”
- Mechanism: adaptation never finds a stable operating point under current channel noise.
- First check: confirm the channel budget window is feasible and triggers are controlled (temperature/PDN/throughput).
- Symptom: stable for minutes/hours, then margin collapses with temperature or load.
- Mechanism: adaptive EQ moves with environment, pushing the system to a cliff edge.
- First check: long-run evidence with fixed windows; verify reference/PDN gating for re-timing levels.
- Symptom: repeated re-training events that amplify instability.
- Mechanism: recovery thresholds are crossed too easily; each retrain becomes a disturbance.
- First check: observe correlation with environmental triggers and verify conservative default transparency level.
- Symptom: small margin dips cause large resets or link drops.
- Mechanism: thresholds and timing windows are misaligned with real system variability.
- First check: lock observation windows and validate under controlled perturbations (reach / PDN / temperature).
Placement & Layout: Where to Put It, How to Route It (Avoid “Worse After Adding”)
Most field failures come from wrong placement and broken return paths, not from missing “more silicon.” This section turns placement and routing into repeatable rules: choose a placement pattern based on loss distribution, then protect the invariants that keep a high-speed channel predictable.
- Use when the first segment must keep edges clean before a long lossy path.
- Prefer when the downstream is the dominant loss contributor.
- Guard the source-side return path and the first connector transition.
- Use when the connector/cable segment dominates uncertainty and loss.
- Prefer when shielding and chassis return quality determine margin.
- Treat connector transitions as a controlled discontinuity (short + symmetric).
- Use to split a long channel into two feasible segments (each within a safe EQ window).
- Prefer when loss and reflections are distributed and cumulative.
- Keep both sides locally well-controlled (no hidden stubs or plane breaks).
- Reference clock (retimer): keep on a continuous reference plane and away from aggressors; avoid plane splits.
- Decoupling topology: minimize loop area; use multi-band coverage; prevent high-speed return currents from detouring through supply paths.
- Ground bounce: ensure the physical return path is short and continuous; avoid forcing return to jump via narrow necks or gaps.
- What changed: device inserted near connector + vias + layer swaps.
- Why worse: discontinuities stacked; reflections and mode conversion rose.
- Fast fix: move away from the transition cluster; shorten the connector transition and restore plane continuity.
- What changed: differential pair crossed a slot/split.
- Why worse: return detoured; common-mode rose; jitter/EMI sensitivity increased.
- Fast fix: reroute on a continuous plane or provide a real return bridge at the crossing.
- What changed: extra via length or unused pad created a branch.
- Why worse: reflection timing moved into the sampling margin window.
- Fast fix: remove stubs; reduce via count; keep transitions symmetric and short.
Power / Thermal / Reset: Hidden Killers Behind “Drops After 5 Minutes”
High-speed retiming often fails late: after temperature rises, after PDN noise shifts, or when strap/config differs across builds. This section provides a bring-up gate that prevents “short tests look fine” from becoming field instability.
- Power density changes: equalization/DFE activity can raise dissipation under stress.
- Heat path matters: package-to-copper vias, copper spread, and airflow determine steady-state temperature.
- Drift mechanism: temperature shifts can move EQ behavior and reduce margin, triggering retraining or flaps.
- Test implication: include a steady-state window; do not conclude from short cold runs.
- Sensitive blocks: PLL/CDR and reference domains can translate rail noise into timing uncertainty.
- Noise types: ripple, transient droop, and ground bounce can change output jitter and error behavior.
- Decoupling rule: keep loops small and multi-band; prevent return currents from detouring through supply paths.
- Validation: correlate error bursts with PDN events and thermal ramps; use controlled stress steps.
- Power-up order: define rail sequencing and PG dependencies; ensure reset release is deterministic.
- Strap sampling: ensure strap states are stable at sample time; avoid floating or marginal pull strength.
- Config consistency: keep a configuration fingerprint (version/profile ID) and verify it at bring-up.
Validation Metrics: “Eye-Opening” Numbers That Define Pass/Fail
This section turns “looks fine on the scope” into repeatable numbers. Every metric below uses the same measurement contract (state, time window, pattern, clocking, environment) so results remain comparable across builds and across A/B (before/after) changes.
- Test state: steady-state after training vs during training (do not mix).
- Time / bit window: short window for bursts + long window for drift (declare both when used).
- Pattern / payload: PRBS vs real traffic; declare payload mix and repetition.
- Clocking condition: SSC on/off + reference source condition (declare).
- Environment: temperature window + PDN state (nominal/stress) (declare).
- A/B comparability: “before” and “after” must reuse the same contract items above.
- BERT / pattern: Keysight M8040A, Anritsu MP1900A.
- Eye / sampling: Keysight 86100D (DCA-X), Tektronix DSA8300.
- Active probing: Tektronix P7720, Keysight N2795A.
- Jitter / phase noise: Keysight E5052B.
- Low-noise rails (for controlled PDN): ADI LT3042, TI TPS7A4701.
- Inline attenuation (fixture sanity): Mini-Circuits VAT-10+, VAT-20+.
- Height / width: eye opening at the chosen sampling phase under the declared contract.
- Mask margin: margin relative to a fixed mask (same mask + same setup every time).
- Bathtub: time-margin view derived from statistical error probability vs sampling offset.
- Freeze the contract: SSC on/off, pattern/payload, temperature window, PDN state, and observation time.
- Separate “steady-state” captures from “training / retrain” captures; report them independently.
- Use consistent bandwidth, probe/fixture, and trigger settings to avoid measurement-system drift.
- Example PNs: Keysight 86100D, Tektronix DSA8300; probes Tektronix P7720, Keysight N2795A.
- Single screenshot bias: good-looking snapshots hide rare bursts that dominate BER.
- Comparing different fixtures/probes as if they were the same channel.
- Capturing before thermal steady-state and concluding “pass.”
- TJ: total timing uncertainty under a declared probability/BER condition.
- RJ / DJ: random vs deterministic components (model-dependent; do not compare across different models).
- SSC-aware: SSC on/off must be declared, since low-frequency wander can change training behavior.
- Use identical filters/bandwidth and identical reference conditions for A/B comparisons.
- Measure at well-defined points (before device / after device) and label the point on every plot.
- Correlate jitter excursions with PDN events and temperature ramps to avoid mis-attribution.
- Example PN: Keysight E5052B (signal source analyzer for phase noise / jitter studies).
- Comparing TJ numbers produced by different analysis settings as if they were equivalent.
- Ignoring SSC condition and concluding “CDR clean-up” works universally.
- Treating supply-induced phase noise as “channel loss.”
- BER: errors / total bits under a declared observation window and pattern.
- Window-bound: “0 errors” must be reported as an upper bound based on observed bits.
- Pattern-bound: PRBS vs real traffic can produce different stress on equalizers and recovery.
- Declare pattern/payload, and run long enough to separate burst faults from drift faults.
- Report the denominator explicitly (bits or time × throughput), not just “error count.”
- Log retrain / recovery events next to BER to prevent silent “recovery hides errors.”
- Example PNs: Keysight M8040A, Anritsu MP1900A.
- Short-window “no error” interpreted as pass, with no upper-bound statement.
- Changing payload mix between A/B runs and calling it device “gain.”
- Using protocol counters without a consistent denominator (throughput changes distort the rate).
- Margin: headroom against controlled stress (loss/noise/temperature) under the same contract.
- A/B: baseline (before) vs change (after) using identical setup and logging.
- Run the same contract and record Eye + Jitter + BER in parallel (single report package).
- Stress one axis at a time (loss / PDN / temperature) to keep root-cause readable.
- Track retrain count and recovery behavior during the same window.
- Example PNs (common set): Keysight 86100D, M8040A; Mini-Circuits VAT-10+; LDOs LT3042, TPS7A4701.
- A/B runs use different fixtures or different clock sources and produce fake “gain.”
- Comparing a short cold run to a long hot run and calling it a device effect.
- Ignoring retrain count; “stable” must include stability of recovery behavior.
- Declare: SSC (on/off), pattern P, window Y, thermal window, PDN state.
- Log: retrain/recovery count, error counters, throughput (denominator).
- Report: Eye (X mV / Y ps), Jitter (X/Y/Z ps), BER (< X over Y), Margin (ΔX dB) with placeholders.
- Use stable rails for characterization: ADI LT3042, TI TPS7A4701 (or equivalents) to isolate “channel vs PDN.”
Recommended topics you might also need
Request a Quote
FAQs: Field Triage for Retimers & Redrivers (Metric-Driven)
Scope: long-reach stability, EQ, jitter/clock sensitivity, training transparency, placement/PDN/thermal interactions. Each answer is actionable and measurable (placeholders X/Y/Z) without protocol-specific compliance details.
Added a redriver and BER got worse—first EQ sanity check or return-path issue?
Likely cause: CTLE/GAIN overshoots and amplifies noise, or differential symmetry/return path is broken near the insertion point.
Quick check: Freeze pattern P + window Y; compare A/B at the same measurement point; verify continuous reference plane and no nearby stubs/plane gaps.
Fix: Back off CTLE/GAIN by X steps (or flatten EQ), restore return path continuity, and re-run A/B with the same contract.
Pass criteria: BER < X over Y window (pattern P), eye margin +X% (same point), retrain ≤ X per Y.
Retimer works on bench, fails on long cable—is ref clock noise or channel loss distribution?
Likely cause: The retimer CDR is stressed by reference clock/PDN noise, or loss is concentrated after the retimer (wrong segmentation).
Quick check: Compare “short vs long” while keeping clock/PDN identical; measure loss split (pre vs post) and log retrain count over window Y.
Fix: Move the retimer to rebalance loss (split-loss rule), or improve ref clock/rail noise (e.g., add low-noise LDO stage such as LT3042 or TPS7A4701).
Pass criteria: Retrain ≤ X per Y, BER < X over Y, supply ripple ≤ X mVpp under load step L, eye margin +X%.
Link trains but flaps every few minutes—retrain storm vs thermal drift?
Likely cause: Slow parameter drift (temperature/PDN) pushes EQ/DFE out of margin, triggering periodic retrain/recovery.
Quick check: Log temperature and PDN ripple alongside retrain counters over window Y; correlate “flap timestamp” with ΔT/Δripple.
Fix: Improve thermal path (copper/airflow) and stabilize rails (decoupling + low-noise regulation); reduce aggressive adaptive settings by X steps if stability improves.
Pass criteria: Flap count = 0 over Y (steady-state), retrain ≤ X per Y, ΔT ≤ X°C, ripple ≤ X mVpp (same load).
Orientation flip changes stability (Type-C/MUX path)—Δloss/Δskew or symmetry break?
Likely cause: The flipped path introduces extra loss, skew, or imbalance (ΔCdiff/Δtrace) that reduces margin in one direction.
Quick check: Measure A/B (orientation A vs B) at the same points; compare Δloss and Δskew, and check for asymmetry in via/connector transitions.
Fix: Re-balance routing and component placement to restore symmetry; adjust EQ to cover worst-case orientation (no per-orientation surprises).
Pass criteria: Δloss ≤ X dB and Δskew ≤ Y ps between orientations; BER < X over Y (pattern P); retrain ≤ X per Y.
CRC spikes only under load—DFE convergence or buffer/latency side effect?
Likely cause: Under load, adaptive EQ/DFE becomes marginal, or system buffering/latency creates bursty stress (power/thermal/traffic coupling).
Quick check: Correlate error spikes with load steps, temperature slope, and PDN ripple; compare “fixed EQ” vs “adaptive EQ” behavior for window Y.
Fix: Stabilize PDN during load (decoupling + regulator margin) and limit overly aggressive adaptation; validate with the same pattern P.
Pass criteria: Error burst rate ≤ X per Y under load L, BER < X over Y, ripple ≤ X mVpp, ΔT slope ≤ X °C/min.
Passes eye mask, still drops in system—measurement point mismatch or common-mode issue?
Likely cause: The measurement topology/point is not representative of the real receiver, or common-mode disturbance dominates in the system state.
Quick check: Move the observation point closer to the actual receiver-side segment; log drops vs PDN ripple and retrain count over window Y.
Fix: Align fixture and measurement point to the system path; mitigate common-mode coupling (return path/shield bonding) and stabilize PDN during load steps.
Pass criteria: Drop count = 0 over Y (real traffic), ripple ≤ X mVpp under load L, retrain ≤ X per Y, eye margin +X% at point M.
Changing TVS/CMC improved EMC but broke link—added Cdiff/imbalance vs return loss?
Likely cause: Protection components added capacitance mismatch or conversion (imbalance), or degraded return loss near the connector.
Quick check: A/B swap the protection network and compare eye margin + BER with identical settings; inspect symmetry and placement distance to the connector.
Fix: Select lower-C, better-matched arrays (tighter ΔCdiff) and place them for minimal stub; re-tune EQ only after symmetry is restored.
Pass criteria: Eye height +X mV and width +Y ps (same point), BER < X over Y, Δmargin between lanes ≤ X% (symmetry).
Two vendors, same footprint, different margin—package parasitics or jitter transfer differences?
Likely cause: Different package/ESD structures change parasitics, or CDR/jitter transfer behavior differs under the same clock/PDN.
Quick check: Run identical contract (pattern P, SSC, window Y) and compare output jitter + BER + retrain; keep the same fixture and measurement point.
Fix: Re-optimize EQ for each vendor if required, or tighten ref clock/PDN noise so CDR behavior converges; document vendor-specific settings as controlled parameters.
Pass criteria: TJ ≤ X ps (same analysis), BER < X over Y, margin Δ between vendors ≤ X% (same setup), retrain ≤ X per Y.
Only fails at cold/hot—CDR bandwidth shift or EQ drift?
Likely cause: Temperature changes alter analog behavior (CDR tracking, EQ coefficients, loss/impedance) and shrink the stability window.
Quick check: Sweep temperature and log BER + retrain + PDN ripple; compare “fixed EQ” vs “adaptive EQ” stability across the same window Y.
Fix: Add thermal margin (heatsinking/airflow) and de-risk adaptation (limit knob range by X); validate at both extremes with the same contract.
Pass criteria: BER < X over Y at Tlow..Thigh, retrain ≤ X per Y, margin ≥ X% at both extremes (same point M).
Why does a retimer sometimes break training—transparency level mismatch?
Likely cause: The chosen transparency level changes timing/behavior in a way the endpoints do not tolerate (unexpected adaptation or reset coupling).
Quick check: Compare “bypass / most-transparent mode / configured mode” and record training success rate and retrain count over Y; keep clocks and PDN identical.
Fix: Default to the most transparent mode that meets reach; only enable stronger adaptation when margin demands it, and document the required configuration as a controlled parameter.
Pass criteria: Training success ≥ X% over Y attempts, retrain ≤ X per Y, post-train BER < X over Y (pattern P).
Where to place a retimer on a multi-connector path—split-loss rule of thumb?
Likely cause: Retimer placement leaves one segment too lossy or too reflection-heavy, so one side remains near the edge even after retiming.
Quick check: Estimate loss per segment (trace/connector/cable) and identify the worst segment; run A/B moving the retimer location across candidate split points.
Fix: Place the retimer so pre- and post-loss are balanced within X dB; keep return path clean at both device-adjacent transitions.
Pass criteria: Segment loss imbalance ≤ X dB, eye margin ≥ X% at both sides (same point type), BER < X over Y, retrain ≤ X per Y.
How to define pass criteria without full compliance gear—proxy metrics & correlation plan?
Likely cause: The team lacks end-to-end visibility, so “pass” becomes subjective and changes with setups and fixtures.
Quick check: Build a proxy pack: BER upper bound over Y, retrain count, relative eye margin Δ% at point M, and PDN ripple under load L; repeat across temperature.
Fix: Correlate proxies to field outcomes (drop rate) and lock the measurement contract; use consistent fixtures and measurement points for every build.
Pass criteria: Drop count = 0 over Y (real traffic), BER ≤ X upper bound, retrain ≤ X per Y, eye Δmargin ≥ X%, ripple ≤ X mVpp.