123 Main Street, New York, NY 10001

USB Redriver / Retimer for Type-C: Selection & Bring-Up

← Back to: USB / PCIe / HDMI / MIPI — High-Speed I/O Index

Core idea

This page turns “USB links that barely work” into a measurable, repeatable design: model the channel first, fix physical discontinuities before EQ, then tune a redriver (CTLE/gain/limit) or choose a retimer (CDR/DFE) based on evidence. The goal is not a parts list, but a workflow that reliably passes the same cable × orientation × temperature × voltage × rate matrix with documented margins.

H2-1 · What Redrivers & Retimers Solve in USB Links

Intent

Convert component jargon into problem types: channel loss/reflect/crosstalk/jitter → eye margin collapse → training failures, downshift, or intermittent drops. The goal is fast triage: whether conditioning is needed, which class fits (redriver vs retimer), and what evidence must be collected first.

When it is typically needed
  • Long cables, docks, or connector-to-connector paths with multiple discontinuities.
  • Board-to-Type-C connector runs with tight routing constraints and limited loss headroom.
  • Orientation-dependent stability (one flip works, the other fails) indicating asymmetric channel paths.
  • High PVT sensitivity: stable at room temperature but fails hot / at low voltage.
When adding silicon is usually the wrong first move
  • Reflection-dominated channels: large return-loss spikes from stubs, impedance steps, or connector problems should be fixed physically first.
  • Firmware/role behavior issues: problems that follow software states belong to the Controller page scope, not signal conditioning.
  • Port robustness events: ESD/TVS or VBUS-side behaviors belong to the Port Protection page scope (data-path notes only here).
Symptom map (behavior-level, protocol-agnostic)
Passes initially, drops under load
Margin is being consumed by temperature rise, supply noise coupling, or cumulative timing uncertainty.
Only certain cables or one orientation fails
Channel path asymmetry or mapping mismatch is likely; the first task is to isolate dependence by matrix testing.
Stable after downshift, unstable at max rate
A classic sign of insufficient headroom; focus on loss/ISI vs jitter dominance before tuning.
Cold OK, hot fails; nominal OK, low-V fails
PVT sensitivity suggests a marginal sampling window; the solution is often cleaner timing (retimer) or reduced channel severity.
Redriver vs Retimer (engineering definition)
Redriver
Analog conditioning (typically CTLE, sometimes limiting/boost). It reshapes amplitude and frequency response but does not regenerate a new sampling clock.
Retimer
Re-timing with CDR and often DFE. It can remove accumulated timing uncertainty by re-sampling and forwarding a cleaner, newly timed stream.
First evidence to collect (minimum viable, tuning-safe)
  • Matrix reproducibility: direction × cable × temperature × voltage. Identify whether failures track one axis (orientation/cable) or multiple axes (PVT).
  • Correlation signals: do errors correlate with thermal ramp, supply droop, or activity bursts? (distinguish SI headroom vs power/thermal coupling).
  • Channel dominance: swapping the channel (different port / shorter interconnect) should change behavior if the channel is the limiting factor.
  • Reflection sanity check: if large discontinuities exist, physical fixes usually outperform EQ tweaks.
Pass criteria (quantified, production-friendly)
  • Stability: continuous operation over the defined matrix for ≥ Y minutes with error rate ≤ X.
  • Orientation consistency: A/B flip margin delta stays within X (relative) across cable variants.
  • Repeatability: tuning/config is lockable and reproduces across builds (same recipe, same outcome).
  • Side-effects controlled: no unacceptable EMI/thermal regressions versus baseline (threshold X).
Diagram: impairments → conditioning options → outcomes (parameter-free, concept-first)
Channel impairments Loss Reflection Crosstalk Jitter Signal conditioning Redriver CTLE Gain / Limit Retimer CDR DFE Outcomes Eye margin ↑ BER risk ↓ Stability ↑ Medium improvement Stronger improvement

H2-2 · Redriver vs Retimer: Decision Tree (Don’t Guess)

Intent

Provide an executable selection engine. Inputs are observable facts (channel severity, PVT sensitivity, orientation complexity, tuning repeatability needs). Outputs are exactly one of three actions: Redriver, Retimer, or Re-architect.

Decision table (fast filter)
Channel severity
Multiple connector breaks, long cable segments, or multi-hop paths typically exceed redriver headroom → Retimer or Re-architect.
Reflection dominance
If return-loss spikes dominate, EQ rarely fixes the root cause. Fix topology/impedance first → Re-architect.
PVT sensitivity
Hot/low-V failures imply marginal sampling window; re-timing often restores timing margin → Retimer.
Orientation / mapping complexity
Strong A/B orientation dependence suggests asymmetry and mapping risk. Favor solutions with robust orientation handling → Retimer (or managed redriver).
Repeatable tuning & diagnostics
If production needs a locked recipe and measurable margins, prefer devices with stable configuration and observability → Retimer (or managed redriver).
Power/thermal budget
Tight thermal headroom can make retimers harder to sustain. If channel is not extreme, start with Redriver or simplify the channel.
Decision tree (algorithm, ordered questions)
  1. Reflection-dominated? If yes, correct stubs/impedance breaks first → Re-architect.
  2. Channel severity beyond redriver headroom? Multi-segment + long cable + limited margin → Retimer.
  3. Strong PVT sensitivity? Hot/low-V failures suggest timing window collapse → Retimer.
  4. Orientation/cable dependent? If yes, validate lane map symmetry; prioritize robust orientation handling → Retimer (or managed redriver).
  5. Need production-repeatable tuning/telemetry? If yes, prefer lockable configuration and measurable margin hooks → Retimer. Otherwise → Redriver.
“Re-architect” means concrete channel fixes (not a vague suggestion)
  • Reduce discontinuities: fewer connectors, fewer stubs, cleaner transitions.
  • Shorten the worst segment: move the conditioning device closer to the cable/connector entry.
  • Improve symmetry: make A/B orientation paths as equal as possible before tuning.
  • Fix reflections first: EQ is most effective after the channel is physically sane.
Pass criteria (selection quality)
  • 3-minute decision: the input facts can be answered quickly (yes/no or small integers) without deep protocol parsing.
  • Unique output: the tree produces one action (Redriver / Retimer / Re-architect) plus the next engineering step.
  • Evidence-driven: changing the channel axes (orientation/cable/PVT) changes the decision in a predictable way.
Diagram: decision tree with three outputs (redriver / retimer / re-architect)
Reflection-dominated? Re-architect Fix stubs / breaks Channel severity multi-segment / long PVT-sensitive? hot / low-V fails Orientation-dependent? A/B differs Redriver CTLE tuning Retimer CDR / DFE YES NO YES NO YES NO YES NO Outputs: Redriver (CTLE) · Retimer (CDR/DFE) · Re-architect (fix reflections / simplify channel)

H2-3 · Channel Model & Loss Budget (Only What You Must Measure)

Intent

Establish a measurement-first evidence model. Without a channel budget (loss vs reflection vs crosstalk), any CTLE/DFE discussion becomes guesswork. This section defines what must be measured, what each metric answers, and what the next engineering action should be.

What to measure (minimum viable set)
  • IL (Insertion Loss): frequency-dependent attenuation trend (loss/ISI severity indicator).
  • RL (Return Loss): reflection dominance and discontinuity suspicion (topology sanity gate).
  • NEXT/FEXT: near/far-end coupling trend (layout dominance indicator).
  • Mode conversion (optional): diff↔cm tendency (return-path continuity / imbalance clue).
  • TDR (optional but powerful): locate discontinuities by distance (connector/transition pinpoint).
What it tells you (diagnosis translation)
IL rises (loss-dominated)
Channel headroom is limited by attenuation/ISI; equalization can help if reflections are controlled.
RL poor (reflection-dominated)
Discontinuities are shaping the waveform; physical fixes often outperform stronger EQ.
NEXT/FEXT high (coupling-dominated)
The channel is noise-injection limited; layout and return-path continuity are first-order fixes.
Mode conversion notable (imbalance)
Differential energy is leaking into common-mode; imbalance/return path needs attention before tuning.
What to do next (action mapping)
  • IL-dominated: consider CTLE (redriver) first; if multi-segment or PVT-sensitive margin remains, move to retimer (CDR/DFE).
  • RL-dominated: prioritize re-architect (remove stubs, fix impedance breaks, reduce connector discontinuities). Avoid “stronger EQ” as a substitute.
  • Crosstalk-dominated: fix layout/return path first (spacing, reference continuity, isolation from noisy bundles), then re-measure.
  • Imbalance/mode conversion: address symmetry and return-path continuity before expecting stable tuning results.
Gate rule (repeatable across chapters)
Fix physics before EQ: reflections and coupling must be contained before CTLE/DFE tuning is expected to be stable.
Frequency vs time domain (what each tool answers)
VNA (frequency view)
Best for trend budgets (IL/RL/crosstalk vs frequency). It answers “how severe” the channel is, but not “where” the discontinuity is.
TDR (location view)
Best for pinpointing impedance steps by distance. It answers “where to fix” topology/connector/transition problems.
Scope / eye (result view)
Best for quick bring-up validation. Treat it as a result indicator; measurement points and fixtures can hide root causes.
Pass criteria (measurement completeness)
  • A copyable measurement checklist exists (IL, RL, crosstalk, optional mode conversion, optional TDR) and is executed with consistent setup.
  • Each measured result maps to a single next action (CTLE/DFE or re-architect or layout fix) without ambiguity.
  • Re-measurement uses the same method so “improvement” is comparable, not an artifact of changing fixtures or reference planes.
Diagram: measurement → diagnosis → action mapping (VNA/TDR/Scope → IL/RL/XT → CTLE/DFE/Re-architect)
Measure Diagnose Act VNA trend budget TDR locate breaks Scope result view IL RL NEXT / FEXT Mode convert CTLE redriver CDR / DFE retimer Re-architect fix breaks Gate: Fix reflections & coupling before expecting EQ tuning to be stable

H2-4 · CTLE & Gain Staging (Redriver Tuning Without Guesswork)

Intent

Turn CTLE/gain/limiting knobs into a repeatable SOP. The objective is stable operation across cable/orientation/PVT, not a “tall-looking” eye that hides worse jitter, crosstalk, or EMI side effects.

Preconditions (do not tune blind)
  • Reflection gate passed: major discontinuities are addressed; tuning is not a substitute for fixing stubs and breaks.
  • Crosstalk gate contained: coupling is not the dominant limiter; otherwise layout fixes come first.
  • Matrix exists: orientation × cable set × temperature × voltage is defined for repeatable validation.
  • One-variable discipline: only one knob changes per step so the outcome can be attributed.
CTLE: what it changes (and what it can worsen)
  • Primary effect: increases high-frequency weight to counter loss/ISI.
  • Typical win: improved edge clarity and eye margin when IL dominates.
  • Typical risk: amplifies high-frequency noise and coupling, increasing jitter-like behavior and EMI risk.
  • Key rule: CTLE works best after reflections and coupling are contained.
Gain staging & limiting (avoid overdrive)
  • Too low gain: insufficient compensation; stability remains marginal.
  • Too high gain: overdrive can worsen EMI and coupling, and can stress receiver front-end linearity.
  • Limiter (if available): can control amplitude extremes but may introduce waveform distortion if misused.
  • Priority: stable bring-up across the matrix comes before maximizing amplitude.
Common tuning failures (symptom → cause → correction)
“Eye looks taller” but stability worsens
Over-compensation amplifies noise/crosstalk; reduce CTLE or gain and re-check side effects.
Single-ended observation misleads
Differential symmetry is ignored; verify pair balance and treat tuning as a differential problem.
Orientation A works, B fails
Path asymmetry is exposed by Type-C flip; validate lane map and evaluate per-orientation margins.
No rollback rule, no repeatability
Multiple knobs change at once; enforce one-variable steps and a “last stable config” fallback.
Tuning SOP (conservative → stronger, with rollback)
Step 0 · Baseline
Lock the baseline config and record results across the validation matrix (orientation/cable/PVT) for comparison.
Step 1 · Conservative CTLE
Apply the minimum CTLE setting with default gain. Verify stability improves before increasing strength.
Step 2 · CTLE ladder search
Increase CTLE one step at a time. Only one knob changes per step. Stop when stability stops improving.
Step 3 · Gain / limiter fine-tune
Around the best CTLE point, adjust gain/limiting conservatively to avoid overdrive and side effects.
Rollback rules (always defined)
  • If stability regresses, revert to the last stable CTLE step.
  • If EMI/thermal side effects increase noticeably, reduce gain or step back CTLE.
  • If A/B orientation delta grows, prioritize symmetry fixes or validate per-orientation settings if supported.
Knob map (safe adjustments, not a wide table)
CTLE index
Increase to compensate IL. Stop when stability stops improving; stronger is not always better.
Gain
Use the minimum gain that meets stability. Excess gain amplifies noise and coupling, raising EMI risk.
Limiter (if present)
Use for amplitude extremes control. If distortion-like symptoms appear, disable and reduce gain/CTLE instead.
Orientation validation
Validate A/B as first-class axes. A setting that “wins” in one orientation may fail in the other.
Pass criteria (tuning success)
  • Stable operation across the defined matrix (cable/orientation/PVT) with error rate ≤ X over ≥ Y minutes.
  • Orientation delta stays within X (relative) after tuning.
  • Configuration can be locked as a recipe and reproduced across builds.
  • No significant EMI/thermal regression versus baseline (threshold X).
Diagram: CTLE weight + tuning ladder + guardrails (abstract, parameter-free)
CTLE weight (abstract) Low-f High-f CTLE 0 CTLE 1 CTLE 2 CTLE 3 Higher CTLE → more high-f emphasis Guardrail: too high can amplify noise Tuning ladder (SOP) Step 0 · Baseline Step 1 · Conservative CTLE Step 2 · CTLE Ladder Step 3 · Gain / Limiter Guardrails: one knob at a time · validate A/B orientation · rollback to last stable config Gate: reflections & coupling contained before tuning

H2-5 · Retimer Internals: CDR / DFE / Adaptive EQ (What Changes)

Intent

Explain why retimers can “wash out” upstream impairments: a new sampling clock domain plus equalization that reduces ISI. Clarify what improves, what does not, and what risks must be validated in real systems.

Retimer = 3 blocks (engineering view)
CDR (Clock Data Recovery)
  • What changes: sampling clock is recovered locally instead of relying on degraded upstream edges.
  • Typical win: sampling point moves back toward the UI center; jitter tolerance improves.
  • Typical risk: additional states and convergence behavior can interact with marginal links.
DFE (Decision Feedback Equalizer)
  • What changes: ISI residue is reduced by canceling post-cursor effects based on decisions.
  • Typical win: strong recovery on loss/ISI-dominated, multi-segment channels.
  • Typical risk: higher sensitivity to noise if over-aggressive; not a substitute for fixing strong reflections.
Adaptive EQ (training & adaptation)
  • What changes: EQ parameters track channel/PVT variations rather than remaining fixed.
  • Typical win: improved robustness across temperature/voltage/cable variance.
  • Typical risk: drift under power noise/thermal gradients; debug repeatability becomes harder.
Boundary rule
Retimers improve loss/ISI and upstream edge degradation. Strong reflections and coupling still require physical fixes first.
What improves vs what must be fixed physically
Often improved by retiming + DFE
  • Loss/ISI-dominated channels (high IL with controlled RL).
  • Upstream edge degradation and sampling margin erosion.
  • PVT-induced margin shrink when adaptation is stable.
Must be fixed before relying on retimers
  • Reflection dominance: major discontinuities and stubs.
  • Crosstalk dominance: coupling hotspots and broken return paths.
  • Mapping/polarity faults: orientation-dependent lane errors (handled in H2-6).
Risks to validate (engineering checks, no standard text)
Compatibility / training sensitivity
Some endpoint/channel combinations are “pickier.” Validate across cables/orientation and multiple endpoints to avoid combination-specific fragility.
Latency & deterministic behavior
Retimers introduce additional pipeline delay and state. If system timing is sensitive, compare with/without retiming early.
Power / thermal drift of adaptation
Validate stability under temperature ramps and supply perturbations. Adaptation can drift if power integrity or heat gradients are poor.
Pass criteria (observable change)
  • Using the same measurement setup, margin indicators improve versus baseline (eye/margin proxy, error rate, or stability metric).
  • Training success and stability hold across the validation matrix (cable/orientation/PVT) with thresholds ≤ X.
  • Long-run or thermal/power stress does not introduce drift-like fragility (threshold ≤ X).
Diagram: retimer internal blocks (Input → CTLE → CDR → DFE → Re-timed Output)
Input degraded Output re-timed Retimer core CTLE pre-EQ CDR new clock DFE ISI repair New sampling clock Jitter / ISI in Margin / BER out Gate: contain reflections and crosstalk before expecting retiming to be robust

H2-6 · Type-C Orientation Handling: Mapping, Polarity, and Where It Breaks

Intent

Focus strictly on orientation within redriver/retimer solutions: how flip/mapping/polarity are handled and why “A works, B fails” is common. The goal is a repeatable lane-map validation and a fast localization workflow for orientation-dependent failures.

Two architectures (high-level)
Architecture A · External MUX flip + fixed redriver/retimer mapping
Pros
Clear responsibility split; retimer configuration is simpler when mapping is stable.
Cons
MUX adds an extra discontinuity and can amplify A/B path asymmetry if layout is not mirrored.
Failure modes
  • MUX control mismatch (wrong flip state).
  • Path A/B IL/RL difference becomes dominant.
  • Extra reflection point introduced by placement/transition.
Architecture B · Orientation-aware redriver/retimer (GPIO/I²C configurable)
Pros
Fewer external breaks; potentially cleaner channel if mapping logic is correct and repeatable.
Cons
Configuration state and timing become critical; debug repeatability depends on deterministic configuration.
Failure modes
  • GPIO/I²C configuration not aligned with orientation.
  • Partial configuration causes lane-map inconsistency.
  • Polarity handling assumptions differ across blocks.
Orientation-dependent failure types (symptom → likely root)
A works, B fails
Path asymmetry: different IL/RL/transition count, or mapping mismatch that only appears when flipped.
Only some cables or endpoints fail
Marginal mapping plus channel variance: a “combination-specific” fragility, often exposed by flip and loss differences.
Flip causes sudden training instability
Polarity or lane-map inconsistency: the physical wiring and logical expectation diverge under flip.
Must-do checklist: lane connectivity matrix (A/B)
  • Define a 2-axis matrix: Orientation A and Orientation B.
  • For each orientation, list: Type-C pins → (MUX/retimer ports) → SoC/bridge lanes.
  • Verify one-to-one mapping: no swaps, no missing lanes, no hidden crossovers.
  • Verify polarity handling: differential pair polarity is consistent across all blocks.
  • Verify control state: GPIO/I²C flip state matches the physical orientation detect signal.
  • Compare A vs B channel severity: IL/RL delta stays within X (relative) for robust tuning.
Fast localization workflow
  1. Record the failure signature: orientation-only, cable-only, endpoint-only, or mixed.
  2. Run lane-map matrix verification (lowest cost, highest leverage).
  3. Measure A/B IL/RL trend to confirm whether asymmetry is physical or logical.
  4. If physical dominance is confirmed, fix topology/transition symmetry before tuning.
  5. If logical dominance is confirmed, lock configuration deterministically and re-validate across the matrix.
Pass criteria (orientation robustness)
  • Orientation A and B both pass the same validation matrix with error rate ≤ X.
  • A/B margin delta remains within X (relative) after tuning.
  • Lane-map verification is repeatable across builds; configuration state is deterministic.
Diagram: Type-C flip & lane map (Path A vs Path B asymmetry and mapping risk)
Type-C connector A/B orientation Flip handling External MUX flip lanes Ori-aware redriver/retimer Lane map matrix A: pins → lanes B: pins → lanes polarity check SoC / Bridge fixed lanes Path A Path B A/B breakpoints ΔIL / extra break lane swap risk polarity mismatch Rule: verify lane connectivity matrix first; then compare A/B IL/RL to separate logical vs physical dominance

H2-7 · Placement & Layout: Where the Retimer/Redriver Must Sit

Intent

Placement is the highest-leverage decision: if the compensator sits at the wrong segment, tuning cannot recover margin. This chapter provides review-ready rules and anti-patterns without drifting into protocol text or separate Type-C MUX topics.

3 hard placement principles
Principle A · Sit near the worst breakpoint
Place the redriver/retimer close to the segment that needs compensation most (typically the connector or long-cable entry), so the “worst” part ends early and the remaining path stays controllable.
Principle B · Avoid the “awkward middle”
Never split a bad channel into two long bad channels (connector ↔ device and device ↔ SoC). The result is two margin-starved segments and orientation/cable sensitivity.
Principle C · Geometry & return path beat “stronger tuning”
Prioritize symmetry, continuous return, and consistent differential coupling. These reduce discontinuities and make tuning repeatable. EQ cannot reliably compensate broken return paths or severe asymmetry.
Do (review-friendly)
  • Place the compensator near the connector / long-cable entry to “end the worst segment early.”
  • Keep the connector-to-compensator path short, monotonic, and low-discontinuity.
  • Mirror the high-speed lane geometry across orientations; keep via counts and transitions symmetric.
  • Maintain continuous reference planes; add explicit return stitching where transitions are unavoidable.
  • Keep aggressors away from the most sensitive segment (closest to connector and device pins).
Don’t (common anti-patterns)
  • Don’t place the device in the middle and leave two long segments on both sides.
  • Don’t create orientation-dependent asymmetry (different layer/via/topology for A vs B).
  • Don’t cross split planes or gaps without a defined return strategy.
  • Don’t treat EQ as a fix for major reflection breakpoints and stub-like structures.
  • Don’t route sensitive lanes alongside noisy bundles or across uncontrolled return regions.
Layout checks that matter most
  • Symmetry: via count, layer transitions, and critical discontinuities match across lanes and orientation paths.
  • Return continuity: reference plane does not break; transitions include return stitching (strategy only).
  • Monotonic segmenting: the “worst” segment is not extended past the compensator.
  • Discontinuity hygiene: avoid sudden impedance shifts near the connector and device pins.
  • Local environment: keep local coupling and nearby routing consistent to prevent orientation-specific deltas.
Pass criteria (review checklist)
  • Compensator is placed near the worst breakpoint; connector-to-device path is minimized.
  • No “two long segments” pattern exists; the channel is segmented intentionally.
  • A/B orientation paths are geometrically comparable; asymmetry stays within X (relative).
  • Reference plane continuity and return strategy are explicitly reviewed and recorded.
  • Checklist items are fully traceable in layout review artifacts (copy/paste ready).
Diagram: good vs bad placement (avoid “two long segments”)
Good placement: worst segment ends early Bad placement: two long segments Connector Type-C Short Redriver or Retimer Controlled SoC / Bridge fixed lanes Result: the “worst” segment is minimized; remaining segment stays controllable and repeatable Connector Type-C Long Compensator middle point Long SoC / Bridge fixed lanes Why “middle placement” fails Two margin-starved segments Higher orientation & cable sensitivity Tuning becomes non-repeatable Rule: place near the worst breakpoint; avoid splitting a bad channel into two long bad channels

H2-8 · Power, Clock, and Thermal: The Silent Link Killers

Intent

Many “works for 5 minutes then drops” failures are not pure SI. Power noise, reference clock quality, and thermal rise can shrink CDR margin or cause adaptive drift. This chapter provides the shortest symptom-to-fix loop with monitor points and threshold placeholders.

Symptom map (field patterns)
Cold OK, hot fails
Margin shrinks with temperature; adaptation may drift under heat gradients.
Idle OK, full load fails
Supply ripple and local noise increase under activity; jitter and error rate rise.
Low-voltage corner is fragile
Reduced headroom increases sensitivity to ripple and clock/noise coupling.
5–10 min later becomes fragile
Thermal rise and time-dependent noise reveal drift-like behavior, even if short tests pass.
Killer 1 · Power noise
Symptom
Errors increase under throughput; stability depends on load profile; long-run tests degrade.
Quick check
  • Measure ripple/noise at device power pins (monitor point defined in the test plan).
  • Correlate error bursts with load transitions and power events.
Fix (strategy)
  • Strengthen local decoupling loop at the device; reduce high-frequency impedance.
  • Isolate noisy domains; improve return integrity around the device region.
Killer 2 · Reference clock quality
Symptom
Link is sensitive to system noise; margin collapses under certain modes; failures cluster around “clock/noise heavy” conditions.
Quick check
  • Verify refclk integrity at the device input (monitor point + capture method).
  • Check whether failures correlate with clock-source changes or noise injection events.
Fix (strategy)
  • Use cleaner clock sources where required; reduce coupling from noisy rails and switching domains.
  • Keep clock routing return continuous and away from aggressors; avoid avoidable discontinuities.
Killer 3 · Thermal rise & gradients
Symptom
Short tests pass; long-run or high-ambient tests fail; orientation sensitivity grows when hot.
Quick check
  • Monitor device temperature and hotspot proximity (monitor point defined in the plan).
  • Correlate failure onset time with temperature ramp and airflow changes.
Fix (strategy)
  • Improve heat spreading: copper area, vias, and thermal path to planes.
  • Avoid placing the device next to major heat sources; ensure airflow path is not blocked.
Monitoring points (threshold placeholders)
  • Power ripple @ device pins: monitor point + capture method + limit ≤ X.
  • Refclk quality @ device input: monitor point + capture method + limit ≤ X.
  • Device temperature / hotspot: monitor point + logging + limit ≤ X.
Minimal stress matrix
Temperature (cold/room/hot) × Voltage (nominal/low corner) × Load (idle/full) × Time (short/long).
Pass criteria (no silent killers)
  • In the stress matrix, stability metrics (training success and long-run errors) meet thresholds ≤ X.
  • Power ripple, refclk quality, and temperature logs stay within limits ≤ X at defined monitor points.
  • No time-dependent drift is observed: long-run behavior does not degrade beyond X.
Diagram: power / refclk / thermal → CDR margin ↓ → errors ↑
Power noise ripple / spikes Refclk jitter quality at input Thermal rise hotspot / gradient Retimer CDR margin sampling window Margin Errors ↑ instability Rule: log power ripple + refclk quality + temperature; prevent margin drift before blaming SI

H2-9 · Bring-up & Debug Workflow: From “Fails” to Root Cause Fast

Intent

Convert ad-hoc debugging into a repeatable workflow: freeze variables, apply minimal physical fixes first, then scan EQ/training settings, and finally use correlation (temperature/voltage/orientation/load) to isolate silent killers. Every step must output recordable data points for QA gates.

Workflow (Step 0 → Step 4)
Step 0 · Freeze the test matrix
Lock variables (cable bucket / orientation / rate profile / temperature corner / power corner) to make failures reproducible.
Step 1 · Minimal physical fixes first
Fix termination/topology discontinuities and asymmetry before trusting EQ. Remove “two-long-segment” and return-path issues.
Step 2 · EQ / training scan
Systematically scan redriver knobs or retimer configs (conservative → aggressive), with a rollback rule and a compact scorecard.
Step 3 · Correlation analysis
Check whether failures track temperature / voltage / orientation / load. Separate reflection/EQ issues from power/clock/thermal killers.
Step 4 · Finalize & regression
Freeze final settings + evidence pack; run a minimal regression matrix and document acceptable windows (threshold placeholder X).
Per-step outputs (recordable data points)
Step 0 output
  • Matrix grid: Pass/Fail per cell (cable bucket × orientation × rate × temp × power).
  • Error counters or “symptom tags” per cell (no protocol details; consistent naming only).
  • Reproducibility note: run duration and time-to-fail statistics (placeholder X).
Step 1 output
  • Physical fix list (what changed, where, why).
  • Before/after comparison in the frozen matrix (delta must be visible).
  • Orientation delta shrink check: A/B gap within placeholder X.
Step 2 output
  • Scan table: knob/config → pass rate → error density → orientation sensitivity.
  • Rollback rule: conditions to revert (e.g., instability or side-effects beyond placeholder X).
  • Chosen profile: the best “stable region” (not just one best point).
Step 3 output
  • Correlation statement: errors track which variable(s) (temp/voltage/orientation/load).
  • Root-cause bucket: reflection/topology vs EQ/training vs power/clock/thermal.
  • Next proof action: one decisive check to confirm the bucket (recorded).
Step 4 output
  • Final config snapshot + version tag; restore procedure.
  • Minimal regression matrix (short/long run) with acceptance window ≤ X.
  • Evidence pack links: logs / screenshots / measurements (placeholders).
Pass criteria
  • Each step produces a recordable artifact (matrix grid, scan table, correlation note, final snapshot).
  • Failures can be reproduced from Step 0 within time-to-fail ≤ X (placeholder).
  • Final settings remain stable across the minimal regression matrix with thresholds ≤ X.
Diagram: debug workflow (freeze → physical → EQ/training → correlation → pass)
Bring-up & Debug Workflow (Step 0 → Step 4) Symptom training fail / flaps drops / degrades Step 0 Freeze test matrix cable / orient / temp Step 1 Minimal physical fixes topology / symmetry Step 2 EQ / training scan conservative → aggressive with rollback rule Step 3 Correlation analysis temp / voltage / orient / load bucket root-cause fast Step 4 · Finalize & Regression freeze config + evidence pack minimal matrix + window ≤ X Artifacts matrix / scan / logs snapshots / notes

H2-10 · Engineering Checklist (Design → Bring-up → Production)

Intent

Convert the entire topic into a gate-based checklist that teams can execute and audit. Each gate requires concrete evidence types (logs, screenshots, measurements, snapshots) with threshold placeholders.

Gate A
Design Gate
  • □ Channel loss/breakpoint budget documented (relative buckets; no spec quoting).
  • □ Redriver vs retimer decision recorded with input assumptions (loss dominance / jitter risk / complexity).
  • □ Placement strategy approved: near worst breakpoint; no “two-long-segment” pattern.
  • □ Orientation robustness planned: geometry and path symmetry targets defined (placeholder X).
  • □ Return continuity risks identified and mitigations listed (strategy-level).
Evidence required
Layout review screenshots (placeholders), budget worksheet (placeholder), decision note (placeholder link).
Gate B
Bring-up Gate
  • □ Test matrix frozen (cable bucket / orientation / rate / temperature / power).
  • □ Minimal physical fixes completed and recorded (termination/topology/asymmetry).
  • □ EQ/training scan completed with rollback rule and stable region selection.
  • □ Correlation analysis done (temperature/voltage/orientation/load) with root-cause bucket.
  • □ Monitor points logged: power ripple / refclk quality / device temperature (limits ≤ X).
Evidence required
Matrix grid logs, scan tables, correlation note, power/clock/thermal logs (placeholders).
Gate C
Production Gate
  • □ Configuration locked: version tag + checksum + restore procedure documented.
  • □ Tolerance window defined for corners (acceptance ≤ X placeholders).
  • □ Sampling strategy defined: minimal regression + audit frequency (placeholders).
  • □ Failure triage template published: Fail → Step 0..4 fields for quick reproduction.
  • □ Field reproduction pack prepared: matrix subset + logging recipe + snapshot form.
Evidence required
Config snapshot, regression records, sampling logs, triage template, field pack (placeholders).
Pass criteria (gate completion)
  • All gates are checkable and auditable; each item maps to a tangible evidence artifact.
  • Evidence types are standardized (logs / measurements / snapshots / regression results).
  • Any failure can be routed back to H2-9 Step 0..4 fields for fast reproduction and closure.
Diagram: three gates (Design → Bring-up → Production) with evidence types
Engineering Gates (Design → Bring-up → Production) Design Gate budget + decision placement strategy orientation robustness return continuity Bring-up Gate frozen matrix physical fixes scan + rollback correlation proof Production Gate config lock tolerance window sampling strategy triage template Evidence types logs · measurements · screenshots · snapshots · regression records

H2-11 · Applications & IC Selection Logic

This section turns the first 10 chapters into a purchase- and review-ready output: a capability checklist. It avoids brand-stacking and instead maps channel profile → required knobs → evidence to collect.

Scope guardrails (to avoid topic overlap)
  • In scope: USB redriver/retimer application profiles, selection knobs, and requirement-to-capability mapping.
  • Not covered here: detailed Type-C MUX/Alt-Mode routing, USB PHY compliance templates, port protection BOM sizing, or hub/bridge architectures.
  • Hand-off rule: if a decision depends on which MUX, which PHY, or which TVS array, capture the requirement here and resolve details in the sibling pages.
A) Applications → Turn each into a channel profile (not a story)
Each card uses the same fields so the output becomes comparable across projects.
1) Long cable / Dock / External accessory
  • Topology: cable + receptacle + board trace; frequent hot-plug; multiple cable vendors.
  • Dominant risk: mixed loss + ISI + environmental drift (temperature / supply corners).
  • Operational variables: orientation A/B, cable length tiers, load state (idle vs sustained traffic).
  • Must be observable: training stability, error counters, temperature rise near conditioner, supply ripple at the device pins.
  • Likely device class: retimer favored when “edge-case” failures exist; redriver can be sufficient when loss dominates and jitter is controlled.
2) Connector-heavy path / Adapter board / Multi-receptacle chain
  • Topology: multiple connectors, stubs, and discontinuities; reflections become first-order.
  • Dominant risk: return-loss driven closure (RL/reflections) that EQ cannot fully “erase”.
  • Operational variables: direction-dependent path differences, assembly variation, connector batch variation.
  • Must be observable: A/B margin delta, sensitivity to “which connector” and “which cable”.
  • Likely device class: start with physical fixes (impedance, symmetry, connectors). Add redriver/retimer only after topology is sane.
3) Mainboard to Type-C edge (internal traces to receptacle)
  • Topology: long internal traces, tight routing constraints, orientation handling required.
  • Dominant risk: asymmetry between A/B orientations (routing, via count, escape pattern).
  • Operational variables: EMI environment, nearby switching rails, thermal coupling to hotspots.
  • Must be observable: per-orientation pass rate, per-lane mapping consistency, “works only one way” symptom isolation.
  • Likely device class: reversible redriving switches (with EQ) are common; retimer when jitter/ISI mix and training margin is thin.
4) Board-to-board / Mezzanine / Space- and thermal-limited build
  • Topology: short distance but harsh constraints (stack height, dense routing, power/thermal budget).
  • Dominant risk: power/thermal induced drift; noise coupling into sampling margin.
  • Operational variables: enclosure temperature, airflow, sustained load duty cycle.
  • Must be observable: temperature rise vs error onset, supply ripple vs error onset.
  • Likely device class: redriver if channel is clean; retimer only when required by margin, and only with proven thermal/power closure.
Pass criteria for the applications step
Each project ends this section with a channel profile: topology summary, dominant risk, and the top variables (orientation / cable / temperature / supply).
B) Selection knobs → Write requirements, not brand names
Each knob is expressed as: Knob / Why / Trade-off / Evidence.
Redriver knobs (CTLE + gain staging)
  • CTLE steps: needed when loss dominates. Trade-off: over-boost can worsen EMI/crosstalk. Evidence: EQ sweep table + matrix pass-rate (X).
  • Gain staging: prevents overdrive and keeps eye “useful”, not just “tall”. Trade-off: too little gain reduces margin. Evidence: stable passes across cables/orientation (X).
  • Limiter / output swing control: stabilizes amplitude under variable channels. Trade-off: may introduce nonlinearity and mask reflection problems. Evidence: no regression in error rate + emissions (X).
  • Transparency / low-power states: must not break link states in real docks. Trade-off: fewer knobs can mean fewer recovery options. Evidence: state transitions and reconnect behavior recorded (X).
Retimer knobs (CDR/DFE + adaptive behavior)
  • CDR/DFE strength: required when ISI and jitter mix, or when “barely works” corners exist. Trade-off: added power/heat and complexity. Evidence: measurable margin improvement (X).
  • Adaptive EQ vs locked configuration: adaptive helps variability; lock helps reproducibility. Trade-off: drift vs lack of tolerance. Evidence: corner matrix (temp/voltage) stability (X).
  • Diagnostics/visibility: status readout shortens root-cause time. Trade-off: integration effort (I²C/GPIO). Evidence: bring-up log template captures key states (X).
  • Latency / determinism: confirm system impact for time-sensitive paths. Trade-off: “stronger” conditioning can add delay. Evidence: latency observation point defined (X).
  • Thermal/power envelope: retimers often fail “after minutes” if thermal/power is weak. Trade-off: board area for copper and airflow. Evidence: temperature rise log + ripple log (X).
Orientation handling knobs (A/B symmetry is a requirement)
  • Reversible lanes / crosspoint support: reduces “one-way only” failures. Trade-off: routing constraints and configuration complexity. Evidence: A/B pass-rate and margin delta (X).
  • Control interface (I²C/GPIO): required for deterministic mapping and bring-up control. Trade-off: firmware/board hooks. Evidence: lane map checklist + config capture (X).
  • Design-for-symmetry: the physical channel must remain symmetric. Trade-off: PCB effort. Evidence: layout review checklist signed off (X).
Pass criteria for selection knobs
The output is a capability checklist (must-have knobs + evidence), not a vendor list.
C) Example IC candidates (MPNs) — use as a filtering starting point
These examples help translate “knobs” into real orderable families. Always validate against the project channel profile and corners.
USB Type-C / USB 3.x linear redrivers (CTLE-based)
  • TI TUSB1104 — USB Type-C 10Gbps USB 3.2 x2 adaptive linear redriver (host↔receptacle / device↔receptacle). Example orderable: TUSB1104RNQR.
  • TI TUSB522P — USB Type-C 10Gbps USB 3.2 x2 adaptive linear redriver (adaptive EQ).
  • TI TUSB1044 / TUSB1044A — reversible multi-protocol Type-C Alt Mode redriver switch (up to 10Gbps class) with EQ control.
  • TI TUSB1046A-DCI — USB Type-C DP Alt Mode 10Gbps-class linear redriver crosspoint switch. Example orderable: TUSB1046A-DCIRNQR.
  • TI TUSB544 — reversible multi-protocol Type-C Alt Mode redriver switch (8.1Gbps class) with EQ levels.
  • TI TUSB1004 — 10Gbps USB 3.2 4-channel adaptive linear redriver (commonly for Type-A style channels).
Use these families when the channel is loss-dominant and topology/reflections are already under control.
USB4 / USB 3.2 retimers (CDR + retiming for thin margins)
  • Parade PS8830 — USB4 retimer supporting USB4 Gen 3×2 and USB 3.2 Gen 2×2 (host-side, USB-C receptacle focused).
  • Parade PS8833 — USB4/DP 2.1/TBT4 retimer (host applications), also conforms to USB 3.2 retimer/alt-mode support.
Use retimers when ISI + jitter + corner drift jointly push the link into “works only sometimes” territory.
Type-C switching + retiming (integration example)
  • Parade PS8803C — USB-C 10Gbps-class switch with signal retiming function (useful when mapping + SI conditioning must be handled together).
Integrated parts can simplify orientation/mapping, but increase the importance of validation evidence (matrix + thermal + ripple).
Legacy combo redriver example (use only if lifecycle fits)
  • NXP PTN36502 / PTN36502A — Type-C USB 3.1 Gen 1 + DP 1.2 combo redriver (note: lifecycle may be obsolete depending on sourcing).
If lifecycle is a risk, document it as a design constraint and select an active family instead.
Avoid overlap note
If the choice depends on the exact Type-C MUX, Alt-Mode routing, or port ESD/TVS BOM, capture the requirement here and resolve those details in the corresponding sibling pages.
D) Output: capability checklist (copy/paste template)
Capability checklist template
  • Channel profile: loss bucket (X), connector count (X), cable presence (Yes/No), orientation A/B asymmetry risk (Low/Med/High).
  • Constraints: power budget (X), thermal budget (X), board area (X), configuration hooks available (I²C/GPIO: Yes/No).
  • Must-have knobs: CTLE steps (X), gain control (X), limiter/swing control (X), diagnostics visibility (X), configuration lock (X).
  • Orientation requirements: reversible lanes / crosspoint (Yes/No), lane-map validation checklist (required), A/B delta limit (X).
  • Validation evidence: test matrix (cables × orientation × temp × voltage × rate tier), EQ/training logs, thermal log, supply ripple log.
  • Decision: □ Redriver class □ Retimer class □ Re-architect first (shorten / reduce connectors / fix symmetry).
Pass criteria
The checklist produces a reproducible device capability set and a defined evidence package (logs/measurements) for design reviews and production gates.
Diagram: Selection knob panel → Device class decision
Profile inputs feed “knobs”, then output a decision plus the evidence to collect.
Capability Mapping (Profile → Knobs → Decision) Profile inputs Loss bucket Connector count Cable presence Orientation A/B delta Knob panel Redriver CTLE / Gain Limiter Retimer CDR DFE Adaptive Orientation mapping (A/B symmetry) Power / Clock / Thermal closure Decision Redriver Retimer Re-architect shorten / symmetry Evidence to collect (production-ready) Matrix pass-rate A/B delta Thermal log Supply ripple / status log
The decision should always be backed by evidence: matrix coverage, A/B symmetry, and thermal/power closure (thresholds: X).

Request a Quote

Accepted Formats

pdf, csv, xls, xlsx, zip

Attachment

Drag & drop files here or use the button below.

H2-12 · FAQs (10–12) — Field Debug Long-Tail Only

FAQ rules
Scope: only field-debug long-tail. No new domains. Each item is exactly four lines: Likely cause / Quick check / Fix / Pass criteria (threshold placeholders X/Y).
Same cable works in one Type-C orientation but fails in the other — mapping issue or path asymmetry?
Likely cause: A/B lane map mismatch, polarity mismatch, or A/B physical path asymmetry (via count/escape/connector path) shrinking margin in one orientation.
Quick check: Run an A/B orientation A-B-A loop (same cable, same port) and log pass/fail + error counters; if failures follow orientation (not the cable), it is mapping/asymmetry.
Fix: Validate lane-map checklist end-to-end (connector→mux/flip→redriver/retimer→SoC), then reduce A/B asymmetry (routing symmetry, matched discontinuities) and lock orientation control (GPIO/I²C) where applicable.
Pass criteria: Orientation A and B both pass the same test matrix with pass-rate ≥ X% over Y plug cycles; A/B pass-rate delta ≤ X%; no orientation-specific error bursts within a Y-minute window.
Redriver “improves” eye height but link becomes less stable — over-EQ or reflection-dominated?
Likely cause: Over-boosted CTLE/gain amplifies crosstalk/noise, or the channel is reflection/RL-dominated where EQ cannot fix the root discontinuity.
Quick check: Sweep CTLE/gain from conservative→aggressive while logging stability (drop events) and error counters; if “more EQ” correlates with worse stability, it is over-EQ/noise. If no EQ setting helps, suspect reflections.
Fix: Apply “physical first”: remove/shorten stubs, restore return-path continuity, improve impedance transitions (connector/launch). Then re-tune CTLE with a strict rollback rule (stop at first stable plateau).
Pass criteria: A stable EQ setting exists where (a) no link drops for Y minutes, (b) error rate ≤ X per Y GB, and (c) stability remains within X% when moving one CTLE step up/down (robustness band).
Retimer passes bring-up but flaps after 5–10 minutes — thermal drift or power-noise coupling?
Likely cause: Retimer CDR/adaptation loses margin as junction temperature rises, or supply ripple/noise increases under steady traffic and couples into sampling margin.
Quick check: Correlate “first flap time” with temperature rise (ΔT at device area) and pin-level supply ripple (mVpp). Repeat with airflow or reduced load to see if flap time shifts.
Fix: Improve thermal path (copper/thermal vias/airflow), enforce local decoupling and short return loops, and if supported, lock stable equalization/training profiles rather than fully free-running adaptation.
Pass criteria: Under worst-case steady load for Y minutes, ΔT ≤ X°C and ripple ≤ X mVpp at defined probe points; link has zero flaps and error rate ≤ X per Y GB over the full soak.
Training succeeds at a lower rate but fails at max — first check loss budget or jitter/clock?
Likely cause: High-rate margin is closed by loss/ISI beyond what CTLE can recover, or clock/jitter/noise dominates and only appears at the highest rate.
Quick check: Compare two deltas: (1) change cable length/connector count (loss delta), (2) change reference clock/noise conditions (jitter delta). If failure tracks loss more than clock conditions, treat it as loss-dominant.
Fix: If loss-dominant: reposition conditioner closer to the discontinuity and tune CTLE/gain; if jitter/noise-dominant: prioritize retiming (retimer) and power/clock hygiene; always re-validate A/B orientation symmetry.
Pass criteria: Max rate trains successfully with pass-rate ≥ X% across the defined matrix (cables × orientation × temp × voltage); stability window ≥ Y minutes with error rate ≤ X per Y GB at max rate.
Only certain cables fail — insertion-loss variance or connector return-loss spike?
Likely cause: Cable IL spread pushes some samples over the margin edge, or connector RL/impedance discontinuity creates reflection peaks that a subset of cables amplifies.
Quick check: Sort cables by fail rate and test with a fixed orientation/temperature; if failures correlate with cable length/vendor batches, treat it as IL variance; if failures are random across similar IL, suspect RL discontinuity at the port/connector.
Fix: Expand the validation matrix to include “worst-case cables”, then tune for robustness (not peak eye). If RL-driven, repair launches/connector transitions before increasing EQ strength.
Pass criteria: With the top Y worst cables included, matrix pass-rate ≥ X%; cable-to-cable pass-rate spread ≤ X%; no single cable causes repeated drops within Y plug cycles.
After adding a retimer, EMI worsens — swing/limit too aggressive or return path broken?
Likely cause: Output swing/limiting settings inject more high-frequency energy, or layout changes broke the return path (ground reference discontinuity), converting differential energy into common-mode radiation.
Quick check: Toggle output swing/limit presets (if available) while repeating the same EMI observation and link stability; if EMI changes with settings, it is drive-strength related; if EMI persists regardless, suspect return-path/layout.
Fix: Reduce aggressiveness (swing/limit) to the minimum that meets stability; restore continuous return path (short reference transitions, controlled stitching), and re-verify orientation symmetry after layout changes.
Pass criteria: EMI metric improves by ≥ X dB versus the worst configuration while link remains stable for Y minutes; no more than X% stability regression compared to pre-change baseline.
Two retimers in series makes it worse — too many CDR domains or poor segmentation/placement?
Likely cause: Over-segmentation creates multiple CDR/adaptive domains that fight variability, or the retimers are placed such that both sides remain “long and weak”, leaving no strong margin segment.
Quick check: A/B test: disable/bypass one retimer (or use a configuration mode) and compare training pass-rate + stability window; if one-retimer improves, the chain is over-conditioned or mis-segmented.
Fix: Redesign segmentation: place the retimer at the true break point (near the connector/long cable entry) and ensure the remaining segments are short/controlled; lock stable training profiles where supported.
Pass criteria: With the intended segmentation, max-rate training pass-rate ≥ X% and stability window ≥ Y minutes; removing any extra retimer does not improve results by more than X% (no hidden over-conditioning benefit).
Errors appear only under high load — thermal or supply droop correlated with activity?
Likely cause: Activity increases power draw and heat, shrinking CDR/DFE margin; or causes transient droop/ripple that couples into sampling.
Quick check: Log error bursts against (a) temperature rise, (b) supply droop/ripple at device pins, and (c) traffic duty cycle. Repeat with reduced load to see if bursts move/disappear.
Fix: Strengthen local decoupling/return, reduce impedance to the device supply, improve thermal spreading/airflow, and re-tune EQ/training under worst-case load (not idle).
Pass criteria: Under sustained load for Y minutes, droop ≤ X mV, ripple ≤ X mVpp, ΔT ≤ X°C; error rate ≤ X per Y GB with zero link drops.
Scope looks clean at the SoC pads but link still fails — where is the first measurement point that matters?
Likely cause: The real impairment is at the discontinuity (connector/launch/mux/conditioner pins), not at the SoC pads; probing at the wrong point hides reflection and return-path issues.
Quick check: Move the “first check” to the break point: connector side and conditioner pins. Compare A/B orientation and cable changes at that point, not only at the SoC pads.
Fix: Re-center debugging around the discontinuity: validate impedance transitions, return path continuity, and symmetry; then tune EQ/training using the same measurement point for every iteration.
Pass criteria: The chosen measurement point yields a repeatable correlation: configuration changes shift stability/error metrics predictably (R² ≥ X placeholder); final setting passes matrix with pass-rate ≥ X% for Y minutes stable runtime.
Passing in lab, failing in field — missing worst-case cable + temperature + orientation matrix?
Likely cause: Validation matrix is incomplete; field combines worst-case cable + temperature + orientation (and power/EMI) that the lab did not cover.
Quick check: Recreate the field corner by expanding the matrix: worst cables, both orientations, high/low temperature, low supply corner, and sustained traffic. Compare pass-rate to the lab-only subset.
Fix: Promote the matrix to a release gate: lock configurations that pass the full matrix, and document evidence (logs/thermal/ripple) as required artifacts for sign-off.
Pass criteria: Full matrix coverage achieved (cables × orientation × temp × voltage × rate tier), with overall pass-rate ≥ X%; no single corner below X%; stability ≥ Y minutes per corner run.
Retimer “transparent” mode still breaks compatibility — first capability/strap/config sanity check?
Likely cause: Strap/config mismatch (lane polarity/map, orientation control, adaptation mode) or a capability negotiation expectation not met by the configured retimer behavior.
Quick check: Freeze one known-good baseline configuration and compare: (a) strap states, (b) I²C register dumps, (c) orientation control state, (d) power-up sequencing. If behavior changes across boots, suspect configuration nondeterminism.
Fix: Make configuration deterministic: lock straps, enforce a single boot-time config write sequence, and record the register image as a production artifact; keep lane-map/orientation checks explicit.
Pass criteria: Cold-boot repeatability ≥ X% over Y cycles; config checksum matches the golden image every boot; compatibility pass-rate ≥ X% across the supported device set within a Y-minute run.
Switching to a different vendor drop-in degrades margin — pin-compatible ≠ EQ-compatible assumptions?
Likely cause: Different CTLE peaking shape, limiter behavior, adaptation policy, or default settings move the channel away from the previous optimum; “drop-in” changes the system transfer function.
Quick check: Re-run the same EQ/training sweep and compare the “stable region” width; if the new part has a narrower stable window or shifted optimum, it is EQ-behavior mismatch (not wiring).
Fix: Treat as a new SI component: re-tune knobs, update the golden configuration, and re-qualify with the worst-case matrix; if stability window remains too narrow, revert device class or re-architect the channel.
Pass criteria: New part achieves stable window width ≥ X settings (CTLE/gain combinations) and overall matrix pass-rate ≥ X%; A/B delta ≤ X%; no performance regression > X% versus baseline across Y hours cumulative runs.