123 Main Street, New York, NY 10001

ToR Interposer Board for PAM4 Retimers, Clocks & Telemetry

← Back to: Data Center & Servers

Key takeaway

A ToR interposer board segments long PAM4 channels with retimers and keeps the system serviceable by bundling refclk fanout, rails/telemetry, and sideband (EEPROM/LEDs) into one maintainable layer—so link training, BER, and field faults can be diagnosed with interposer-visible evidence.

H2-1 · What a ToR Interposer Board Is (and what it is NOT)

One-sentence definition (scope anchor)

A ToR interposer board is a serviceable high-speed interface layer placed between a ToR switch mainboard and the front-panel connector/cage, typically integrating PAM4 retimers, reference-clock fanout, sideband management, and status/telemetry (EEPROM/FRU, LEDs, thermal & power monitors) to segment channels and improve bring-up visibility.

Channel segmentation PAM4 retiming Refclk fanout Sideband robustness Telemetry & status Serviceability

When an interposer is justified (two common engineering triggers)

  • Channel loss / discontinuities exceed margin: long traces + multiple connectors push insertion loss, reflections, and jitter beyond what receiver equalization can reliably close → retimers split the path into shorter, measurable segments.
  • Serviceability and risk isolation: front-panel connectivity evolves faster than the mainboard; moving high-risk high-speed components onto a replaceable interposer reduces rework cycles, speeds field repair, and protects mainboard yield.
Practical objective: make link failures diagnosable with board-visible evidence (lock bits, error counters, rails/temperature), instead of “mystery BER”.

Clear boundaries (avoid cross-topic overlap)

This page covers
  • PAM4 retimer vs redriver choice for interposer usage (CDR, EQ, observability)
  • Refclk distribution and fanout (jitter hygiene at board level)
  • Sidebands: I²C/SMBus/I³C device map, addressing, level domains, bus recovery
  • EEPROM/FRU + status LEDs + thermal/power telemetry (minimum useful set)
  • Bring-up and fault isolation using board-level signals and registers
Link-only topics
  • Switch ASIC internal architecture & packet pipeline (link to ToR switch pages)
  • System time sync (PTP/SyncE/GNSSDO) beyond board clock distribution
  • Platform BMC/IPMI/Redfish software stack and fleet telemetry backend
  • Rack power subsystem design (PSU/PDU/hot-swap) and facility cooling
  • Optical module internal DSP/laser/TIA design details

Figure A — Where the interposer sits in a ToR link

Mainboard Switch ASIC (boundary) Refclk in Sideband bus Connector discontinuity Interposer PAM4 Retimer Clock Fanout EEPROM FRU LEDs Status Front Panel Cage / Connector Link Segment 2 Link segmentation shorter, measurable segments Refclk I²C/SMBus/I³C sideband Power rails retimer / clock / sensors / EEPROM / LEDs Thermal sensor

The interposer’s value is not “more lanes”; it is segmented channels, cleaner clock distribution, and board-visible status/telemetry that shorten bring-up and field isolation.

H2-2 · System Role & Interfaces: Signals, Sidebands, and Boundaries

Interface contract (why this chapter exists)

A ToR interposer should be treated as an interface contract: every later design, bring-up, and debug action maps back to a specific interface group (high-speed lanes, refclk, sidebands, reset/control, and power domains). This prevents “random tuning” and keeps ownership boundaries clear.

Boundary rule: discussion stops at connector-visible signals + readable on-board registers/telemetry (no platform software deep-dive).

Grouped interface map (what carries what, and what can break)

Interface group What it carries Key constraints on an interposer Field-observable evidence
High-speed lanes
PAM4 SerDes
Data traffic across segmented channel sections; retimer input/output boundaries Channel loss & discontinuities; crosstalk control; retimer EQ range (CTLE/DFE/FFE) & CDR lock behavior; consistency across ports Link training loops; CDR lock bits; per-lane error counters; BER change vs temperature/voltage
Reference clock
Refclk
Clock reference for retimer PLL/CDR and coherent sampling Fanout additive jitter; impedance/return-path hygiene; isolation from switching noise; phase/length matching for multi-port symmetry PLL lock stability; sensitivity to supply noise; “locks but errors persist” patterns
Sideband bus
I²C/SMBus/I³C
Configuration, identification (FRU), sensor readouts, LED control Address planning; voltage domains & level shifting; bus buffering; stuck-bus recovery; write-protect strategy for EEPROM NACK bursts; bus stuck low; watchdog-driven bus reset events; FRU/ID mismatch logs
Reset / control
Reset/Fault
PERST#/reset sequencing; LOS/LOL; fault aggregation; module/presence controls Power-good gating; deterministic default states; fail-safe behavior; clear ownership of who asserts/clears faults Repeatable “won’t train after warm reboot”; fault pins asserted; sticky fault status registers
Power domains
Rails
Retimer/clock/sensors/EEPROM/LED supplies and local filtering PI→SI coupling (ripple becomes jitter); decoupling hierarchy; sequencing dependencies; telemetry thresholds & event capture Undervoltage flags; thermal rise; current spikes; correlation between rail noise and error bursts

Figure B — Single-view interface overview (lanes, refclk, sidebands, power)

Mainboard side Interposer contract Front-panel side PAM4 lanes retimer I/O boundary Refclk fanout Sidebands I²C/SMBus/I³C Reset & fault Power domains + telemetry rails, temp, V/I monitors, events EEPROM FRU/ID LEDs status High-speed lanes Refclk (jitter hygiene) Sidebands (addressing, recovery) Power domains noise → jitter → BER (correlation via telemetry) Legend High-speed Refclk Sideband

This single view acts as the “map”: lane issues map to retimer boundaries, clock issues map to fanout/jitter hygiene, and sideband issues map to addressing/recovery and FRU/LED semantics.

H2-3 · PAM4 Channel Budget: Why Retimers Are Needed Here

Typical field symptoms that point to margin loss

Retimers are usually introduced after repeated evidence that the current channel is operating near the edge. The most actionable approach is to map each symptom to a budget “bucket” (loss, reflections, crosstalk, jitter) and verify with segment-level evidence instead of random tuning.

  • Training loops / unstable link → discontinuities, reflection hot-spots, or EQ adaptation not converging
  • BER spikes → jitter bucket overflow, crosstalk bursts, or supply/thermal sensitivity collapsing eye margin
  • Downshift after warm-up → temperature-driven margin loss (CDR/EQ/connector), often amplified by clock or rail noise
  • Only some ports are bad → channel-to-channel variation dominated by connector/escape routing differences
Insertion loss (IL) Return loss (RL) NEXT/FEXT RJ/DJ Refclk noise → UI
Practical goal: convert “one long unknown channel” into “multiple short measurable segments” with clear observation points.

Segment budgeting (turn a long path into measurable sections)

A ToR interposer enables deliberate channel segmentation. Instead of treating the path as a single loss number, the path is decomposed into sections that can be correlated with board-visible evidence (lock status, counters, temperature/rails).

Segment model
  • Mainboard trace → escape routing & via fields
  • Connector section → strongest discontinuity & variation source
  • Interposer section → retimer I/O boundaries + local clock/power
  • Cage/front section → final connector/cage transitions
Measurement / evidence points
  • At retimer boundary → CDR lock, EQ status, per-lane error counters
  • On interposer rails → undervoltage flags, current spikes, correlation to error bursts
  • On interposer thermal → hotspot temperature and “warm-up margin collapse” patterns
  • Across ports → identify channel variation vs systemic clock/rail issues
Placement rule of thumb: segment the path at the most variable discontinuity (connectors/escape regions) so failures become diagnosable by section.

Figure C — Link segmentation + budget buckets (relative contributions)

Channel as segments (measure each section) Mainboard trace section Connector variation Interposer retimer boundary Cage / Front final section Segment 1 Seg 2 Segment 3 Segment 4 Discontinuities (connectors / vias / escapes) Budget buckets (relative) IL RL XTALK Jitter Legend relative share hot-spot segment line

The illustration avoids numeric standards on purpose: the design intent is segmentation plus bucket reasoning, enabling targeted measurement and placement rather than “try-and-see”.

H2-4 · Retimer Architecture Choices That Matter on an Interposer

Fast decision logic (redriver vs retimer)

Interposer selection is most reliable when driven by field evidence and recovery needs. The decision is typically determined by whether the design needs CDR-based retiming, board-visible observability, and predictable behavior across multiple ports.

Prefer retimer when
  • Training is unstable or BER is sensitive to temperature/rails (margin collapses)
  • CDR lock state and error counters are required for field isolation
  • Multi-port symmetry is required (consistent behavior per port)
Redriver may be enough when
  • Loss is moderate and the main issue is amplitude/ISI without severe jitter
  • Power/thermal limits are extremely tight and observability is not mandatory
  • Recovery can be done by simple re-initialization and the channel is stable
Interposer-specific priority: observability + recoverability often outweigh peak link “best-case” performance because they reduce outage time.

Architecture selection matrix (what matters specifically on an interposer)

Parameter Why it matters on an interposer What to verify If wrong
CDR / retiming Determines jitter tolerance and whether long, variable channels can be stabilized across temperature and port variations. Lock behavior, lock time, stability under rails/thermal changes. “Locks sometimes” behavior; BER bursts after warm-up or brownouts.
EQ range
CTLE/DFE/FFE
Defines how much insertion loss and discontinuity can be compensated before adaptation becomes unstable. Adaptation convergence, per-lane tuning visibility, safe defaults. Training loops; port-to-port randomness; sensitivity to connector swaps.
Observability Field isolation depends on readable status (CDR lock, EQ state, error counters) at the segment boundary. Readable registers, counters, sticky fault flags, snapshot capability. “No evidence” failures; long MTTR due to blind debugging.
Management mode
strap vs I²C
Controls default boot behavior, recoverability, and configuration resilience when the bus is noisy or stuck. Safe default config, bus recovery strategy, write-protect policy. Bricked behavior after config errors; stuck-bus prevents recovery.
Port symmetry Interposers often host many lanes/ports; consistent tuning reduces “lottery ports” and simplifies operations. Per-port tuning consistency, deterministic initialization sequence. Only some ports fail; repeated RMAs without root cause clarity.
Power / thermal Retimers concentrate heat; PI noise can convert to jitter and amplify BER. Thermal headroom affects long-term stability. Power vs rate, thermal derating, rail sensitivity guidance. Warm-up downshift; intermittent errors under fan curve changes.
Refclk dependency Some architectures demand low additive jitter clocks; fanout and routing hygiene become first-order constraints. Input clock requirements, forwarding behavior, tolerance to jitter. Clock “looks ok” but link stays fragile; errors correlate with clock noise.
Latency consistency Even without protocol deep-dive, consistent lane behavior simplifies validation and avoids asymmetric training outcomes. Deterministic data-path behavior, stable timing across rails/thermal. Hard-to-reproduce issues; inconsistent behavior between cold/warm states.

Figure D — Interposer-focused retimer decision map (icons, low text)

Choose by evidence + recoverability Redriver amplitude / ISI help Retimer CDR / jitter cleanup EQ help no CDR lock Lower power thermal friendly ? Limited evidence CDR lock jitter cleanup Telemetry counters Mgmt strap / I²C Key criteria: CDR lock + EQ + observability + symmetry + refclk dependency

The decision map is interposer-specific: stable recovery and diagnosable evidence are prioritized over best-case link performance.

H2-5 · Clocking: Refclk Distribution, Fanout, and Jitter Hygiene

What “clock trouble” looks like on an interposer

On dense high-speed interposer boards, refclk quality often determines whether link behavior is stable across ports, temperature, and rail variations. Symptoms that look like “random SI issues” frequently trace back to clock distribution or clock–power coupling.

  • Intermittent training / link re-initialization loops → marginal PLL/CDR lock and noisy refclk environment
  • BER bursts (errors cluster in time) → clock/rail noise injection creating short-lived jitter spikes
  • Warm-up downshift → reduced lock margin as temperature and rail impedance drift
  • Only some ports misbehave → fanout branch asymmetry or local coupling near a subset of retimers
Refclk role 1→N fanout Additive jitter Return path Noise isolation Acceptance workflow
Interposer priority: consistent clock hygiene across branches reduces “lottery ports” and improves recoverability.

Refclk distribution: topology and hygiene rules

Treat the refclk path as a small “clock tree” with a defined boundary: Refclk In → Fanout Buffer → Retimer references. Reliability improves when branch behavior is made symmetric (not just in length, but in coupling environment) and when noise sources are intentionally separated from clock-sensitive zones.

Clock tree structure
  • 1→N fanout buffer near the refclk boundary to control distribution consistently
  • Branch symmetry to keep port-to-port behavior aligned (routing + environment)
  • Clock domains separated only when boundaries are meaningful (independent groups)
  • Clear measurement points at input, fanout outputs, and near retimer reference pins
Jitter hygiene triggers
  • Keep a “sensitive zone” around fanout + retimer PLL supplies and refclk routes
  • Isolate switching edges from LED drivers and sideband pull-up current steps
  • Separate DC-DC activity (or its hot loops) from refclk return paths
  • Protect return continuity so refclk reference does not ride on ground bounce

Acceptance workflow: make margin measurable

The most useful acceptance plan is evidence-driven: measure refclk at defined points, correlate lock stability, and reproduce the worst-case operating envelope (cold start, thermal steady-state, and mild rail disturbance). The goal is to catch fragile lock margin before deployment.

Measure pointsRefclk In → Fanout Out → Near retimer reference
EvidencePLL/CDR lock status, lock flaps, per-lane error counters (if available)
EnvelopesCold boot, warm steady-state, airflow/fan curve changes, minor rail shifts
Pass signalStable lock without flaps; no correlated error bursts with rail/thermal events
Avoid “single snapshot” validation. A stable refclk tree is demonstrated by repeatability across temperature and power conditions.

Figure E — Clock tree on an interposer (fanout + hygiene zones + probe points)

Refclk Distribution + Jitter Hygiene Sensitive clock zone Noisy edge zone switching / sideband edges Refclk In boundary Fanout 1 → N Retimer A PLL ref Retimer B PLL ref Retimer C PLL ref Probe points IN / OUT / near-PLL DC-DC LED I²C Coupling Goal: stable lock + no error bursts correlated with rail/thermal events Rule: keep clock routes & returns out of switching edge zones

The diagram highlights the clock tree boundary and where noise typically couples in. Use fixed probe points and evidence signals (lock status / counters) to validate stability across thermal and rail envelopes.

H2-6 · Power Rails & Telemetry: PI Co-Design on a Dense High-Speed Board

Why “small power” can be high risk on a high-speed interposer

Even when the interposer is not a high-power board, its rails are unusually sensitive because they directly feed retimer PLL/CDR blocks and clock distribution devices. Rail noise and ground bounce can convert into jitter and margin loss, turning electrical issues into link instability.

Rail partitioning POR / reset gating V/I/T telemetry Alarms + debounce Minimal black box PI → Jitter → BER
Interposer framing: power integrity is a first-order input to clock hygiene and error behavior, not just a “power-on” requirement.

Rail partitioning and sequencing (interposer scope only)

Partition rails by sensitivity and by whether the domain is a noise source. Keep retimer PLL-related rails the cleanest, and treat LED/sideband domains as potential aggressors that require isolation and controlled edges.

Typical rail domains
  • Retimer core / I/O domains (lane logic and drivers)
  • Retimer PLL / CDR rail (highest jitter sensitivity)
  • Clock buffer rail (additive jitter and PSRR limits)
  • Level shifting / sideband rail (threshold stability)
  • EEPROM + sensors rail (FRU identity + monitoring)
  • LED rail (edge aggressor, isolate and filter)
Sequencing & reset gating rules
  • POR first: ensure rails reach stable range before releasing resets
  • Clock-ready gating: do not release retimer reset until refclk is stable
  • Brownout behavior: define a predictable recovery path after brief droops
  • Defaults are safe: strap / default configuration should avoid “half-alive” states
The most fragile failures are “almost stable” states: rails look nominal on average, but short droops or noise bursts collapse lock margin.

Telemetry loop: minimum useful signals for fast isolation

Telemetry is most valuable when it supports correlation: align link errors with rail and thermal events to decide whether the failure is channel-related or board-environment-related. Keep the telemetry set small but actionable to avoid noise and false positives.

Signal Why it matters Operational use
Voltage (V)
key rails
Detects droops and rail instability that reduce PLL/CDR lock margin and raise jitter. Correlate BER bursts with brief undervoltage events; validate POR gating.
Current (I)
per group
Captures load steps and abnormal draw that can amplify rail noise or thermal rise. Spot port-group anomalies and identify “hot” retimer clusters.
Temperature (T)
hotspot
Thermal drift reduces margin and can trigger downshifts or intermittent lock behavior. Explain warm-up failures; validate airflow assumptions and derating.
Alarms
UV/OV/OT
Provides high-signal events when paired with sensible thresholds and debounce. Reduce false positives while capturing real droop or over-temp incidents.
Event stamps
minimal log
Enables black-box reconstruction without full platform integration. Align “link went unstable” with rail/thermal alarm timeline for fast triage.
Keep telemetry interposer-scoped: deliver host-readable evidence without assuming a specific platform controller architecture.

PI → SI/clock coupling paths (what to protect)

The most common failure mechanism is conversion: rail noise or return-path disturbance becomes timing uncertainty. Protect the rails that feed clock and PLL blocks, and isolate domains that generate sharp edges.

  • Switching ripple → PLL rail noise → phase noise rises → UI margin shrinks → BER increases
  • Ground bounce / return discontinuity → reference shifts → lock margin collapses intermittently
  • Edge aggressors (LED / sideband) → coupling into sensitive zone → jitter spikes and clustered errors
Decoupling layersBulk + mid + high-frequency placement aligned to sensitive rails
Clean railsExtra filtering / LDO logic for PLL & clock domains when needed
IsolationKeep LED/sideband edges out of clock return paths and PLL zones
CorrelationMake V/I/T and alarms timestampable for fast root-cause isolation

Figure F — Power domains + telemetry map (host-readable, interposer scoped)

Power Domains + Telemetry Loop Rail domains (partition by sensitivity) Retimer Core / I/O logic + drivers Retimer PLL / CDR Rail highest sensitivity Clock Buffer Rail additive jitter Sideband / Level Shift threshold stability EEPROM + Sensors FRU + V/I/T LED Domain edge aggressor Telemetry aggregation Monitor / ADC V / I / T + alarms Host Readout I²C / SMBus Minimal Log event stamps Coupling: Rail noise → Jitter → BER

The map stays interposer-scoped: rails are partitioned by sensitivity, telemetry is aggregated into host-readable evidence, and minimal event stamps enable fast correlation without assuming a specific platform controller.

H2-7 · Thermal & Mechanics: Keeping Retimers Stable Across Load and Ambient

Why thermal stability is a first-order link requirement

On a ToR interposer, retimers often sit at the intersection of high-speed signal integrity, clock hygiene, and dense mechanics. Temperature rise can reduce EQ/PLL margin, shift coupling behavior, and amplify port-to-port variability. A good thermal plan is not “keeping it cool,” but keeping link behavior repeatable across ambient, load, and airflow conditions.

Hotspot ID Thermal path Derating Sensor placement Events + alarms Connector stress
Interposer scope: thermal strategy covers board-level heat paths, local airflow exposure, and actionable monitoring—without assuming chassis-level cooling control.

Heat sources and thermal paths (what to review)

Start with a heat map: identify retimer clusters, clock buffers, and any local switching regulators that can elevate the local temperature floor. Then review the full heat path from silicon to airflow so “good parts” do not become unstable due to poor conduction or contact.

Heat sources
  • Retimers: high power density; PLL/CDR margin can shrink with temperature
  • Clock buffers: additive jitter can worsen as local temperature rises
  • Local DC-DC: heat + switching activity can raise both thermal and noise risk
Thermal path chain
  • Die → copper: spread heat into planes (avoid tiny “islands”)
  • Copper → via array: provide vertical conduction to heat-spreading layers
  • Via → pad / sink: ensure stable contact pressure and material fit
  • Sink → airflow: confirm the zone actually sees effective airflow

Derating + monitoring: keep behavior repeatable

Thermal design should include a simple operational contract: what happens as temperature approaches limits, and how the board exposes evidence for quick isolation. Aim for “predictable reduction” rather than “random instability.”

Derating actionsDefine safe responses near limits (warn → protect) to avoid unstable oscillation
SensorsPlace near retimer hotspots, near clock-sensitive zones, and an edge ambient point
ThresholdsUse staged alarms with debounce to prevent false triggers from short spikes
Event stampsTimestamp thermal alarms and correlate with link drops / error bursts
Thermal evidence is most useful when it aligns with link evidence: temperature events + lock status + error behavior.

Mechanics that can quietly break link stability

Mechanical loading affects both thermal contact and electrical contact. Connector stress, cable pull, and board warp can change contact resistance and local heating, producing failure signatures that look like “mystery SI” or “random port issues.”

  • Connector stress: insertion force / cable torque can drift contact resistance over time
  • Board warp: thermal cycles and assembly tolerances can alter both connector and thermal-pad contact
  • Service boundary: replacement procedures should preserve thermal contact consistency (pad fit/pressure)
Interposer scope: document what is replaceable, what must remain paired (pads/sinks), and which checks validate post-service stability.

Figure G — Thermal zones on an interposer (hotspots + sensitive clock zone + airflow)

Thermal Zones + Airflow Exposure Interposer top view (zones) Hotspot cluster retimers Sensitive clock zone fanout + refclk Connector / I/O zone mechanical stress Retimer Retimer Retimer Clock Fanout Airflow Path: Die → Cu → Via → Sink → Air

Use zones to guide layout and service checks: protect clock-sensitive regions, ensure hotspot conduction is continuous, and validate airflow exposure over the retimer cluster.

H2-8 · Sideband Management: EEPROM/FRU, LEDs, GPIO, and Bus Robustness

Make “small” sideband devices operationally useful (and non-disruptive)

Sideband components define how an interposer is identified, monitored, and serviced. A robust plan keeps FRU data reliable, LED semantics consistent, and I²C/SMBus behavior recoverable—while preventing sideband edges from coupling into clock and high-speed regions.

EEPROM / FRU Write protect LED semantics Address plan Level domains Bus recovery
Interposer scope: expose host-readable identity and evidence. Avoid assuming any specific platform controller or management stack.

EEPROM / FRU content: keep it small, stable, and protected

FRU data is most effective when it is stable across revisions and supports quick compatibility checks. Store only what is operationally valuable, and keep write access controlled so field updates do not turn into “mystery incompatibility.”

Recommended FRU fields
  • Board ID + HW revision (stable identifier)
  • Serial / lot (traceability)
  • Port map profile ID (how lanes/ports are grouped)
  • Compatibility summary (simple matrix/hash)
  • Calibration summary (only if needed; keep as a compact record)
Write-protect strategy
  • Default read-only for critical identity fields
  • Explicit unlock path for updates (avoid accidental writes)
  • Fail-safe defaults to prevent “half-written” states after drops
  • Version discipline so host logic relies on stable fields
A small, stable FRU schema scales better than a large, frequently changing one—especially across multi-vendor service workflows.

LEDs, GPIO, and default states (service-friendly and quiet)

LEDs and GPIOs should be designed as an operational language: clear meanings, predictable defaults, and a low-noise implementation that does not inject unnecessary edges into sensitive zones.

  • LED semantics: Link / Activity / Fault / Temperature (consistent meanings across ports)
  • Quiet implementation: avoid aggressive edge rates on LED switching near clock-sensitive regions
  • GPIO / strap defaults: safe boot state; recoverable configuration without requiring a full power cycle
Treat LED and sideband edges as potential aggressors: keep them electrically and physically separated from refclk/PLL zones.

I²C/SMBus robustness: address planning, segmentation, and recovery

The most common field failure is not “missing a bus,” but a bus that becomes stuck during hot-plug, brownout, or marginal pull-up conditions. Robustness comes from disciplined address planning, sensible pull-up ownership, and a defined recovery mechanism.

Planning guidelines
  • Address plan: avoid collisions; reserve space for future devices
  • Level domains: define which rail owns pull-ups; avoid mixed-domain ambiguity
  • Segmentation: use buffering/muxing when bus length/device count grows
Recovery guidelines
  • Clock-pulse recovery to release stuck devices when SDA is held low
  • Reset hooks for sideband blocks that can lock the bus during drops
  • Debounced presence for hot-plug-like transitions to prevent chatter
A recoverable bus reduces the need for hard power cycles and turns “field mystery” into a repeatable diagnostic flow.

Sideband device map (interposer-scoped)

Keep a compact device map that ties each sideband device to its rail domain, default state, and the failure signature it can produce. This supports consistent bring-up and fast isolation during service.

Device Addr Rail domain Default state Failure signature
EEPROM / FRU 0x50 EEPROM/Sensor rail Read-only identity fields Read fails; wrong ID; partial write after brownout
Temp sensor 0x48 EEPROM/Sensor rail Continuous sampling Missing hotspot evidence; unstable thresholds
Power monitor 0x40 Monitor rail Alarms enabled (debounced) False alarms; missed droop events; no correlation
LED driver 0x60 LED rail Quiet default (no fast blink) Error bursts correlated with blink edges
GPIO / straps Sideband/Level rail Safe boot defaults Wrong config; hard-to-recover states
Bus buffer / mux 0x70 Sideband/Level rail Segment enabled Segment isolation missing; bus stuck propagates

Example addresses are placeholders. The key is the structure: device + domain + default + failure signature.

Figure H — Sideband map (bus + rails + recovery + keep-out)

Sideband Devices + Bus Robustness Sensitive zone clock / PLL I²C / SMBus trunk EEPROM / FRU rail: AUX Temp Sensor rail: AUX Power Monitor rail: MON LED Driver rail: LED GPIO / Straps safe defaults Bus Buffer/Mux segment Recovery clock pulses Edge noise Goal: identifiable + recoverable + quiet sidebands

Keep sideband edges away from clock-sensitive regions, segment the bus when needed, and ensure a defined recovery mechanism exists for “bus stuck” scenarios.

H2-9 · Bring-up & Debug: From Link Training to Field Fault Isolation

Turn “it won’t link” into a repeatable isolation path

Interposer failures usually look like training loops, port flaps, or intermittent error bursts. The fastest route is layered: validate rails/reset, then refclk lock, then training/EQ, then error behavior, and finally stress with temperature / vibration / service actions.

Rails / Reset Refclk Lock Training / EQ Error Counters Thermal / Mechanics Evidence + Timestamp
Interposer scope: focus on observable signals (locks/status/counters/flags) and minimal isolation actions (swap / de-rate / bypass if available).

Layer 0–1: rails/reset first, then refclk lock

Debug should never start with EQ. A marginal rail, a noisy reset edge, or a brownout event can create symptoms that mimic “bad SI.” Confirm power and reset evidence first, then verify clock lock stability before touching training parameters.

Rails / reset evidence
  • POR done and stable reset release
  • Brownout flag (sticky) and retry counters
  • Per-rail alarms (if monitored) tied to timestamps
Refclk / lock evidence
  • CDR/PLL lock + loss-of-lock sticky bit
  • Lock stability across temperature and load
  • Correlation: lock drops ↔ error bursts
A single “lock = 1” snapshot is not enough. Stability over time is the evidence that matters.

Layer 2–3: training/EQ, then error behavior and minimal isolation

Once rails and clock lock are stable, training and EQ become meaningful. Read a small set of status signals and counters, then run minimal isolation actions to decide whether the issue follows a port, a channel, or a replaceable item.

Key observables
  • EQ adapt status (converged / stuck / oscillating)
  • Training retries and time-to-lock behavior
  • Error counters (per port / per window if available)
  • Thermal flags and max-temp snapshots
Minimal isolation actions
  • Swap: cable/module/port to see what the problem follows
  • De-rate: reduce speed/load to test margin sensitivity
  • Bypass (if supported): isolate the retimer segment
  • Repeatability: reproduce with the same stimulus + timestamps
The goal is not “tuning until it works,” but finding the smallest evidence set that identifies the failing layer.

Field fault isolation: what evidence should be readable

Field issues become solvable when the interposer exposes compact, host-readable evidence that aligns with time. Keep a minimal “black-box” set so support can separate power, clock, training, thermal, and mechanical causes.

Sticky flagsbrownout, loss-of-lock, thermal alarms
Counterstraining retries, lock-drop count, error bursts
Snapshotsmax temp near retimers + a clock-zone sensor
Timestampsevent time aligned with port/slot identity
Evidence should be portable: readable by the host without requiring a specific management platform.

Figure I — Debug flow (layered decisions + chapter mapping)

Bring-up & Debug Flow (Interposer Scope) Symptom training fail / flaps / BER bursts Rails / Reset OK? POR, brownout flag, reset gating maps: H2-6 Refclk Lock Stable? CDR/PLL lock, loss-of-lock sticky maps: H2-5 Training / EQ Converges? EQ adapt status, retry count, time-to-lock maps: H2-3 / H2-4 BER Pattern Explained? error counters, burst timing, lane asymmetry maps: H2-3 / H2-4 Thermal / Mechanics? hotspot temp, vibration, connector stress maps: H2-7 Minimal Isolation Swap / De-rate / Bypass (if supported) maps: H2-9

Use layered decisions to prevent “tuning in the dark.” Each decision node maps to earlier chapters so evidence stays consistent from bring-up to field isolation.

H2-10 · Validation & Compliance: What to Measure and How to Prove Margin

Validation proves margin across corners—not just a single pass

A ToR interposer is validated when link behavior remains repeatable across temperature, supply variation, cable/modular combinations, and realistic airflow/load. The goal is a measurable margin story: how eye/BER/jitter change with corners, and what evidence confirms stability.

Eye / Bathtub BER trends Jitter / lock Interop matrix Temp corners EMI hotspots
This chapter uses executable test language without binding to any single standard’s numeric thresholds.

High-speed validation: measure behavior vs corners

Start with link-centric measurements that reveal margin shape, then sweep corners to expose sensitivity. The most useful outcomes are trends that stay stable and failure signatures that map cleanly back to power/clock/thermal chapters.

Eye / bathtubmargin shape + sensitivity to channel segments
BER vs temperatureshows thermal margin collapse or stability
BER vs voltagereveals rail noise / brownout sensitivity
BER vs cables/modulesfind worst-case combinations
When a failure appears, capture the aligned evidence: lock state + counters + temperature snapshot + timestamp.

Clock validation: test points + lock stability

Clock performance is validated by stable lock behavior and consistent jitter hygiene at key points. Measurements are most actionable when they identify where jitter is added and when lock stability weakens under temperature or load.

Where to check
  • Refclk input (source quality exposure)
  • Fanout outputs (additive jitter + distribution)
  • Retimer-side evidence (lock stability + drop counts)
What “good” looks like
  • Stable lock across temperature and workload
  • No burst correlation between lock events and BER bursts
  • Repeatability across ports and service actions
A single spectrum snapshot is less valuable than a stability story: lock behavior + evidence over time.

Interop + EMI risk points (keep it practical)

Interoperability is proven by sweeping realistic combinations and identifying the worst case. EMI handling here is limited to spotting risk hotspots and applying layout hygiene—full EMC theory belongs in the dedicated Safety & EMC page.

Interop matrix
  • Module/cable mix across representative suppliers and lengths
  • Worst-case focus instead of “everything everywhere”
  • Port consistency to avoid outlier channels
EMI hotspots (point-only)
  • Retimer cluster switching activity and reference clock regions
  • Clock fanout traces and return paths
  • Keep-out for noisy edges near clock-sensitive zones
Keep EMI discussion scoped: identify hotspots and layout intent, then link to the dedicated EMC page for theory and standards.

Validation checklist (engineering-executable)

Use this checklist format to document what was measured, why it matters, how to run it, what failure looks like, and which chapter it maps to. This supports design reviews and makes field regressions easier to isolate.

Test item Purpose Method (type) Fail signature Maps to
Eye / bathtub Show margin shape and sensitivity by segment Eye/bathtub capture at key link points Eye closure, asymmetry, unstable training H2-3 / H2-4
BER vs temp Expose thermal margin collapse Sweep temperature with steady traffic BER bursts after warm-up; port-to-port divergence H2-7 / H2-9
BER vs voltage Expose rail noise / droop sensitivity Sweep supply corners; monitor flags Brownout flags; training retries; random flaps H2-6 / H2-9
Lock stability Prove refclk hygiene and robust lock behavior Monitor lock/drop counters over time Loss-of-lock correlated with error bursts H2-5 / H2-9
Interop matrix Find worst-case combination and prove stability Module/cable mix sweep (focused set) Only certain combos fail; regression after service H2-2 / H2-10
Service robustness Prove post-replacement repeatability Repeat training/BER after re-seat/replace Connector-stress induced outliers H2-7 / H2-9
EMI hotspot check Identify radiators and protect clock-sensitive zones Spot-check around retimer/clock zones Coupled noise into refclk region; unstable lock H2-5 / H2-10

The checklist is intentionally standard-agnostic: it documents proof of margin and failure signatures that map back to design chapters.

Figure J — Validation loop (corners → measurements → evidence → actions)

Validation Loop (Proof of Margin) Corners Temp / Voltage Cables / Modules Airflow / Load Measurements Eye / Bathtub BER trends Jitter / Lock Evidence Stable training Low bursts Repeatability Actions (map to chapters) Tune EQ H2-4 Clock hygiene H2-5 PI cleanup H2-6 Thermal path H2-7 Output: repeatable margin story + mapped evidence

The validation loop ties corner sweeps to measurable trends and evidence. When something fails, the loop maps back to specific chapters for corrective actions.

H2-11 · Design Checklist & Parts Selection Pointers (Board-level)

This section is written as a review-and-RFQ checklist: what must be proven on an interposer board (retimers + refclk fanout + telemetry + LEDs/EEPROM) before build approval and before supplier commitment. MPNs below are examples to anchor sourcing conversations—final selection must match lane rate, package/thermal, management model, and availability.

RetimersPAM4 marginObservability

1) Retimer selection: ask questions that expose margin, not marketing

  • CDR / retiming depth: confirm where jitter is cleaned vs merely amplified (CDR lock behavior, wander tolerance, lock stability across temp/voltage).
  • Equalization range: CTLE/DFE/FFE capability must cover the worst segment on the interposer path (connector + vias + short trace).
  • Training & adaptability: how adaptation converges, whether adaptation can be frozen/rolled back, and whether per-lane/per-port symmetry can be guaranteed.
  • Telemetry: must have readable status/health (CDR lock, EQ state, error counters, thermal flags, brownout flags) and deterministic fault pins.
  • Latency & consistency: quantify typical and max latency; validate that lane-to-lane / port-to-port skew stays bounded after adaptation.
  • Reset & recovery: define “known-good defaults” (strap vs registers) and field recovery steps (soft reset vs hard reset vs re-train).
  • Power/thermal curve: power vs data rate and temperature; require de-rating guidance and thermal-mechanical integration notes.
MPN examples (retimer / PAM4 DSP families): BCM85361 (Broadcom 16-lane 112G SerDes retimer), BCM87850/BCM87854 (Broadcom multi-lane retimer PHY family), MV-CHA180C0C (Marvell Alaska A 800G PAM4 DSP retimer)
RefclkFanoutJitter hygiene

2) Clock fanout selection: control additive jitter and skew like a spec, not a hope

  • Output format & voltage: LVDS/LVPECL/HCSL must match retimer input; avoid format conversions unless a measured need exists.
  • Additive jitter: require additive jitter characterization and verify the budget chain: source → buffer → retimer sampling margin.
  • Output-to-output skew: specify phase matching requirements across N outputs; require test data over temperature.
  • Power sensitivity: demand guidance on supply noise sensitivity; decide whether critical rails need an LDO island.
  • Fail-safe behavior: what happens on input loss, brownout, or partial power—ensure downstream retimers never see ambiguous clocks.
  • Placement rules: “clock keep-out zones” around LEDs, buck inductors, and bus pullups; require a layout checklist from supplier.
MPN examples (refclk fanout / clock buffers): LMK1D2102 (TI LVDS clock buffer), 8P34S2102 (Renesas LVDS fanout buffer)
RailsNoise isolationReset

3) Power rails & reset: treat supplies as a signal-integrity input

  • Rail partitioning: separate “PLL/clock sensitive” rails from “digital/noisy” rails (LEDs, GPIO expanders, bus pullups).
  • Sequencing rules: define retimer + clock buffer POR, reset gating, and safe defaults (no undefined strap states during ramp).
  • Decoupling topology: multi-layer decap strategy (bulk + mid + high-frequency), and via arrays under retimers and clock parts.
  • Buck vs LDO logic: use buck for efficiency, then LDO/filter for sensitive rails where PSRR/jitter coupling matters.
  • Brownout observability: require brownout flagging and “last-known-good” logging hooks (at minimum: rail OK + time correlation).
Block What to require from supplier MPN examples
Low-noise LDO island Noise + PSRR data at relevant frequencies; stability with chosen output caps; thermal headroom. TPS7A94 (TI ultra-low-noise LDO)
Local buck Switching node containment guidance; spread-spectrum option; predictable soft-start; PG behavior. TPS62130 (TI 3A step-down converter)
Supervisors / reset IC Threshold accuracy; reset delay; manual reset; glitch immunity under noisy ramps. TPS3808 (TI voltage supervisor / reset IC)
TelemetryI²C/SMBusFault evidence

4) Telemetry that actually helps bring-up and field triage

  • Minimum telemetry set: per-retimer temperature (or hotspot proxy), at least one sensitive rail voltage, and a board-level “alive” heartbeat.
  • Alert strategy: define thresholds and alert pins; avoid “always-on noisy interrupts” that mask real faults.
  • Correlation: ensure events can be correlated with link drops (use latched flags and monotonic counters even if no full timestamping exists).
  • Read path boundary: telemetry must be readable by the host/management path without requiring deep platform assumptions.
MPN examples (power/rail monitors): INA238 (TI I²C power/current/voltage monitor), LTC2977 (ADI/Linear Tech PMBus power system manager)
EEPROM/FRULEDsBus robustness

5) Sideband devices: FRU/EEPROM, LEDs, GPIO—and keeping the bus alive

  • FRU content policy: store board ID, HW revision, compatibility table hash, and calibration constants (if any). Keep it compact and versioned.
  • Write protection: require a hardware write-protect pin or policy to prevent accidental fleet-wide corruption during bring-up.
  • LED noise containment: avoid fast edges near clock zones; prefer controlled-slew drivers; keep return paths local and separated from refclk.
  • I²C/SMBus survivability: address planning, pull-up budgeting, hot-plug behavior, and recovery from “stuck-low” lines.
  • GPIO/strap sanity: define default strap states with strong pulls; avoid floating during ramp; document a recovery mode.
Function Board-level selection pointers MPN examples
EEPROM / FRU 1 MHz bus option, endurance, write-protect, and temp grade aligned to interposer thermal zone. M24C02 family (ST I²C EEPROM)
Temperature sensor Place near retimer hotspot; verify self-heating; use alert pin if needed. TMP117 (TI high-precision digital temp sensor)
I²C hot-swap buffer Use when insertion/removal or brownouts can corrupt the backplane bus; verify idle/STOP behavior. PCA9511A (NXP hot-swappable I²C/SMBus buffer)
Differential I²C extender Use when sideband must cross noisy zones; validate common-mode tolerance and cabling rules. PCA9615 (NXP differential I²C buffer)
GPIO expander Consolidate presence/LOS/INT lines; require reset pin and deterministic power-up states. TCA9539 (TI 16-bit I²C I/O expander)
LED driver Choose I²C or shift-register based on bus budget; confirm dimming/blink features and thermal limits. PCA9632 (NXP I²C LED driver) / TLC5928 (TI multi-channel LED driver)
ESDLow-CSurvivability

6) Protection & survivability: keep sidebands safe without harming signal integrity

  • Protect what is exposed: sideband lines going to front-panel areas need ESD strategy; avoid adding capacitance on high-speed lanes unless explicitly qualified.
  • Low-capacitance discipline: for sensitive lines, demand capacitance data and layout guidance; keep stubs short and return paths tight.
  • Grounding intent: ensure ESD return has a controlled path that does not inject noise into refclk or retimer ground references.
MPN examples (ESD / TVS): PESD5V0S1UL (Nexperia ESD protection diode), RClamp0524P family (Semtech low-capacitance TVS arrays; use current-generation alternates where applicable)
DeliverablesBring-up repeatabilityFactory readiness

7) What must be delivered with the board (so production and field teams can win)

  • Register cookbook: a versioned configuration template per retimer SKU (defaults, recommended EQ profiles, reset/retrain procedure).
  • “Known-good” test recipe: the exact validation steps that proved margin (what points were measured, what conditions were swept).
  • Telemetry map: addresses, rails, sensors, and meaning of each alert/flag with a “symptom → next check” mapping back to chapters.
  • Revision control: FRU schema and compatibility policy—what changes are allowed without breaking interoperability.

Figure K — Parts map: what typically lives on a ToR interposer (and why)

A board-level map that keeps scope tight: retimers, refclk fanout, rails, sideband devices, and protection. Text is minimized; blocks represent procurement and review “zones”.

Figure K · Interposer parts & zones (retimers / clock / rails / sidebands)
ToR interposer board parts map: retimers, refclk fanout, rails, EEPROM/LEDs, and telemetry Block diagram showing connector side, retimer cluster, refclk fanout buffer, power islands, sideband bus devices, and ESD boundary. ToR Interposer — Board-Level Parts Map (Scope: Retimer + Clock + Rails + Sidebands) Host / Mainboard Edge PAM4 lanes + refclk + sideband Connector / Mezz Front Panel / Cage Segments after re-timing Cage / I/O Retimer Cluster CDR / EQ / counters / lock flags Retimer A PAM4 lanes Retimer B PAM4 lanes Retimer C PAM4 lanes Retimer D PAM4 lanes Link segment 1 Link segment 2 Refclk Fanout Zone Clock Buffer / Fanout Low jitter + low skew Power & Reset Island Buck + LDO (noise islands) Reset / POR / PG gates Sideband & Telemetry Zone EEPROM / FRU LED Driver I²C Buffer / Ext Rail / Temp Monitor Sideband bus ESD boundary focus: sidebands near front panel
Practical usage: use this map as an internal review template—each rectangle corresponds to a supplier “evidence package” (datasheets + characterization + layout notes + bring-up procedure), not just a BOM line item.

Request a Quote

Accepted Formats

pdf, csv, xls, xlsx, zip

Attachment

Drag & drop files here or use the button below.

H2-12 · FAQs (Troubleshooting + Validation + Operability)

Each answer focuses on interposer-visible evidence (lock flags, EQ/training status, counters, rails, temperature) and minimal experiments that isolate whether the root cause is channel margin, retimer configuration, refclk hygiene, power integrity, or sideband robustness.

1Why can the same cable/module downshift more often after changing the interposer revision?

Interposer revisions often change the effective margin through different retimer defaults (EQ, adaptation rules), clock fanout additive jitter/skew, rail noise/decoupling, or connector/via geometry. A controlled A/B test should align the retimer configuration template, then compare training retries, lock stability, and BER trends across temperature and voltage corners using identical ports, modules, and cables.

Maps to: H2-3 (budget) · H2-4 (retimer choices) · H2-10 (validation corners)
2When link training repeatedly fails, which retimer indicators should be checked first?

Start with power/reset sanity (PG, brownout flags, reset deasserted), then verify CDR/PLL lock and any loss-of-lock sticky events. Next, check adaptation/training state (converged vs stuck/oscillating), retry counters, and whether error counters rise before stable lock. If failures vary run-to-run, freeze a known-good EQ profile and re-test at a de-rated speed to separate configuration from physical margin.

Maps to: H2-9 (bring-up flow) · H2-4 (observability & management)
3BER worsens at higher temperature—how to tell clock trouble from insufficient EQ margin?

Clock-related issues often appear as multi-port correlation (several links degrade together), intermittent loss-of-lock, or strong sensitivity to rail noise and nearby switching activity. Pure channel/EQ margin problems are more lane/port-specific, improve with de-rate, and become repeatable when EQ is frozen. Use a thermal step test while logging lock flags, error counters, temperature near the retimer hotspot, and key rail readings to classify the failure pattern.

Maps to: H2-7 (thermal) · H2-5 (refclk hygiene) · H2-10 (corner validation)
4When is a redriver enough, and when is a CDR-based retimer required?

A redriver can be sufficient when the dominant impairment is insertion loss on a short, well-behaved channel and training is stable with acceptable jitter tolerance. A CDR-based retimer becomes necessary when long/connector-heavy channels must be segmented, jitter must be cleaned rather than reshaped, or training and BER are unstable across temperature and module combinations. Decide using a segmented budget and symptom-driven evidence, not nominal data rate alone.

Maps to: H2-4 (redriver vs retimer) · H2-3 (channel segmentation method)
5After refclk fanout, “lock looks OK” but rare errors still appear—what are the usual causes?

Common culprits include output-to-output skew mismatch, additive jitter that grows with supply noise, refclk format/termination mismatch, and local coupling from LED drivers or buck switch nodes into the clock keep-out zone. Confirm by probing fanout outputs at defined test points, watching for loss-of-lock events, temporarily forcing LEDs to steady state, and comparing error rates with an LDO-isolated clock rail versus the baseline supply.

Maps to: H2-5 (clock tree) · H2-9 (fault isolation steps)
6I²C/SMBus frequently hangs in a noisy environment—how can the interposer recover autonomously?

Implement bus survivability: segment the bus with hot-swap buffers, add a defined bus-clear method (SCL toggling + STOP generation), provide dedicated resets for sideband devices, and size pull-ups for rise time and sink budget. Detect “stuck-low” events using simple line-level checks and latched fault flags. Place series damping resistors near the interposer boundary and avoid running sideband returns through clock-sensitive zones.

Maps to: H2-8 (sideband robustness)
7Can LED blinking impact high-speed links? How to prevent coupling via layout and power?

Yes—LED current pulses can inject ground bounce and rail ripple that modulates refclk/PLL-sensitive supplies or couples into nearby lanes. Prevent coupling by isolating LED power (separate rail island or filtered branch), controlling slew rate and blink patterns, keeping LED returns local, and enforcing a clock keep-out zone free of switching currents. Diagnose by forcing LEDs to constant state and checking whether BER spikes disappear without any retimer setting changes.

Maps to: H2-8 (LED strategy) · H2-6 (rail isolation) · H2-5 (clock keep-out)
8How can total channel loss be split into measurable, actionable segments?

Define segments with explicit measurement boundaries: mainboard trace segment, connector stack, interposer segment, and cage/front-panel segment. For each segment, characterize insertion loss and reflection hotspots, and mark major discontinuities (connector transitions, via fields, stubs). Build a simple contribution chart per segment rather than a single “total” number. This segmentation turns an uncontrollable long link into shorter, controllable links and directly guides retimer placement and EQ expectations.

Maps to: H2-3 (budget segmentation)
9What is the smallest experiment to separate connector/routing issues from retimer configuration issues?

Use a minimal isolation set: swap modules/cables across ports and observe whether the failure follows a physical channel or follows settings. De-rate speed to see whether stability returns immediately (margin-limited behavior). Fix the retimer to a known-good EQ profile (freeze adaptation) and retest for repeatability; highly repeatable failures usually point to the physical channel. If the design supports it, compare a bypass path or a reduced-lane configuration to localize the problematic segment.

Maps to: H2-9 (debug flow) · H2-3 (segment thinking)
10Which validation corners must be covered to claim real margin?

Cover corners that stress the interposer’s weakest links: temperature extremes, rail min/max (including ripple scenarios), worst-case cable lengths, and module/vendor combinations that maximize loss and reflections. Prove stability with repeatable lock behavior, consistent training time, and BER/eye trends that do not collapse at corners. Log evidence (sticky flags, counters, temperature/rail snapshots) so failures are diagnosable, not anecdotal, and ensure the checklist can be rerun on new revisions.

Maps to: H2-10 (validation checklist)
11With only telemetry (temperature, rails, lock), how to quickly judge power-noise vs thermal root cause?

Power-noise issues typically correlate with rail dips/brownout flags, error bursts during load steps, and failures that appear even at moderate temperature. Thermal issues track hotspot temperature and improve with airflow or de-rate, often without brownout evidence. Clock-related problems usually show loss-of-lock events or correlated multi-port degradation. Correlate the time series: lock events, error counter slopes, rail readings, and temperature ramps. The pattern is often more diagnostic than any single snapshot.

Maps to: H2-6 (rails/telemetry) · H2-7 (thermal) · H2-9 (field isolation)
12What should FRU/EEPROM store to expose compatibility and lot-related issues quickly?

Store a compact, versioned identity record: board part number, hardware revision, serial/lot code, key BOM identifiers (retimer/clock families), configuration profile ID and checksum, and a compatibility matrix hash (or allowed-module class summary). Add a schema version and hardware write-protect policy to prevent accidental corruption. This enables operations to detect mismatched revisions, apply the correct configuration template, quarantine a suspect lot, and avoid repeated “mystery downshift” incidents.

Maps to: H2-8 (FRU devices) · H2-11 (delivery artifacts)
Figure L — FAQ symptom map: which signals point to which chapter
FAQ symptom map for ToR interposer board troubleshooting Block diagram mapping common symptoms to evidence signals and the relevant chapters (budget, retimer, clock, rails, thermal, sideband, debug, validation, checklist). FAQ Symptom Map (Interposer-only evidence) Symptoms Downshift / speed drop Training fails / retries BER spikes (thermal) I²C hangs Rare errors (clock/power) Evidence (readable) Lock / LOL flags EQ / training state Error counters Rails / brownout Temp (hotspot) Chapters to use H2-3 Budget H2-4 Retimer H2-5 Clock H2-6 Rails H2-7 Thermal H2-8 Sideband H2-9 / H2-10