
ADC Synchronization: Multi-Card Timebase & Trigger Alignment


Synchronization is not "sharing a clock": it is making skew, drift, and repeatability measurable and controllable across boards. This page turns multi-card alignment into a budget, a bring-up recipe, and verification tests, so that ownership is clear and results stay deterministic across reboots and temperature changes.


What this page solves

Goal: determine in 30 seconds whether the issue is a synchronization problem, and obtain a repeatable architecture + bring-up + verification deliverable.

Typical symptoms (observable and falsifiable)

  • Stitch mismatch: waveforms from Card A/B/C do not merge cleanly; seam steps, echoes, or periodic discontinuities appear.
  • Coherence loss over time: alignment looks correct at start, then relative phase slowly shifts during long capture (hours/minutes scale).
  • Trigger-to-sample mismatch: the same trigger event lands on different sample indices across cards; event markers disagree.
  • Non-repeatable after reboot: a reset/re-lock changes relative alignment; calibration or stitching breaks after power cycle.

Synchronization failures are rarely fixed by “sharing a clock” alone. A working system requires three controlled objects end-to-end: Clock (timebase), Align reference (common start state), and Trigger (event timing) — plus proof by measurement.

Deliverables provided on this page (project-ready)

  1. Architecture chooser: choose star/tree/daisy and select which signals must be distributed (Clock / SYSREF-or-FSYNC / Trigger) based on the alignment target.
  2. Budget template: allocate Skew, Drift, Repeatability to owners (fanout, routing, connector/cable, receiver, digital path).
  3. Bring-up recipe: a fixed order (lock → align → verify → log) to make alignment reproducible across boots and re-locks.
  4. Acceptance test plan: measurable pass/fail criteria + required logs (P50/P90/P99 statistics, conditions, and evidence captures).

60-second self-check (Yes/No)

Q1: Does the system require multi-card stitching, coherent combining, or phase-sensitive processing? If Yes → sample/frame coherence matters.

Q2: Does a reboot or link re-lock change the alignment outcome? If Yes → repeatability and deterministic latency are not guaranteed.

Q3: Does the alignment drift with temperature or long capture time? If Yes → drift budget + thermal symmetry must be validated.

Q4: Are triggers shared but sample indices disagree across cards? If Yes → trigger distribution is not equivalent to sample alignment.

Scope guardrail: this page focuses on alignment architecture, repeatable bring-up, and verification. Jitter-to-SNR math, SERDES eye tuning, and protocol register walkthroughs are intentionally out of scope here to avoid cross-page overlap.

Figure: Problem → consequence (multi-card timing). Cards A/B/C exhibit skew, drift, and non-repeatability, leading to beamforming loss, multi-phase error, and stitch artifacts.

Definition: synchronization targets (measurable and verifiable)

“Synchronization” is not a single property. It is a set of acceptance targets that must hold simultaneously for Clock, Align reference, and Trigger. This section defines the three core metrics and the minimum evidence required to claim pass/fail.

Required output format: (Budget) + (Measurement method) + (Conditions) + (P50/P90/P99) + (Logs).

The three acceptance metrics (each must have pass/fail)

1) Static skew (fixed offset)

  • Definition: channel-to-channel time offset under steady-state conditions.
  • Units: ps/ns or samples (use samples when the system consumes sample indices; use time when mixing sample rates).
  • How to measure: common stimulus + cross-correlation; or common tone phase difference converted to time; or per-card marker capture to sample index.
  • Pass/Fail template: P99(|Δt|) ≤ Skew_budget under defined conditions.
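
The cross-correlation method above can be sketched in a few lines. Everything here is a hypothetical illustration: the `estimate_skew_samples` helper, the simulated 3-sample lag, and the 100 MSPS rate are assumptions, not part of any fixed procedure.

```python
import numpy as np

def estimate_skew_samples(ref: np.ndarray, other: np.ndarray) -> int:
    """Integer-sample skew of `other` relative to `ref` via full
    cross-correlation; a positive result means `other` lags `ref`."""
    corr = np.correlate(other, ref, mode="full")
    # Zero lag sits at index len(ref)-1 in 'full' mode.
    return int(np.argmax(corr)) - (len(ref) - 1)

# Simulated common stimulus: card B captures it 3 samples late.
rng = np.random.default_rng(0)
ref = rng.standard_normal(256)
other = np.concatenate([np.zeros(3), ref[:-3]])

fs = 100e6                          # assumed sample rate, 100 MSPS
lag = estimate_skew_samples(ref, other)
skew_seconds = lag / fs             # convert samples to time
print(lag, skew_seconds)            # → 3 3e-08
```

Run this per board pair against the same stimulus, then check P99 of the resulting |Δt| values against Skew_budget.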

2) Drift (relative change over time/temperature)

  • Definition: change in relative alignment after initial lock/alignment.
  • Two required windows: long capture (time window) and temperature step (thermal window).
  • How to measure: repeated correlation/phase checks at fixed intervals; report slope and worst-case excursion over the window.
  • Pass/Fail template: max(|Δt(t) − Δt(t0)|) ≤ Drift_budget over the specified time + temperature profile.
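
The pass/fail template max(|Δt(t) − Δt(t0)|) ≤ Drift_budget reduces to a one-line computation over the logged checks. The `max_drift_excursion` helper and the Δt values below are hypothetical.

```python
def max_drift_excursion(deltas_ps):
    """Worst-case excursion of relative alignment from the first
    post-lock measurement: max(|Δt(t) − Δt(t0)|)."""
    t0 = deltas_ps[0]
    return max(abs(d - t0) for d in deltas_ps)

# Hypothetical Δt log (ps): skew re-measured at fixed intervals
# during a long-capture soak over the declared window.
checks_ps = [120.0, 121.5, 124.0, 131.0, 128.5]
excursion = max_drift_excursion(checks_ps)
drift_budget_ps = 15.0
print(excursion, excursion <= drift_budget_ps)   # → 11.0 True
```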

3) Repeatability (reboot / re-lock return-to-state)

  • Definition: alignment distribution after reset, power-cycle, or re-lock (the system must return to the same alignment state).
  • Why it is critical: poor repeatability invalidates “one-time calibration,” stitching offsets, and long-term comparability.
  • How to measure: N cycles (power cycle or forced re-lock) → measure skew each cycle → report histogram statistics.
  • Pass/Fail template: P99(|Δt| across cycles) ≤ Repeat_budget with a fixed bring-up order.
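
The per-cycle histogram statistics can be produced directly with percentile math. The `repeatability_stats` helper, the 10-cycle skew log, and the budget value are illustrative assumptions.

```python
import numpy as np

def repeatability_stats(cycle_skews_ps):
    """P50/P90/P99 of |Δt| across N reboot/re-lock cycles."""
    mag = np.abs(np.asarray(cycle_skews_ps, dtype=float))
    p50, p90, p99 = np.percentile(mag, [50, 90, 99])
    return {"N": len(cycle_skews_ps), "P50": p50, "P90": p90, "P99": p99}

# Hypothetical skew (ps) measured after each of 10 power cycles,
# using the same fixed bring-up order every time.
cycle_skews = [-12, 3, 5, -7, 9, 1, -2, 6, -4, 8]
stats = repeatability_stats(cycle_skews)
repeat_budget_ps = 15.0
print(stats, stats["P99"] <= repeat_budget_ps)
```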

Minimum evidence required (to avoid “seems aligned”)

  • Conditions: temperature points, duration, airflow state, and the exact bring-up order used.
  • Logs: PLL/lock state, align state (armed/latched), trigger marker captures, link-ready state, and any re-sync events.
  • Statistics: P50/P90/P99 for each metric; report worst-case and cycle count N for repeatability.

Acceptance table (copy into a project spec)

Metric | Pass/Fail | Evidence
Static skew | P99(|Δt|) ≤ Skew_budget | Correlation/phase report + raw captures
Drift | Max excursion ≤ Drift_budget (time + thermal windows) | Timestamped checks + temperature profile
Repeatability | P99(|Δt| across N cycles) ≤ Repeat_budget | Bring-up logs + cycle histogram
Figure: Objects × metrics matrix. Rows Clock, Align reference, Trigger; columns Skew, Drift, Repeatability; all converging to aligned data.

Requirement spec: decide “what must align”

Synchronization architecture is dictated by the alignment target. Lock the target first, then design the clock/align/trigger distribution to meet it with measurable pass/fail.

Step 1 — Pick exactly one target type

A) Sample-level phase coherence

  • What must align: the sampling instant across cards/channels, continuously (phase-coherent over time).
  • What it enables: coherent combining, beamforming, multi-channel phase measurements, stitched wide-aperture capture.
  • What it forbids: nondeterministic latency (elastic buffers that change depth across boots) and uncontrolled re-lock phase states.
  • Non-negotiables: shared timebase + deterministic alignment reference + deterministic data path.

B) Frame-level alignment

  • What must align: frame or cycle boundaries (start-of-frame sample index is identical across cards).
  • What it enables: multi-phase control sampling, periodic processing windows, deterministic segmentation.
  • What can vary: phase inside the frame may be calibrated, as long as boundary and index mapping are deterministic.
  • Non-negotiables: a common frame marker (FSYNC/SYSREF/epoch) + deterministic mapping to sample indices.

C) Event-level trigger coherence

  • What must align: the same event is captured at the same time (or time-tag) across cards, within a defined tolerance.
  • What it enables: time-correlated measurement, distributed capture, episodic acquisition.
  • What is optional: continuous phase coherence is not required; periodic re-sync is often acceptable.
  • Non-negotiables: trigger distribution + a shared time reference (or a well-defined timestamp conversion).

Step 2 — Convert system constraints into hard requirements

Hard requirement fields (copy into a spec)

Field | Why it changes the architecture | Minimum to declare "done"
Distance & topology | Propagation mismatch and thermal sensitivity grow with cables/backplanes; star/tree selection becomes mandatory. | Skew budget met at the farthest node.
Across chassis? | Ground potential, connector variability, and environmental drift dominate; measurement points must move outward. | Drift budget met over the defined window.
Thermal envelope | Relative drift is a temperature function; symmetric routing and controlled airflow become part of timing design. | Worst-case excursion reported with a temperature profile.
Re-sync policy | Allowing periodic re-sync changes the drift requirement into a windowed requirement. | Window length and downtime defined and verified.
Link determinism | Elastic buffers and training can change end-to-end latency; repeatability may fail even with a shared clock. | N-cycle reboot/re-lock histogram meets P99 target.
Acceptance method | Phase-based, correlation-based, and marker-based methods are not interchangeable; pick one that matches the target type. | Evidence bundle: conditions + logs + P50/P90/P99.

Step 3 — Declare the spec in one paragraph (freeze scope)

Target type is Sample / Frame / Event. The acceptance metrics are Skew, Drift, and Repeatability, reported in time and samples. The topology spans [distance/topology] and [single/dual chassis]. The thermal envelope is [range/profile]. Re-sync is [allowed/not allowed] with [period/window]. Link determinism is [required/not required]. Verification must provide P50/P90/P99 statistics and the defined log bundle.

Figure: Target selection tree. Goal → "what aligns?" → Sample (phase, deterministic; needs Clock + Align, no variable FIFO), Frame (boundary, resettable; needs FSYNC, fixed index), or Event (time-tag, re-sync OK; needs Trigger + timestamp).

System timing model: where uncertainty enters

A synchronization problem becomes solvable only after the end-to-end timing chain is decomposed into segments. Each segment must be assigned an owner and mapped to one dominant uncertainty type: Mismatch (static skew), Repeatability (re-lock spread), Routing (drift), or Determinism (digital latency states).

Rule of thumb: fixed offset → investigate mismatch; time-dependent shift → investigate drift; reboot-dependent shift → investigate repeatability/determinism.

Segment map (assign responsibility)

1) Ref → Cleaner/PLL

  • Primary risk: Repeatability (power-up phase state), Drift (frequency/phase wander across temperature).
  • Typical failure signature: alignment changes after re-lock even if routing is unchanged.
  • Evidence: lock state logs + repeated lock cycles with measured skew distribution.

2) Fanout → Board ingress

  • Primary risk: Mismatch (channel delay differences), Routing (cable/backplane thermal drift).
  • Typical failure signature: a stable but non-zero offset that varies with cable swaps or temperature.
  • Evidence: edge-to-edge measurements at the far nodes + drift checks over the declared window.

3) Card receiver → ADC sampling instant

  • Primary risk: Repeatability (local PLL/CDR state), Mismatch (fixed internal delays).
  • Typical failure signature: ingress edges look aligned, but correlation/phase at ADC outputs is not.
  • Evidence: common stimulus phase/correlation check translated into time/samples.

4) ADC → FPGA capture → Data marker/time-tag

  • Primary risk: Determinism (FIFO depth changes, CDC timing), Repeatability (training/align state machines).
  • Typical failure signature: alignment differs across boots even with identical analog timing and routing.
  • Evidence: marker index consistency + FIFO/align state logs across N cycles.

Turn the chain into budgets (time + samples)

  • Total static skew = Σ(mismatch terms) across fanout, cables/backplane, routing, receiver, and fixed digital offsets.
  • Total drift = worst-case excursion of relative alignment over the specified time + temperature profile.
  • Total repeatability spread = distribution of alignment states across reboot/re-lock/re-train cycles (must be reported as P99).
  • Samples conversion: samples = time × fs (report both units when the consumer is sample index based).
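
The budget arithmetic above is trivial but worth encoding so both units are always reported together. The per-segment allocations and the 250 MSPS rate below are hypothetical placeholders.

```python
def total_static_skew_ps(mismatch_terms_ps):
    """Total static skew = Σ(mismatch terms) across the chain."""
    return sum(mismatch_terms_ps)

def time_to_samples(delta_t_s, fs_hz):
    """samples = time × fs; report both units for index-based consumers."""
    return delta_t_s * fs_hz

# Hypothetical per-segment allocations (ps): fanout, cable/backplane,
# PCB routing, receiver, fixed digital offset.
segments = {"fanout": 40.0, "cable": 120.0, "pcb": 25.0,
            "receiver": 30.0, "digital": 60.0}
total_ps = total_static_skew_ps(segments.values())
fs = 250e6                      # assumed 250 MSPS converter
total_samples = time_to_samples(total_ps * 1e-12, fs)
print(total_ps, total_samples)  # total in ps and in samples
```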

Minimum measurement loop (do not skip)

  1. Lock → align → capture in a fixed bring-up order; record lock/alignment states and any training counters.
  2. Measure skew at the output domain that matches the target (stimulus correlation/phase for sample-level; marker index for frame/event).
  3. Measure drift by repeating the same check over time and across the temperature profile; report worst-case excursion.
  4. Measure repeatability using N cold/warm boots and forced re-locks; report a histogram and P50/P90/P99.
Figure: End-to-end timing chain. Ref (repeatability) → PLL (repeatability) → Fanout (mismatch) → Ingress (routing) → Rx (repeatability) → ADC (mismatch) → FPGA (determinism) → Marker (determinism), ending in an aligned sample index and an epoch time tag.

Timebase generation: choose the master reference

A master reference is not selected for “cleanliness” alone. It must keep the synchronization state controllable through boot, re-lock, reference loss, and redundancy switching so that Skew, Drift, and Repeatability remain verifiable.

What the reference is responsible for (synchronization view)

  • Provide a distributable timebase that can be converted into clock + alignment signals for all nodes.
  • Keep phase behavior testable across power cycles and re-lock events (repeatability is a requirement, not a hope).
  • Define failure modes for reference loss and recovery (holdover entry/exit must be observable and policy-driven).

Key reference fields (only what affects synchronization)

Spec fields checklist

Field | Why it matters | Impacted metric
Frequency / outputs | Determines how the timebase is generated and distributed (direct clock vs synthesized). | Skew / Drift
Phase determinism | Controls whether alignment returns to the same state after boot/re-lock. | Repeatability
Stability (windowed) | Defines relative phase drift over the system's measurement window and temperature profile. | Drift
Warm-up behavior | Sets when it is meaningful to claim "aligned"; early thermal transients can look like drift failures. | Drift / Repeatability
Distributability | Drive/format/links must survive routing, connectors, and scaling without changing the timing state. | Skew / Repeatability
Holdover capability | After reference loss, the timebase must either remain bounded (windowed drift) or force a controlled re-sync state. | Drift / Repeatability
Observability | Lock/holdover/alarm signals are required to align policy with measured evidence. | All (verification)

Redundancy and switching (what must be verified)

Single reference

  • Benefit: one state machine; repeatability is easier to control and test.
  • Risk: reference loss forces a mode transition (holdover or re-sync).
  • Verification: reference-loss injection must be part of acceptance testing.

Redundant reference (main/backup)

  • Benefit: improves uptime.
  • New risk: switching can break phase continuity; “still running” is not equal to “still aligned.”
  • Verification: after a switch, alignment must remain within repeatability limits or a controlled re-sync policy must trigger with a clear epoch marker.

Holdover is a mode, not a hope

When the reference is lost, the system must enter a declared holdover mode with observable state. The spec must define a holdover window and a re-sync policy for exit. Any data captured across a mode boundary must be epoch-marked to prevent silent stitching failures.
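
Epoch marking at mode boundaries can be enforced with a tiny piece of bookkeeping. The `EpochTracker` class and mode names below are an illustrative sketch, not a prescribed implementation.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

@dataclass
class EpochTracker:
    """Increment an epoch marker on every timing-mode transition so
    data captured across a boundary is never silently stitched."""
    epoch_id: int = 0
    mode: str = "locked"
    log: List[Tuple[int, str]] = field(default_factory=list)

    def transition(self, new_mode: str) -> int:
        if new_mode != self.mode:          # real change, not a repeat
            self.mode = new_mode
            self.epoch_id += 1
            self.log.append((self.epoch_id, new_mode))
        return self.epoch_id

trk = EpochTracker()
trk.transition("holdover")     # ref lost: enter holdover mode
trk.transition("holdover")     # no change: same epoch
trk.transition("locked")       # re-sync policy exit: new epoch
print(trk.epoch_id, trk.log)   # two epochs after one loss/recovery
```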

Figure: Ref / holdover / re-sync. A reference source feeds a cleaner PLL with holdover, then a fanout to Cards A/B/C; a side state chain shows ref lost → holdover mode → re-sync policy → epoch marker.

Distribution topology: star vs daisy vs tree

Topology determines how skew can be controlled, how the system scales, and how quickly root causes can be isolated. Choose a topology that matches the target type and the verification plan.

Compare using the same three lenses

  • Skew control: can delay mismatch be budgeted and corrected at the endpoints?
  • Scale: does adding nodes grow the budget linearly and remain testable?
  • Debug: can failures be localized without dismantling the system?

Star

  • Skew control: best; each node has a directly budgeted path from the center.
  • Scale: limited by fanout resources and cabling/ports.
  • Debug: strongest isolation; a failing branch is contained.

Daisy chain

  • Skew control: weakest at scale; delay accumulates and end nodes inherit all upstream variability.
  • Scale: wiring-efficient, but the last node is hardest to keep within budget.
  • Debug: cascading failures; middle nodes are critical risk points.

Tree

  • Skew control: strong if budgets are defined per level (trunk vs leaf).
  • Scale: balanced; grows by branches while keeping measurement points structured.
  • Debug: good when each tier has observability; failures localize by tier.

Connectors and distance are repeatability risk points

Any pluggable point can change delay and drift characteristics across cycles. Place connectors away from the highest-sensitivity segments, and require reboot/re-plug repeatability testing as part of acceptance, not as an afterthought.

Figure: Star vs Daisy vs Tree. Three topologies compared on skew control, scale, and debug: star (hub to A/B/C/D), daisy chain (N1 → N2 → N3), and tree (root → branches B1/B2 → leaves L1..L4).

Alignment reference: SYSREF / FSYNC / frame markers

A shared clock advances time, but an alignment reference defines the zero point. It collapses random alignment states into a deterministic boundary that can be verified and repeated.

Why an alignment reference is needed

  • Defines a boundary: phase zero, frame start, or epoch marker shared by all nodes.
  • Forces a discrete state: internal alignment logic converges to a repeatable latch point instead of drifting across random boot/training states.
  • Makes repeatability measurable: the same bring-up sequence should land on the same latched index/phase distribution.

Alignment strategies (single-shot vs periodic)

Single-shot alignment

  • Goal: lock the zero point once and keep data continuous.
  • Works when: drift remains inside the acceptance window for the entire capture interval.
  • Acceptance focus: reboot/re-lock repeatability (P99) and stable mapping to sample indices.
  • Failure mode: slow drift accumulates and silently breaks stitching if not monitored.

Periodic alignment

  • Goal: re-anchor the boundary to control drift across long time or harsh temperature profiles.
  • Works when: the system can tolerate epoch boundaries or aligns only inside a declared window.
  • Acceptance focus: epoch markers must be explicit; alignment events must not be silent.
  • Failure mode: periodic events can inject discontinuities unless windows and markers are enforced.

Alignment window and failure handling

  • Window open: all nodes are ready to observe the reference and latch a common boundary.
  • Latch state: the alignment state machine locks an index/phase and freezes the state.
  • Window close: a timeout ends the attempt and activates the defined failure policy.
  • Failure policy: fail-fast (data invalid), degrade (mode switch with marking), or re-try (bounded retries with logs).

Minimum acceptance evidence (protocol-agnostic)

  1. Repeatability: N boots/re-locks produce a P99 distribution of latched phase/index inside the requirement.
  2. Deterministic completion: align_done occurs within a declared time range; align_fail provides a reason code.
  3. Epoch correctness: periodic alignment produces explicit epoch/frame markers so data stitching is policy-driven.
Figure: Clock tree + Align tree. The clock and the alignment reference (SYSREF) are fanned out in parallel to Boards A/B/C; each board's align state machine feeds the ADC/FPGA and reports done plus epoch/frame markers.

Trigger & gating: coherent start/stop events

A trigger is an event system, not just a wire. Coherent triggering requires controlled arrival skew, bounded edge uncertainty, and consistent gate width, with measurable evidence per board.

Acceptance metrics (how to claim “coherent”)

Trigger acceptance checklist

  • Arrival skew: time difference of the event at each board measurement point (report P99 in time and samples).
  • Edge uncertainty: repeated captures of the same trigger edge to bound jitter (measurement method, not protocol fields).
  • Gate consistency: start and stop boundaries produce the same effective window length across nodes.

Trigger coherence does not guarantee sample-phase coherence

A coherent trigger aligns an event boundary. It does not guarantee that sampling instants are phase-aligned across boards. Sample-level coherence still requires a shared timebase and a deterministic alignment reference.

Distribution and measurement points

  • Star trigger tree: simplifies delay matching and keeps skew budget visible at the endpoints.
  • Standardize a measurement point: define where the system “consumes” the event (board ingress vs FPGA capture are not equivalent).
  • Convert the event into data: produce an event index or timestamp per board for direct comparison.

Minimum evidence bundle

  1. Edge capture logs: per-board arrival timestamp or sample index, plus gate start/stop markers.
  2. Statistics: P50/P90/P99 of arrival skew and window length differences.
  3. Traceability: each event is tied to a unique event index for cross-board correlation.
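
Comparing per-board event indices for one event index reduces to pairwise deltas. The `arrival_skew_stats` helper, the board indices, and the 125 MSPS rate are hypothetical.

```python
from itertools import combinations

def arrival_skew_stats(event_indices, fs_hz):
    """Worst pairwise arrival skew of one trigger event across boards,
    from per-board sample indices, in samples and in time."""
    deltas = [abs(a - b)
              for a, b in combinations(event_indices.values(), 2)]
    worst = max(deltas)
    return {"worst_samples": worst, "worst_seconds": worst / fs_hz}

# Hypothetical per-board sample index for the same event (#42).
event_42 = {"A": 10_000, "B": 10_002, "C": 9_999}
stats = arrival_skew_stats(event_42, fs_hz=125e6)
print(stats)   # worst pair is B vs C: 3 samples
```

Accumulating these per-event results over many triggers yields the P50/P90/P99 arrival-skew statistics the evidence bundle requires.
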
Figure: Trigger tree + measurement points. A trigger source fans out (star) to Boards A/B/C; each board captures the edge and emits a marker, and per-board event indices or timestamps are compared to measure skew.

Deterministic latency: keep digital paths repeatable

Clocks can be aligned while data still refuses to stitch. The root cause is variable digital latency. Synchronization requires a declared alignment point and a repeatable digital state across boot, reset, and re-lock.

Define the alignment point first

  • Sample index: the strict endpoint for sample-level coherence (most sensitive to variable latency).
  • Frame marker: the boundary for frame-aligned systems where phase inside a frame may be secondary.
  • Event index / time tag: the endpoint for coherent triggering and timestamped acquisition.

Where non-determinism comes from (symptoms and acceptance impact)

Elastic buffers and multi-stage pipelines

  • Symptom: the same marker lands at different indices run-to-run.
  • Acceptance risk: repeatability fails even if static skew looks fine.
  • Required observability: buffer level, slip/realign counters, alignment status.

Reset and align sequencing differences

  • Symptom: a late-ready node shifts the effective latch point, changing the end-to-end delay.
  • Acceptance risk: alignment becomes dependent on timing accidents.
  • Required observability: align window open/close times, align_done timing, reason codes.

Lane deskew and clock-domain crossing (CDC)

  • Symptom: sample boundaries move after retrain or re-lock while the link stays “up.”
  • Acceptance risk: cross-board delta shifts across boots or temperature changes.
  • Required observability: deskew lock, FIFO pointers, retrain/realign events tied to epochs.

Synchronization controls (digital determinism)

  • Fence the variable latency zone: declare which blocks are allowed to vary and keep that variation bounded relative to the alignment point.
  • Standardize a reset + align state machine: clock stable → link stable → align window → latch → data valid, with bounded retries and explicit epoch changes.
  • Prove repeatability with a known marker: use a training sequence or marker to validate that each run lands on the same index distribution.

Minimum acceptance claims (protocol-agnostic)

  1. Run-to-run index repeatability: the marker’s latched sample/frame index stays within the requirement distribution (report P99).
  2. Cross-board delta repeatability: board-to-board index differences remain bounded across reboots and relocks.
  3. Mode boundary traceability: retrain/reset/realign events always produce an epoch marker and a log entry.
Figure: Deterministic latency map. ADC → deserializer → elastic buffer → alignment block → DMA; the elastic buffer is highlighted as the variable latency zone, and the alignment block is flagged as the alignment point (sample index).

Skew & drift budgeting: allocate error to owners

Engineering-grade synchronization turns “alignment” into a budget sheet. Each segment must have a measurement method, an acceptance threshold, and a clear owner who can change it.

Budget template: static skew (who owns each contributor)

Static skew budget sheet (copy template)

Segment | Contribution | Measurement point | Method | Threshold | Owner
Fanout / buffers | Channel mismatch, channel-to-channel delay | Fanout outputs at endpoints | Relative delay compare | Pass/Fail | HW
Cable / connector | Insert/remove variability, length mismatch | Ingress at each board | Swap/replug repeatability | Pass/Fail | SYS
PCB routing | Trace length mismatch, impedance transitions | Near receiver pins | Layout review + delta test | Pass/Fail | PCB
Ingress / capture | Receiver threshold and edge capture differences | Capture point (consumption) | Common stimulus compare | Pass/Fail | HW
Digital path | Elastic buffer slips, deskew/CDC variability | Alignment point (index) | Marker index distribution | Pass/Fail | FW

Budget template: drift (windowed and testable)

Drift budget sheet (copy template)

Segment | Driver | Measurement point | Method | Threshold | Owner
Timebase chain | Warm-up, aging | Relative phase vs window | Windowed drift test | Pass/Fail | HW
Thermal gradient | Airflow changes | Board-to-board delta | Heat/flow stimulus | Pass/Fail | SYS
Supply sensitivity | Rail steps, noise | Relative timing delta | Step response drift | Pass/Fail | HW
Digital state | Retrain events | Alignment index / epoch | Epoch-tagged drift | Pass/Fail | FW
Figure: Budget bar (no numbers). The total skew budget decomposes into stacked segments, fanout, cable, PCB, ingress, and digital, each tagged with its owner (HW / SYS / PCB / HW / FW): measure and own.

Bring-up sequence: make alignment reproducible

A stable clock is not enough. Reproducible alignment requires a fixed state machine: each step must pass a gate before the next step is allowed, and every recovery must be traceable by logs and epochs.

Recommended sequence (do not skip steps)

Bring-up recipe (gated)

  1. Ref stable → ref present, stable timer satisfied, holdover state explicit.
  2. PLL lock → lock asserted within bounds; lock failure returns a reason code.
  3. Link ready → link up and stable; retrain/slip counters are bounded and observable.
  4. Align ref → alignment window opens; align observed; align_done latched; epoch updated.
  5. Trigger verify → arrival skew and gate width deltas stay inside the acceptance window.
  6. Record calibration → snapshot alignment state, indices, counters, and versions for replay.
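
The gated sequence above can be expressed as a fixed-order runner that stops at the first failing gate, which keeps failures traceable. The `bring_up` function and the lambda stand-ins for hardware checks are a sketch; real gates would poll lock/ready status registers.

```python
def bring_up(gates):
    """Run named gates in a fixed order; stop at the first failure
    and report which gate failed so recovery is traceable."""
    passed = []
    for name, check in gates:
        if not check():
            return {"ok": False, "failed_gate": name, "passed": passed}
        passed.append(name)
    return {"ok": True, "failed_gate": None, "passed": passed}

gates = [
    ("ref_stable",     lambda: True),
    ("pll_lock",       lambda: True),
    ("link_ready",     lambda: True),
    ("align_ref",      lambda: False),   # simulate an alignment failure
    ("trigger_verify", lambda: True),
    ("cal_snapshot",   lambda: True),
]
result = bring_up(gates)
print(result["failed_gate"])   # → align_ref (later gates never run)
```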

Why skipping steps breaks repeatability

  • Skipping Ref stable: warm-up drift appears as “mysterious” alignment change after a seemingly good bring-up.
  • Skipping Link ready: deskew/CDC/buffer states can move after alignment, shifting sample boundaries.
  • Skipping Align ref: a trigger can align events but cannot guarantee a deterministic sample or frame boundary.

Re-sync policy (periodic re-align vs full-chain reset)

Periodic re-align is allowed only when

  • Epoch markers exist: boundary changes are explicit and policy-driven (no silent switching).
  • Windows exist: re-align runs inside declared windows (frame gaps, gated intervals, or safe epochs).
  • Fast verification exists: a quick marker/trigger check validates the new state immediately.

Full-chain reset is required when

  • PLL unlock / holdover uncertainty: phase continuity cannot be claimed.
  • Link retrain or deskew changes: sample boundary may move (variable latency zone changed).
  • Trigger out-of-window persists: bounded retries fail to restore acceptance.
  • Repeatability fails: run-to-run distributions exceed thresholds or become multi-modal.

Minimal logs to make bring-up auditable

  • Timestamps: ref_stable, pll_lock, link_ready, align_done, trigger_ok.
  • State: pll_lock, holdover, link_up, retrain_count, deskew_status, fifo_ptr, slip_count.
  • Markers: epoch_id, align_fail_reason, trigger_skew_stats, gate_width_stats.
  • Versions: firmware/bitstream/config snapshot ID for reproducibility.
Figure: Bring-up flow (gated). Ref stable → PLL lock → Link ready → Align ref → Trigger verify → Cal snapshot, with failure branches: lock fail → recovery (re-lock), align fail → recovery (full reset), trigger out-of-window → recovery (re-verify).

Verification methods: measure skew, drift, repeatability

Verification must produce a report that survives hardware swaps and firmware updates. Each test case needs a fixed fixture, a defined measurement point, a statistic (P99), and a pass/fail threshold.

Test case TC0: readiness (make results meaningful)

  • Claim: the system is in a measurable state before skew/drift tests start.
  • Checks: pll_lock=1, link_up=1, retrain/slip counters bounded, epoch explicit.
  • Fail action: return to the bring-up gates until TC0 passes.

Test case TC1: static skew (two practical methods)

Method A: cross-correlation

  • Fixture: generator → splitter → A/B/C inputs.
  • Process: correlate waveforms to find Δsamples; convert to time.
  • Acceptance: report P50/P90/P99 and a board-to-board delta matrix.

Method B: sine phase difference

  • Fixture: same generator/splitter setup.
  • Process: measure Δφ and convert to Δt using the stimulus frequency.
  • Acceptance: use P99; verify results at multiple frequencies if needed.
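
Method B's conversion is Δt = Δφ / (2π·f). The numbers below (a 0.9° phase difference at a 10 MHz stimulus tone) are purely illustrative.

```python
import math

def phase_to_time(delta_phi_rad, f_stim_hz):
    """Convert a measured phase difference to time: Δt = Δφ / (2π·f)."""
    return delta_phi_rad / (2.0 * math.pi * f_stim_hz)

# Hypothetical: 0.9 degrees of phase at a 10 MHz stimulus.
delta_phi = math.radians(0.9)
dt = phase_to_time(delta_phi, 10e6)
print(dt * 1e12, "ps")   # 0.9/360 of a 100 ns period, i.e. 250 ps
```

Note that Δφ is ambiguous beyond one stimulus period, which is one reason to verify at multiple frequencies as the acceptance note says.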

Test case TC2: drift (windowed and stimulus-driven)

  • Stimulus: temperature chamber, hot-air, airflow change, or load step that creates thermal gradients.
  • Capture: measure skew in windows (repeated short captures) spread across the full duration of the declared time window.
  • Acceptance: drift per window stays under threshold; log epoch changes and retrain events.

Test case TC3: repeatability (N reboots / relocks)

  • Procedure: repeat bring-up N times and measure skew each time at the same alignment point.
  • Statistics: min/mean/max plus P99; plot a histogram of run-to-run deltas.
  • Interpretation: multi-modal histograms indicate multiple discrete digital states (non-determinism).
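
A quick multi-modality check is to count the distinct latched-index states across runs. The `discrete_states` helper and the 12-run delta log are hypothetical.

```python
import numpy as np

def discrete_states(index_deltas):
    """Histogram of distinct marker-index deltas across runs; more
    than one state indicates discrete digital modes (non-determinism)."""
    values, counts = np.unique(np.asarray(index_deltas),
                               return_counts=True)
    return dict(zip(values.tolist(), counts.tolist()))

# Hypothetical run-to-run marker-index deltas over 12 bring-ups.
deltas = [0, 0, 0, 4, 0, 0, 4, 0, 0, 0, 4, 0]
states = discrete_states(deltas)
multimodal = len(states) > 1
print(states, multimodal)   # two discrete states → investigate FW path
```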

Report format: synchronization acceptance sheet (copy template)

Acceptance table

Test | Setup | Metric | Statistic | Threshold | Result | Notes
TC0 | Bring-up gated | Readiness | Pass/Fail | Required | PASS | epoch_id logged
TC1 | Gen + splitter | Static skew | P99 | Define | PASS | matrix saved
TC2 | Thermal stimulus | Drift | Window Δ | Define | PASS | epochs tracked
TC3 | N boots/relocks | Repeatability | Histogram | Define | PASS | multi-modal check
[Figure: verification fixture and analysis chain — a signal generator and splitter feed Boards A, B, and C; the trigger marker is also fanned out; outputs are analyzed by cross-correlation, phase difference, and histogram methods.]

Troubleshooting: isolate ownership quickly

Fixing synchronization starts with fast ownership isolation. Each branch below uses a minimum test to push the issue into one segment (PCB / HW / FW / SYS) before deeper debugging begins.

Tree A: large skew but stable (fixed delay)

  • Symptom: Δt is large, but barely changes within a time/temperature window.
  • Likely causes: routing mismatch, fanout channel mismatch, ingress capture differences.
  • Minimum tests: swap cable/port paths; compare results against the same stimulus and measurement point.
  • Ownership: if the error follows the cable/port → SYS/HW; if it follows the board location → PCB; if it follows capture settings → HW/FW.

Tree B: drift is obvious (time/temperature sensitivity)

  • Symptom: Δt grows or wanders with time, temperature, airflow, or load.
  • Likely causes: thermal gradient, PLL/holdover behavior, supply sensitivity.
  • Minimum tests: apply a controlled heat/airflow stimulus; observe windowed drift while tracking epochs and relock events.
  • Ownership: tied to airflow/layout → SYS/PCB; tied to lock/holdover → HW; tied to rail steps → HW/SYS.

Tree C: non-repeatable across boots/relocks (digital state)

  • Symptom: alignment looks good in one run, but shifts after reboot/relock/reset.
  • Likely causes: variable latency zones (buffers/deskew/CDC), unstable align sequencing, silent retrain events.
  • Minimum tests: run N bring-ups with the same recipe; build a histogram of marker index deltas; check for multi-modal states.
  • Ownership: multi-modal index states → FW; retrain/deskew involvement → FW/HW; step ordering dependence → FW/SYS.

Fast isolation rules (minimum tests that save time)

  • Swap test: if the error follows a cable/port, the cause lives in the swapped segment.
  • Window test: stable in a window implies fixed delay; changing across windows implies drift.
  • Epoch test: any silent state change without an epoch marker is a design bug (traceability failure).
  • Histogram test: multi-modal distributions indicate multiple discrete digital states.
[Figure: troubleshooting decision tree with three entry points — large stable skew, obvious drift, non-repeatable after boot — each branch running quick checks (swap cable/port; heat step + lock check; N boots + histogram) that end in a fix/owner bucket (PCB/HW/SYS, SYS/HW, FW/HW).]

Engineering checklist: design, bring-up, acceptance

This one-page checklist compresses the full synchronization workflow into four phases. Each item is written as an action that can be checked off during a real project.

Phase checklist (copy and paste)

Spec

  • Pick target type: sample / frame / event.
  • Define pass/fail thresholds (P99).
  • Declare temperature and airflow envelope.
  • Decide re-sync policy and allowed windows.

Design

  • Choose topology: star / tree / daisy.
  • Declare alignment point (index/marker).
  • Fence variable latency zones.
  • Build skew + drift budget sheets with owners.

Bring-up

  • Run gated bring-up (no skipping).
  • Log lock/link/align/epoch fields.
  • Define bounded retries and recovery.
  • Capture calibration snapshot after verify.

Verify

  • TC0 readiness before measurements.
  • TC1 static skew with P99 matrix.
  • TC2 drift with windowed stimulus.
  • TC3 repeatability with histogram check.
[Figure: one-page engineering checklist — four quadrants (Spec, Design, Bring-up, Verify) with short action labels matching the lists above.]

Applications: why synchronization must be done this way

Each application below is written as a three-line capsule: requirement → target type → common architecture. The focus is on failure modes and acceptance, not protocol fields.

Multi-card DAQ (wideband stitching)

Requirement: stitched waveforms must not jump at boundaries.

Target type: Frame-level (upgrade to Sample-level for coherent analysis).

Common architecture: shared master ref + star/tree distribution + explicit align marker + deterministic data alignment point.

  • Stitch artifacts usually come from sample-index shifts, not random noise.
  • Acceptance must track marker index deltas (P99), not only average skew.

Phased array / beamforming (phase coherent)

Requirement: relative phase must remain coherent across channels.

Target type: Sample-level phase coherence.

Common architecture: low-drift master ref + star distribution + align marker policy (single-shot/periodic) + epoch-based traceability.

  • Trigger coherence alone aligns events, not sample phase.
  • Drift is the silent killer; acceptance must include long-window statistics.

Multi-phase power measurement (phase & gating)

Requirement: phase relationships must be consistent for PF, harmonics, and transient analysis.

Target type: Frame-level (upgrade to Sample-level for transient debug).

Common architecture: synchronized sampling + trigger tree with bounded skew + explicit alignment point at sample index or frame marker.

  • Gate width mismatch creates hidden measurement-window offsets.
  • Acceptance must define a shared measurement point and a P99 phase/skew limit.

Distributed monitoring (event-first)

Requirement: the same event must align in time-tags for correlation and root cause analysis.

Target type: Event-level trigger coherence.

Common architecture: trigger/marker distribution + epoch policy + reproducible bring-up + explicit event index/time-tag definition.

  • Event alignment can be strong without paying for sample-level coherence.
  • Acceptance must include repeatability after reboots and relocks.
[Figure: applications mapped to synchronization target types and topologies — DAQ stitching, beamforming, multi-phase, and distributed monitoring connect to Sample/Frame/Event targets and then to Star/Tree/Daisy topologies.]

IC selection logic: fields → risk mapping → RFQ template

Synchronization failures are often procurement failures: the wrong questions were asked. This section turns synchronization requirements into device fields, maps missing fields to system risks, and provides a copy-ready RFQ template.

A) Parameter fields checklist (with example part numbers)

Field table (ask these, or risks stay unowned)

Block | Must-have fields | Sync risk covered | How to verify | Example part numbers
Master reference | Frequency options, warm-up behavior, stability, distribution capability, holdover statement (if applicable) | Drift, repeatability during loss/recovery | Windowed drift test + reboot/relock histogram | Crystek CCHD-957; SiTime SiT5711; SiTime SiT5503
PLL / jitter cleaner | Lock time + lock status, loss-of-lock behavior, output format, SYSREF/align output options, phase continuity policy | Drift, repeatability, deterministic alignment enablement | Lock/relock cycling + epoch tracking + alignment marker consistency | TI LMK04832; TI LMK04821; Renesas 8V19N490-24; Renesas 8T49N240
Fanout / buffer | Output-to-output skew spec, additive jitter, channel enable behavior, supply sensitivity, input threshold requirements | Static skew ownership, drift sensitivity | TC1 skew matrix + swap cable/port isolation test | TI LMK1D210xL; TI CDCLVD110A; TI LMK00304; Microchip ZL40234; ADI ADCLK905/907/925
Trigger / gating | Edge integrity, output skew, enable/gate behavior, deterministic propagation, input-output timing consistency | Event skew and repeatability, window alignment | Trigger skew P99 + gate width delta P99 | TI LMK1C1104; TI LMK1C110x
Receiver / input conditioning | Input thresholds, hysteresis (if relevant), propagation consistency, line termination guidance, ESD tolerance | Event capture stability, repeatability across cabling | Trigger marker capture A/B/C + reboot repeatability | TI SN65LVDS2
ADC (sync ecosystem examples) | Multi-device synchronization support, align marker handling, lane/link behavior, reset determinism statements | Deterministic latency and repeatability across devices | Marker index consistency across N bring-ups | TI ADS54J54; ADI AD9234; TI ADC12DL3200
Programmable delay (optional) | Step size, range, monotonicity, temperature stability, power-up state | Residual skew trim, event path tuning | Sweep delay codes while monitoring skew window | Renesas 8S89296

Note: example part numbers are provided as starting points for category search and RFQ structuring; final selection must match frequency plan, I/O levels, channel counts, temperature range, and supply constraints.

B) Risk mapping: missing fields → system failures

Risk mapping table

Missing / unclear field | Likely failure | Acceptance symptom | Owner hint
Holdover / loss-of-lock behavior | Epoch jumps after reference loss or relock | Repeatability histogram becomes wide or multi-modal | Clock HW + system policy
Output-to-output skew spec (fanout/buffer) | Static skew cannot be owned or budgeted | TC1 skew matrix fails; swap tests point to distribution | HW distribution
Align marker handling / epoch definition | Silent state changes without traceability | Data aligns sometimes, then shifts with no logged reason | FW state machine
Link retrain / deskew determinism | Deterministic latency cannot be guaranteed | Alignment point drifts after reset/retrain events | FW + HW link
Trigger propagation + gating behavior | Event skew / gate mismatch breaks cross-card correlation | Trigger verify fails (P99), or window alignment slips | SYS + HW distribution

C) RFQ template (copy-ready)

Paste the following into an email to vendors or distributors:

Subject: RFQ – Multi-card synchronization (clock + align + trigger)

1) System targets (pass/fail)
- Static skew (P99): ________ (ps/ns or samples)
- Drift (windowed): ________ (samples/hour or ppm/°C equivalent)
- Repeatability (N reboots/relocks, P99): ________
- Target type: Sample / Frame / Event
- Deterministic latency required: Yes / No
- Allowed periodic re-align: Yes / No (allowed windows: ________)

2) Topology and environment
- Topology: Star / Tree / Daisy
- Cable/connector count per path: ________
- Cross-chassis: Yes / No   Distance: ________
- Temperature range / airflow constraints: ________

3) Device questions (must answer with datasheet references)
A) Master reference / oscillator
- Warm-up behavior, stability, and any holdover statement:
- Recommended distribution conditions:

B) PLL / jitter cleaner
- Lock time, lock status pins/flags, loss-of-lock behavior:
- Phase continuity policy on relock:
- Align marker support (SYSREF/FSYNC/marker generation and routing):

C) Fanout / buffer
- Output-to-output skew specification:
- Additive jitter specification:
- Channel enable behavior and power-up defaults:

D) Trigger / gating distribution
- Propagation consistency and skew:
- Enable/gate behavior and deterministic timing:

E) Receiver / input conditioning (if long lines)
- Input threshold requirements, termination guidance:

4) Verification artifacts requested
- Provide a recommended bring-up sequence (gated steps).
- Provide how to verify skew/drift/repeatability (minimum tests).
- Provide a list of example parts that match the above constraints, with availability lead times.
[Figure: selection flow — P99 thresholds lead to device fields (reference/PLL, fanout/trigger, receiver/ADC), fields map to risks, and risks map to verification tests TC0–TC3.]


FAQ: synchronization, deterministic latency, and verification

Each answer uses the same data-structured format: Decision → Acceptance → Minimum test → Common traps → Next action.

Do I need SYSREF if I already share the sampling clock?

Decision: SYSREF (or an equivalent alignment marker) is needed whenever deterministic latency and repeatability after power cycles are requirements; a shared sampling clock alone does not guarantee that.

  • Acceptance: after N power cycles/relocks, the marker index delta between boards is single-peak and bounded (P99).
  • Minimum test: run a gated bring-up recipe N times; record marker/sample index per board; plot a histogram of index deltas.
  • Common traps: shared clock ≠ aligned start; “looks aligned once” ≠ deterministic; silent retrain events break repeatability.
  • Next action: see Alignment reference and Deterministic latency.
How do I prove deterministic latency after power cycles?

Decision: deterministic latency is proven by repeatable alignment at a declared alignment point across resets, not by a one-time alignment snapshot.

  • Acceptance: alignment-point index is stable after N cycles (P99 bounded) and shows a single mode (no multi-modal states).
  • Minimum test: lock → link-ready → align marker → verify trigger → record marker index; repeat N times and histogram the deltas.
  • Common traps: changing the step order creates different states; elastic buffers may add variable latency; lane deskew can shift boundaries.
  • Next action: see Bring-up sequence and Verification methods.
What skew is acceptable for beamforming / multi-phase measurements?

Decision: acceptable skew depends on the target type. Beamforming typically requires sample-level coherence; multi-phase power often tolerates frame-level alignment unless transient analysis is required.

  • Acceptance: define a P99 bound on relative timing error (skew) and, if required, a P99 bound on drift slope across windows.
  • Minimum test: inject the same stimulus into all channels; estimate Δt via cross-correlation or phase difference; repeat across time/temperature windows.
  • Common traps: quoting a single “good” number without P99; ignoring drift in long runs; using trigger alignment as a proxy for sample alignment.
  • Next action: see Requirement spec and Skew & drift budgeting.
How to separate drift from jitter when debugging alignment?

Decision: jitter is short-window scatter; drift is window-to-window mean movement. The debug method is to freeze one and observe the other with controlled windows.

  • Acceptance: drift is expressed as slope (samples/hour or equivalent) across windows; jitter is expressed as scatter within a window.
  • Minimum test: compute Δt in short windows (scatter) and long windows (mean shift); repeat under a temperature step and correlate with lock/epoch logs.
  • Common traps: mixing time scales; treating re-lock jumps as drift; changing bring-up steps between windows.
  • Next action: see System timing model and Verification methods.
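The freeze-one-observe-the-other method can be expressed directly: within-window standard deviation estimates jitter, and the slope of window-to-window means estimates drift. The Δt series below is synthetic (5 ps RMS jitter riding on a 2 ps-per-window ramp), standing in for real windowed captures:

```python
import numpy as np

rng = np.random.default_rng(0)
n_windows, per_window = 20, 10

# Synthetic Δt measurements (ps): jitter is short-window scatter,
# drift is the slow movement of the per-window mean.
drift = 2.0 * np.repeat(np.arange(n_windows), per_window)
dt = drift + rng.normal(0.0, 5.0, n_windows * per_window)

win = dt.reshape(n_windows, per_window)
jitter_est = win.std(axis=1).mean()                  # within-window scatter -> jitter
drift_est = np.polyfit(np.arange(n_windows),
                       win.mean(axis=1), 1)[0]       # mean movement -> drift slope
print(f"jitter ~ {jitter_est:.1f} ps, drift ~ {drift_est:.1f} ps/window")
```

A relock jump shows up as a step in the window means rather than a smooth slope, which is why the minimum test correlates windows with lock/epoch logs.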
Star vs daisy chain: when does daisy become unmanageable?

Decision: daisy chains become unmanageable when repeatability and skew ownership cannot be closed with a budget and minimum tests across the full chain length and connectors.

  • Acceptance: P99 skew and repeatability remain within limits as node count and connector count increase.
  • Minimum test: segment the chain; measure skew per segment; verify whether error accumulates linearly with hop count; repeat after replug/reboot.
  • Common traps: only testing one end-to-end path once; ignoring connector-induced repeatability loss; assuming equal delay per hop.
  • Next action: see Distribution topology and Troubleshooting.
What logs should firmware capture for sync bring-up?

Decision: firmware logs must support traceability: every alignment state must be explainable by a recorded epoch and state transitions.

  • Acceptance: any alignment shift must have a corresponding logged cause (lock loss, retrain, re-align, reset step).
  • Minimum log set: epoch ID; lock status + timestamps; link ready + retrain counters; align state; marker index; config hash; error counters.
  • Common traps: logging too late (after failure); missing epoch identifiers; not logging config deltas between runs.
  • Next action: see Bring-up sequence and Deterministic latency.
How often should periodic re-sync run (if allowed)?

Decision: periodic re-sync frequency is derived from the drift budget and the maximum allowed misalignment before crossing the acceptance limit.

  • Acceptance: between re-sync events, drift never exceeds the defined P99 limit within the allowed operating envelope.
  • Minimum test: measure drift slope under worst-case thermal/airflow windows; compute time-to-limit; set re-sync period with safety margin.
  • Common traps: running re-sync without allowed windows; ignoring re-sync side effects (data gaps, state churn); using fixed periods without slope measurement.
  • Next action: see Skew & drift budgeting and Bring-up sequence.
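The derivation is simple division plus a safety margin. All numbers below are placeholders; substitute the worst-case slope measured in TC2 and the acceptance limit from the requirement spec:

```python
# Re-sync period = time-to-limit under worst-case drift, scaled by a margin.
drift_slope = 0.8   # samples/hour, worst-case windowed slope (measured, placeholder)
limit = 2.0         # samples, maximum allowed misalignment (P99 budget, placeholder)
margin = 0.5        # keep 50% headroom before the budget is spent

time_to_limit_h = limit / drift_slope       # hours until the limit is crossed
resync_period_h = time_to_limit_h * margin  # scheduled re-sync interval
```

With these placeholder values the budget is spent in 2.5 h, so re-sync would run every 1.25 h, restricted to the allowed windows.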
Why does alignment pass but fail after a temperature step?

Decision: a temperature step can expose drift, thermal gradients, and relock state changes that do not appear in a short room-temperature validation.

  • Acceptance: after the step and settling window, skew/drift returns within limits and repeatability remains single-peak.
  • Minimum test: apply a controlled temperature or airflow step; log lock/epoch; track Δt over windows to separate slope drift from discrete jumps.
  • Common traps: measuring only immediately after the step; missing lock-loss events; assuming drift is uniform across boards (it is often gradient-driven).
  • Next action: see System timing model and Verification methods.
How to validate trigger skew without special hardware?

Decision: trigger skew can be validated by relative edge capture using existing FPGA counters or timestamped markers; dedicated timing hardware is not required for most acceptance checks.

  • Acceptance: event skew and repeatability (P99) fall within limits across reboots and cable re-plugs.
  • Minimum test: distribute the same trigger to all boards; capture edge time-tags; compute pairwise deltas; repeat N times and build a P99 table.
  • Common traps: measuring only one edge; ignoring gate width mismatch; forgetting to test after replug/reboot.
  • Next action: see Trigger & gating and Verification methods.
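The pairwise-delta computation behind the P99 table is a few lines. The time-tags below are synthetic, standing in for FPGA counter captures of the same fanned-out trigger, with a ~0.8 ns fixed offset planted on Board B for illustration:

```python
import numpy as np
from itertools import combinations

rng = np.random.default_rng(1)
n_runs = 200
# Synthetic edge time-tags (ns) per board over n_runs trigger repeats.
tags = {
    "A": rng.normal(100.0, 0.05, n_runs),
    "B": rng.normal(100.8, 0.05, n_runs),   # ~0.8 ns fixed offset vs A
    "C": rng.normal(100.1, 0.05, n_runs),
}

p99 = {}
for x, y in combinations(tags, 2):
    d = np.abs(tags[x] - tags[y])            # pairwise trigger skew per repeat
    p99[f"{x}-{y}"] = np.percentile(d, 99)
    print(f"{x}-{y}: P50 = {np.percentile(d, 50):.2f} ns, "
          f"P99 = {p99[f'{x}-{y}']:.2f} ns")
```

Repeat the whole capture after each reboot and cable re-plug so the P99 table also exposes repeatability loss, not just static skew.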
What is the fastest way to isolate “clock vs digital path” ownership?

Decision: use minimum isolation tests: swap to localize fixed delay, window to classify drift, and histogram to expose discrete digital states.

  • Acceptance: the chosen segment shows consistent ownership signals (follows swap, follows temperature window, or shows multi-modal states).
  • Minimum test: (1) swap cable/port and re-measure skew; (2) apply a temperature window and observe drift slope; (3) run N boots and histogram marker index deltas.
  • Common traps: debugging without a declared alignment point; mixing root causes across segments; changing firmware config between test runs.
  • Next action: see Troubleshooting and Deterministic latency.