Clock & Sync: SYSREF/LMFC Alignment and Phase-Coherent DACs
← Back to:Digital-to-Analog Converters (DACs)
Phase-coherent DAC systems are built by locking a repeatable alignment state (SYSREF/LMFC) and a deterministic event-to-output path—not by “sharing a clock” alone. This page shows how to distribute clocks and triggers, isolate jitter without breaking repeatability, and prove the result with simple, production-ready tests.
What “phase-coherent DAC” really means (and what it does not)
A multi-channel DAC system often looks “synchronous” because channels share a clock, the digital link is up, and register writes update together. However, phase coherence and deterministic latency are stricter and must be proven as repeatable across power cycles, link rebuilds, and re-alignment events.
The most common failure pattern is: shared REFCLK + link up + simultaneous update — yet the phase relationship changes after reboot. Typical break points are LMFC boundary not anchored, elastic buffer alignment differences, or trigger path skew (especially cross-board).
| Term | Guarantees | Common false belief | How to prove (minimum) |
|---|---|---|---|
| Phase coherence | The channel-to-channel phase offset of the same tone returns to the same value after reboot / link rebuild / re-alignment. A fixed constant offset is acceptable if it is stable and calibratable. | “Sharing one clock automatically guarantees phase coherence.” | Output the same sine on all channels; record phase differences; repeat across N restarts (and after re-establishing the link / reissuing SYSREF) and confirm the phase returns within a defined tolerance. |
| Deterministic latency | A fixed time from a defined event (trigger / frame boundary) to analog output response. The absolute delay can be calibrated, but must be repeatable. | “Link up means latency is fixed.” | Use a step/marker waveform aligned to the event; measure Δt to output response; repeat after link rebuild and verify Δt returns. |
| Synchronous update | Channels change codes at the same update boundary (LDAC / shared trigger / aligned sample-and-hold). It guarantees “change together,” not “phase repeats after restart.” | “Synchronous update is the same as phase coherence.” | Drive a large step on all channels; confirm step edges align. Then still run the phase-repeatability test to validate coherence. |
| Frequency coherence | Channels share the same timebase so frequencies track together. It does not guarantee startup phase or alignment state is identical. | “Same frequency implies same phase.” | Observe long-term drift/beat; confirm frequency lock. Treat phase-repeatability as a separate requirement with separate proof. |
- Phase changes after reboot → prioritize SYSREF/LMFC anchoring and buffer alignment behavior.
- Stable within a run but boards disagree → prioritize cross-board trigger distribution and event timebase.
- Edges update together but tones misalign → synchronous update is OK; prioritize clock + alignment state.
With the target terms separated and testable, the next step is to map each requirement to the clock/sync signal stack (REFCLK → device clock, LMFC, SYSREF, and SYNC~) and understand which layer can introduce or remove uncertainty.
The clock/sync stack: REFCLK, device clock, LMFC, SYSREF, SYNC~
A coherent DAC system is best understood as a stack with three layers: a physical timebase (REFCLK → PLL → device clock), an alignment-marker layer (LMFC and SYSREF), and a link control/training layer (SYNC~ and lane state). Each layer answers a different question: “What runs?”, “Where are the boundaries?”, and “When does the system re-align?”
- REFCLK is the external timebase entry; its quality and distribution determine what is possible downstream.
- Device clock is what actually drives the SERDES/DAC domains; phase-repeatability ultimately depends on how this clock is generated and distributed.
- Failure signature: frequency appears correct, but startup phase is not repeatable after power cycle or re-lock.
- LMFC defines a shared “time grid” for multi-device boundaries; it is the reference for deterministic alignment in JESD systems.
- SYSREF is not a frequency source; it is an anchor event that forces devices to agree on the same LMFC phase/boundary.
- Failure signature: link can be up, but the system shows run-to-run phase/latency changes because the boundary was not anchored consistently.
- SYNC~ (or equivalent control) governs link state and (re)training; it can indirectly change alignment points by forcing re-initialization.
- For coherence work, the key is not protocol detail but knowing when retraining happened and whether it altered buffer alignment or boundary capture.
- Failure signature: “random” slips correlate with retrain/relock events, not with the DAC waveform itself.
- Phase coherence needs device clock consistency and LMFC/SYSREF anchoring.
- Deterministic latency depends on boundary anchoring and preventing alignment points from moving across retrains.
- Cross-board alignment adds trigger/timebase distribution as a separate engineering item.
Once the stack is clear, the next chapter can focus on SYSREF/LMFC alignment as the repeatability mechanism: defining the capture window, preventing alignment-point drift across retrains, and making phase/latency behavior testable and production-friendly.
SYSREF & LMFC alignment: the only repeatable way (conceptual model)
“Repeatable phase coherence” requires more than a shared reference clock. The missing ingredient is a shared boundary grid (LMFC) that is anchored the same way on every run. SYSREF is the alignment marker that pins the LMFC phase to a known, repeatable position across devices.
- Observed: channel-to-channel phase/latency changes after power cycle or link rebuild.
- Root causes: PLL re-lock phase is not guaranteed, and elastic/buffer alignment points can differ run-to-run.
- Needed anchor: a shared discrete boundary (LMFC grid) that all devices agree on.
- Solution: distribute SYSREF so every device captures the same LMFC phase inside a stable capture window.
- Best when alignment is performed at bring-up and should remain stable in steady state.
- Reduces ongoing marker activity; favors “align once, then hold.”
- Requires controlled reset/retrain behavior to preserve repeatability across restarts.
- Best when the system can retrain/relock and must recover to the same boundary reliably.
- Acts as a recurring anchor opportunity; not “more synchronous” by itself.
- Demands strict control of SYSREF skew and capture-window placement to avoid random alignment.
- Good window: SYSREF arrives at a stable point relative to the device clock, enabling deterministic boundary capture.
- Bad window: SYSREF lands near a sensitive edge/transition region; capture becomes ambiguous and alignment turns random.
- Engineering implication: SYSREF distribution must be treated like a timing signal (controlled skew + repeatable conditions).
A system that “sometimes aligns” is usually failing the capture-window or distribution-repeatability requirements. The next step is to examine multi-device failure points: elastic buffers, lane deskew, and distribution skew that can move alignment points even when the link is up.
Multi-device JESD: subclass timing, elastic buffers, and why “alignment” fails
In a single device, “alignment” often looks straightforward. In multi-device systems, repeatability fails because alignment points can move. The biggest culprits are elastic buffers, lane-to-lane skew/deskew, and non-uniform SYSREF/trigger distribution. The goal here is not protocol detail, but identifying which mechanism can create discrete run-to-run phase/latency jumps.
- How uncertainty appears: the buffer absorbs skew and can select different boundary positions after retrain/relock.
- Symptom: output latency/phase changes in discrete steps after link rebuild.
- Minimum check: force a controlled retrain/reset and verify whether Δt or phase returns to the same value.
- How uncertainty appears: deskew can make data valid but still shift effective alignment relative to the shared boundary.
- Symptom: lanes are healthy, yet channels differ by a consistent offset or change after relock events.
- Minimum check: correlate phase/latency shifts with deskew/retrain events (rather than waveform changes).
- How uncertainty appears: skew or channel delay differences make devices capture different boundary phases.
- Symptom: the same bring-up sequence yields different alignment results across devices/boards.
- Minimum check: swap distribution channels/paths and see whether the offset follows the path.
Quick check: swap SYSREF fanout outputs; see if the offset follows the channel.
Fix direction: star distribution, matched path delays, and a controllable phase/delay knob for bring-up trim.
Quick check: re-map fanout outputs; observe whether fixed offsets move.
Fix direction: low-skew fanout selection, reserved delay adjust, and production-time characterization/compensation.
Quick check: repeatedly reset/retrain under controlled conditions; count the number of distinct states.
Fix direction: stabilize boundary capture (SYSREF/LMFC), avoid unnecessary retrains, and validate deterministic latency across resets.
Quick check: constrain trigger into a single clock domain and re-test repeatability.
Fix direction: deterministic event distribution (fanout + domain sync) and measurable skew budgets.
The highest leverage debugging approach is to look for discrete states (a few repeatable phase/latency outcomes). Discrete states usually indicate alignment-point selection (SYSREF capture, buffer state, deskew) rather than continuous noise or waveform distortion.
Cross-board synchronization: triggers, timing fanout, and deterministic eventing
Cross-board synchronization is a different class of problem than JESD alignment inside one board. The goal is deterministic eventing: the same event is defined the same way on every board, arrives with controlled skew, and maps to a repeatable output time. A trigger that arrives “at the same time” does not guarantee analog outputs change at the same time unless the event-to-output delay is measured, repeatable, and compensated.
| Approach | When it fits | What can break repeatability | Minimum proof |
|---|---|---|---|
| Shared REFCLK | Highest coherence needs; a common timebase across boards for phase-stable operation and long-term tracking. | Fanout skew, path mismatch, relock behavior that changes absolute phase across boots. | Repeat the same bring-up; verify channel-to-channel phase returns after power cycles and after event re-alignment. |
| SYSREF across boards | Systems that define boundaries in an LMFC grid and must recover to the same boundary state across boards. | SYSREF skew/window violations; distribution channel delays that make boards capture different boundary phases. | Reissue SYSREF and rebuild links; verify deterministic latency and phase return to the same state. |
| Trigger-only sync | Lowest cost; “do the same thing at the same time” where fixed delay can be measured and compensated. | Trigger skew, domain crossing, and event-to-output delay drift or state changes after resets/retrains. | Measure Δt from trigger to output marker on each board; confirm Δt repeats after restarts before applying compensation. |
- Define the event reference point (trigger edge at a named connector or FPGA pin).
- Define the output reference point (marker, step edge, or phase reference on the analog output).
- Measure Δt for each board under a controlled bring-up sequence.
- Apply compensation (delay/phase adjust) only after Δt is proven repeatable.
- Repeat after resets/retrains to validate that the same Δt state returns.
Cross-board systems become production-friendly when distribution skew and event-to-output delay are treated as measurable parameters. A deterministic event is not a claim; it is a repeatable measurement across bring-up sequences and resets.
Jitter isolation: where to clean, where to NOT clean
This section focuses on engineering decisions that affect synchronization and repeatability, not on phase-noise theory. Jitter cleaning and isolation should be placed where they improve shared timebase consistency without introducing uncontrolled relock states that break run-to-run phase/latency repeatability.
- Reference source: noise shared by all paths if not controlled at the entry point.
- Distribution network: fanout and routing skew create channel-to-channel differences.
- Digital coupling: FPGA/SerDes switching injects noise via supply/ground paths.
- PLL behavior: relock can change absolute phase and break run-to-run determinism.
- Board-level jitter cleaner: best for delivering one consistent timebase to many devices/boards.
- DAC internal PLL: can isolate local noise but may require SYSREF/LMFC anchoring to stay repeatable.
- FPGA reference conditioning: improves capture stability; must preserve a deterministic mapping to DAC timing.
- Clean at the timebase entry point and distribute with controlled skew.
- Treat SYSREF and trigger as timing signals (repeatable conditions, measurable skew).
- Standardize bring-up and re-alignment sequences so the same state returns after resets.
- Do not add an uncontrolled relock path after the alignment marker (SYSREF) has defined the boundary phase.
- Do not mix independent timebases across boards and expect phase coherence without calibration and anchoring.
- Do not allow unsynchronized triggers to cross clock domains into boundary-critical capture logic.
Jitter cleaning helps only when it preserves a deterministic mapping between the shared timebase, alignment markers, and event capture. Any relock behavior that changes absolute phase must be paired with a repeatable re-alignment method (SYSREF/LMFC) and a production test that proves it.
Clock tree engineering: distribution, skew, and phase-adjust hooks
A coherent DAC system is only as repeatable as its clock tree. Distribution topology, channel-to-channel delay variation, and board-to-board path differences define the practical skew floor. A production-friendly design treats skew as a measurable parameter and reserves phase/delay adjustment hooks so bring-up and manufacturing can converge to the same aligned state.
- Timebase paths: REFCLK / SYSREF / TRIG distribution must be controlled and repeatable.
- Topology consistency: same fanout type, same connector class, fixed port mapping across builds.
- Boundary markers: SYSREF/trigger timing relative to the receiving clock must avoid window ambiguity.
- Fanout channel delay: compensate only after port mapping is fixed and measured.
- Board-to-board fixed delay: characterize event-to-output Δt and store a compensation table.
- Residual skew: use programmable delay/phase knobs for deterministic convergence in bring-up.
- Length match helps when routing/connector delay dominates.
- Length match saturates when fanout channel delay or device-state variation dominates.
- Trim hooks matter when production needs the same aligned state across boards and replacements.
Skew control is most efficient when the dominant term is identified first. Tight length matching is meaningful only when the routing/connector term dominates; otherwise, fixed mapping plus trim hooks provide a more repeatable path to coherent alignment across builds.
Bringing-up a coherent system: step-by-step checklist (lab workflow)
Coherence bring-up succeeds when the dependency chain is respected: lock clocks first, establish stable links next, anchor boundaries with SYSREF, then verify deterministic latency before measuring phase and expanding to cross-board timing. Each step below lists a goal, an observable, and a fast fallback.
Observe: fixed topology and fixed port mapping.
If fail: stop link tuning and fix distribution decisions first.
Observe: skew distribution (repeat measurements, not one sample).
If fail: change fanout mapping, topology, or enable delay/phase trim.
Observe: lock status, lock time consistency, and restart repeatability.
If fail: fix reset sequencing and remove conditions that cause relock.
Observe: link state and error counters (if available).
If fail: fix link integrity before attempting alignment.
Observe: alignment indicators and run-to-run consistency after re-alignment.
If fail: revisit SYSREF skew and capture-window placement.
Observe: Δt repeats after resets/retrains (count discrete states).
If fail: return to PLL and SYSREF steps; state randomness must be removed first.
Observe: phase difference distribution across repeated bring-ups.
If fail: suspect clock-tree skew or alignment state changes, not waveform content.
Observe: phase changes follow the knob and converge to a repeatable target.
If fail: the trim point is after an unstable state; relocate or re-anchor alignment.
Observe: output markers align after compensation and remain aligned after restarts.
If fail: debug trigger distribution, domain crossing, and fixed-delay characterization.
Observe: the same recipe yields the same aligned state without “manual tuning.”
If fail: identify remaining random states and remove them before scaling production.
This workflow scales when each checkpoint is tied to an observable and a controlled fallback. A coherent system is not defined by a single “good run,” but by returning to the same aligned state across resets and rebuilds.
Verification methods: prove deterministic latency and phase coherence without fancy gear
Synchronization verification should focus on repeatability, not on one “good” screenshot. Two proofs matter in practice: deterministic latency (event-to-output Δt returns after resets) and phase coherence (phase difference returns after re-lock and re-alignment). The techniques below use simple markers and correlation so the root cause can be separated from dynamic performance measurements.
| Test signal | What it proves | Minimum gear |
|---|---|---|
| Step edge / marker | Event-to-output Δt, discrete states after resets, and skew repeatability. | Oscilloscope (2+ channels) + a stable trigger. |
| Short pulse | Δt visibility in noisy environments; checks trigger capture stability. | Oscilloscope; optional logic probe for trigger timing. |
| Same-tone sine | Phase difference repeatability across bring-ups using correlation or phase fit. | Oscilloscope (time capture) or basic acquisition. |
| Few-point freq sweep | Phase-vs-frequency trend suggests delay error versus random alignment state. | Oscilloscope; no spectrum purity requirement. |
| Reset / retrain repeats | Proves whether “the same recipe returns the same state” (pass/fail for production). | Any of the above tools; the key is repetition and logging. |
- Discrete Δt states after resets → suspect SYSREF/LMFC capture, elastic buffering, or relock-dependent states.
- Wide Δt scatter with stable average → suspect trigger capture uncertainty (threshold, CDC, reference shifts).
- Slow drift over time/temperature → suspect clock-path stability and distribution, not one-time alignment.
- Re-issuing alignment changes the state → the system is alignment-driven; window/skew control becomes priority.
A verification plan is complete only when it includes repetition. Logging Δt and phase differences across resets distinguishes random state changes from measurable fixed offsets that can be compensated in bring-up and production.
Root-cause map: when phase slips, where to look first
Troubleshooting should start with the fastest discriminators: alignment events, clock-path state, and trigger-chain uncertainty. Each item below maps a symptom to the most likely cause and a single verification action. This keeps the debug loop short and avoids broad detours.
- Symptom: phase jumps between a few discrete values after reset.
Likely: capture window/skew causes different boundary states.
Check: re-issue SYSREF and log whether the same state returns. - Symptom: link is up but phase is not repeatable.
Likely: elastic buffer alignment state changes.
Check: repeat retrain cycles and count discrete Δt states. - Symptom: “works once” but cannot be reproduced by another bring-up.
Likely: sequence-dependent state.
Check: freeze a fixed reset/link/alignment order and retest.
- Symptom: slow drift with time or temperature.
Likely: distribution stability or PLL behavior dominates.
Check: log Δt over time at fixed state; compare across channels/boards. - Symptom: changing fanout port changes phase offsets.
Likely: channel-to-channel fanout delay is the dominant skew term.
Check: fix port mapping and remeasure the skew distribution. - Symptom: phase return depends on relock timing.
Likely: relock introduces a different absolute phase state.
Check: enforce deterministic lock sequence and verify state return.
- Symptom: trigger looks aligned, but outputs are not.
Likely: event-to-output Δt differs and is not compensated.
Check: use a step marker and build a per-board Δt table. - Symptom: wide jitter scatter but stable mean.
Likely: trigger threshold/reference or CDC uncertainty.
Check: observe trigger edge and capture domain together; retest with differential trigger. - Symptom: board-to-board behavior changes with cabling/probing.
Likely: trigger reference shift or coupling path sensitivity.
Check: change only the trigger path and see if Δt distribution moves.
This map turns phase slips into a bounded search. When the symptom is classified (jumps, drift, or scatter), the first check becomes obvious, and the debug loop stays short enough to be repeatable across teams and production builds.
Production & vendor questions: what to lock down before layout and before PO
Phase-coherent systems fail in production when “sync” is treated as a promise instead of a measurable, repeatable state. Before layout and before purchase orders, lock down three things: capability (SYSREF/LMFC and deterministic alignment), observability (status/indicators that prove alignment), and repeatability (the same recipe returns to the same state after resets).
Evidence: register fields + a repeatable alignment state after resets.
Evidence: documented configuration + readback/verification path.
Evidence: a single readout that can be logged in production.
Evidence: marker test demonstrates one stable state (or a bounded state count).
Evidence: measured skew distribution at endpoints (not one probe point).
Evidence: moving ports measurably changes phase/Δt; fixed mapping stabilizes compensation.
Evidence: monotonic response + ability to return to the same target after resets.
Evidence: every unit can be traced to a reproducible recipe and measured proof.
Evidence: trigger-only changes move Δt scatter; differential distribution reduces sensitivity.
Evidence: restart tests show the same aligned state returns without “lucky timing.”
- Provide SYSREF support details: one-shot vs periodic, recommended pulse width, and valid frequency range.
- Provide the required SYSREF timing window relative to the receiving clock (setup/hold concept and recommended margin).
- Provide the exact “alignment done / SYSREF captured / SYSREF valid” indicators (register bits, GPIO options, interrupts).
- Provide LMFC-related configuration: what is configurable, what is fixed, and how configuration can be read back/verified.
- Confirm multi-device repeatability: does the same bring-up recipe return to the same phase state after cold boot and retrain?
- Describe the deterministic latency mechanism (event definition, boundary, and what conditions break determinism).
- Provide channel-to-channel skew specifications (typ/max) and temperature behavior for the relevant sync path.
- Describe re-lock behavior (lock time, re-lock triggers, and whether re-lock changes absolute phase state).
- Provide recommended distribution topology (star/daisy) for REFCLK and SYSREF and the allowed endpoint skew budget.
- Provide a minimum production test recommendation to prove repeatable alignment (signal, steps, and pass/fail definition).
- Confirm what must be stored to restore a repeatable state (profile/OTP/EEPROM), and how restoration is validated.
- Provide any known failure modes where link-up is possible but alignment is not repeatable (and the fastest discriminator).
- Cold boot → lock clocks → bring up link → issue SYSREF → confirm alignment status bits.
- Output a marker (step edge or short pulse) on all channels.
- Measure Δt between channels (and optional phase on a same-tone sine window).
- Repeat N times (example: N = 5 or 10) with the same reset/retrain recipe.
- Log status bits + config hash + Δt/phase stats; decide PASS/FAIL.
- Δt repeatability: max(Δt) − min(Δt) ≤ T_rep
- Phase repeatability: |Δφ| ≤ Φ_rep after reboot/re-align
- State count: discrete Δt states ≤ S_max
- Alignment flags: required status bits must be asserted and logged
- Recipe integrity: config hash must match the golden profile
Calibration and compensation only hold when the system returns to the same state. If state count is not bounded, treat it as an alignment/clock/trigger state problem before introducing additional calibration layers.
The list below is a category checklist, not a recommendation list. Use it to freeze BOM buckets (clock/SYSREF, fanout, trigger fanout, storage) early.
- TI LMK04828
- Analog Devices AD9528
- Analog Devices HMC7044
- Renesas 8V19N490B
- TI CDCE62005
- TI CDCM6208
- Analog Devices ADCLK948
- TI LMK00334
- TI LMK1C1104
- TI SN65LVDS104 (1:4 LVDS fanout)
- Microchip 24LC02B (I²C EEPROM, example)
When the vendor can provide explicit sync modes, alignment indicators, and repeatability guidance—and production can prove it with a short marker test— layout decisions and purchase orders become low-risk. Without those locks, “sync” becomes a bring-up anecdote rather than a manufacturable feature.
FAQs: clock & sync for phase-coherent DAC systems
These FAQs capture long-tail sync questions (SYSREF/LMFC, multi-device alignment, cross-board triggers, jitter isolation strategy, verification, and production gates) without expanding the main content scope.
Why does phase change after every cold boot even with a shared clock? ALIGN
A shared REFCLK guarantees frequency coherence, but not a repeatable alignment state after reset.
- Why: PLL relock can start from different phase states; elastic buffers can choose different boundary points; SYSREF capture can land outside a stable window.
- Do this: repeat the same bring-up recipe N times and log (1) alignment status bits, (2) Δt between channels, and (3) state count (how many discrete Δt clusters appear).
One-shot SYSREF or periodic SYSREF — which gives better repeatability? ALIGN
Use one-shot to minimize ongoing disturbance, and periodic only when continuous re-anchoring is required and verified stable.
- Why: periodic SYSREF can keep boundaries aligned but may introduce additional capture events; one-shot is cleaner but depends heavily on skew/window control.
- Do this: compare reboot scatter (phase/Δt clusters) under both modes; choose the mode that yields a single stable cluster across resets.
What does “SYSREF setup/hold” mean in practice if no explicit spec is given? ALIGN
It means SYSREF must land inside a capture window; outside that window, alignment becomes random.
- Why: the effective window is set by distribution skew, jitter, and the receiving clock domain.
- Do this: sweep SYSREF timing (delay/phase) and look for a “good window” where alignment flags and Δt/phase repeatability remain stable across resets.
The link is up, but channels are not phase-aligned — what is the first suspect? STATE
Suspect alignment state (elastic buffers/boundary selection), not spectrum quality.
- Why: “link up” can be compatible with multiple valid internal boundary states; those states can map to different output phase/latency.
- Do this: re-issue SYSREF and retrain N times; if Δt falls into multiple discrete clusters, fix state repeatability before adding compensation.
How to tell “alignment randomness” from “fixed skew that can be compensated”? VERIFY
Fixed skew produces one stable Δt; randomness produces multiple Δt clusters or wide scatter after identical bring-ups.
- Check 1: repeat N bring-ups and compute max(Δt)−min(Δt).
- Check 2: count distinct Δt clusters (state count). If state count > 1, treat it as a sync-state problem, not a calibration problem.
Trigger arrives at the same time, but outputs do not — why? TRIG
“Trigger-aligned” is not “output-aligned” because each board has its own event-to-output pipeline delay.
- Why: fixed latency differs by board, link state, and alignment boundary; identical trigger arrival does not equal identical output boundary.
- Do this: output a step marker, measure per-board Δt, and compensate using a recorded Δt table (only after state repeatability is proven).
Shared REFCLK vs shared SYSREF vs trigger-only sync — when to use which? SYSTEM
REFCLK aligns frequency, SYSREF anchors boundaries, and trigger-only aligns events without guaranteeing phase repeatability.
- Use shared REFCLK: when frequency coherence is enough and phase does not need to return after reboot.
- Use shared SYSREF: when phase/boundary must be repeatable across devices and restarts.
- Use trigger-only: for “simultaneous action” with per-board Δt compensation, not for absolute phase coherence.
Star or daisy-chain for cross-board fanout — what breaks first? TRIG
Daisy-chains tend to accumulate skew and temperature drift; star fanout is easier to control and validate.
- Why: connector/cable delay and fanout channel delay often dominate over “trace length matching.”
- Do this: measure endpoint skew distribution (not a single probe point) and choose the topology that keeps the distribution within the budget.
Where should jitter cleaning happen, and where should it NOT happen? JITTER
Clean jitter on the shared reference path and avoid introducing new relock-dependent phase states after alignment.
- Why: cleaners and PLLs add lock time, loop bandwidth behavior, and possible relock phase-state changes that can defeat repeatability.
- Do this: verify that re-lock and re-align return to a single stable state (state count = 1) before optimizing performance.
Why did performance not improve after adding a jitter cleaner? JITTER
Jitter may not be the dominant limiter, and sync-state randomness can mask any improvement from cleaning.
- Why: if phase/Δt is not repeatable across bring-ups, the system is not in a stable state where performance comparisons are meaningful.
- Do this: first prove repeatability with a marker test and reboot scatter; only then compare performance under controlled, identical states.
How to prove deterministic latency with only a scope? VERIFY
Use a step (or short pulse) marker, measure Δt, and repeat bring-up cycles to show Δt returns to the same value.
- Do this: run the exact recipe (lock → link → align) and measure Δt for each iteration.
- Pass condition: max(Δt)−min(Δt) stays within the project threshold, and state count stays bounded (ideally 1).
What is the minimum production gate to guarantee “repeatable alignment”? PROD
A one-key test must prove alignment flags, repeatability statistics, and recipe integrity (config hash) across repeated bring-ups.
- Must log: alignment status bits, config hash/version, Δt/phase statistics, and state count.
- Must decide: PASS only when thresholds (T_rep, Φ_rep, S_max) are met across N repeats with the same recipe.