GPSDO / Atomic Clock: OCXO/CSAC Disciplining & Holdover
← Back to: Avionics & Mission Systems
A GPSDO/atomic clock discipline uses GNSS long-term accuracy to steer an OCXO/CSAC so the system delivers both low short-term jitter and predictable holdover when GNSS is lost. The engineering focus is controlling noise injection, budgeting time error across outages, and exporting trustworthy quality flags for outputs and timestamps.
GPSDO / Atomic Clock: OCXO/CSAC Disciplining & Holdover
A practical avionics-oriented view of what a GPS-disciplined reference delivers, how the disciplining system is built, and what signals/flags prove it is trustworthy.
What a GPSDO/Atomic Clock actually delivers in avionics
Featured answer (the “definition that matters”)
A GPSDO (GNSS-disciplined oscillator) is a timing-reference system that continuously steers a high-quality local oscillator (OCXO/CSAC) using GNSS as a long-term truth source. The result is a reference that preserves low short-term phase noise from the local oscillator, while correcting long-term frequency/phase drift to remain traceable to GNSS time.
The three deliverables (what the system truly “hands to” other boxes)
| Deliverable | Typical outputs | What it is used for (engineering meaning) |
|---|---|---|
| Frequency reference | 10 MHz, 25 MHz (and derived clocks) | A continuous “tick-rate” that defines how fast counters advance and how synthesizers/PLLs derive clocks. It is judged by short-term noise (phase noise/jitter) and long-term accuracy (drift/aging compensation). |
| Timing pulse | 1PPS (aligned to a time epoch) | A time boundary marker that defines “where the second starts.” It is judged by time/phase error to the epoch and by pulse jitter (edge stability + measurement noise). |
| Time-of-day / time code | ToD, IRIG-B (as a time label channel) | A human-usable time label for logs, recorders, and coordination. It is judged by consistency, update rules, and quality flags that describe trust level. |
Why a free-running XO/RTC is not enough (the engineering boundary)
- Traceability vs self-consistency: a free-running oscillator can be stable yet still drift away from an external time scale; a GPSDO explicitly constrains drift using GNSS observations.
- Time-scale fusion: short-term behavior is dominated by the local oscillator; long-term behavior is corrected by disciplining. This “fusion” is what enables both low phase noise and bounded long-term error.
- Observability and trust reporting: a GPSDO can output lock state, holdover state, and estimated error so the rest of the avionics stack can make safe decisions.
- Defined loss-of-reference behavior: when GNSS is degraded or lost, the system can enter holdover with a known policy (freeze/slow-steer/model-based) rather than drifting silently.
What “Atomic Clock” means on this page (no physics, only system value)
“Atomic clock” here refers to using a CSAC (chip-scale atomic clock) or a rubidium-based reference as the local holdover timebase. The practical value is not “better GNSS,” but more predictable holdover: lower mid-term drift, improved resistance to temperature/aging uncertainty, and therefore smaller time-error growth when GNSS is unavailable.
Figure F0 is intentionally minimal: it prevents scope creep while making the three deliverables explicit. The full architecture and signal flows appear in H2-2.
System architecture: GNSS receiver + disciplining engine + OCXO/CSAC + outputs
How to read the system: four flows (keeps the design coherent)
A GPSDO becomes easy to design and validate when the architecture is expressed as four distinct flows. Each flow has different failure modes, different “right” bandwidth/time constants, and different evidence signals.
- Reference flow: GNSS provides 1PPS, time-of-day, and reference health indicators (e.g., C/N0, satellite count, solution status).
- Measurement flow: phase and frequency are estimated by comparing GNSS timing markers against the local clock domain.
- Control flow: a disciplining/steering engine converts measured errors into actuator commands (DAC word, DDS bias, or DCO tuning).
- Output & distribution flow: clock synthesis, buffering, pulse shaping, and time-code formatting produce usable 10 MHz/1PPS/ToD outputs.
Core blocks (what must exist in any “real” GPSDO)
The implementation details vary, but the functional blocks below are unavoidable if the system must deliver low phase noise and bounded long-term drift.
- GNSS receiver: produces 1PPS + time solution; also exposes integrity cues (solution validity, C/N0 trends, leap/epoch status).
- Phase/frequency measurement: computes phase error (time offset) and frequency error (rate offset) between reference and local clock.
- Disciplining engine (steering law): selects bandwidth/time constants, filters measurement noise, and applies safe-state rules during degraded reference.
- Actuator: DAC or digitally-controlled oscillator interface that steers the local timebase without injecting avoidable noise/steps.
- Local timebase: OCXO/CSAC that dominates short-term stability; its temperature drift and aging dominate holdover growth.
- Clock synth + output conditioning: derives required frequencies, shapes 1PPS edges, and provides fanout with controlled additive jitter.
- Health/quality reporting: lock/holdover state, phase-step detection, and estimated time error (ETE) for system-level decision making.
Time path vs frequency path (the key mental model)
Although both outputs originate from the same timebase, the control objectives differ: 1PPS enforces an epoch boundary (phase/time alignment), while 10 MHz enforces a continuous tick-rate (frequency accuracy and short-term noise). Treating them as a single “lock” often causes a classic failure: either GNSS measurement noise is injected into the oscillator (too fast), or convergence becomes slow and recovery/holdover transitions become unpredictable (too slow).
What must be observable (signals that prove trustworthiness)
A GPSDO that cannot report internal evidence behaves like a black box. A system-ready design should expose at least: phase error, frequency error, PPS jitter, reference health, lock/holdover state, and quality flags. These are not “nice-to-have”; they are the only practical way to prevent silent time-base corruption.
This architecture page intentionally stops at “clean reference outputs.” Network distribution mechanisms (e.g., PTP/SyncE) belong to sibling pages and should be linked, not expanded here.
Oscillator selection boundaries: OCXO vs CSAC vs TCXO vs Rb (engineering view)
Selection principle (avoid “best oscillator” thinking)
Oscillator selection is a time-scale problem. A GPSDO is judged across multiple time windows: short-term cleanliness (phase noise / integrated jitter), mid-term stability (how the rate wanders over seconds to minutes), and long-term predictability (temperature drift + aging that dominates holdover time error). The correct choice is the oscillator that wins in the time window that drives the mission requirement, while meeting power, warm-up, and cost constraints.
What must be defined before choosing
- Primary deliverable: is the system dominated by frequency cleanliness (derived clocks) or by time-boundary accuracy (1PPS/time tags)?
- Holdover objective: target holdover duration and maximum allowed time error growth (minutes vs hours vs longer).
- Operating envelope: temperature range, vibration exposure, and whether warm-up time is acceptable.
- Operational budget: available power, size, cost, and calibration/test capability.
Engineering comparison (wins by time scale, not by marketing)
| Timebase | Primary win window | What it improves most | Key trade-offs | Best fit |
|---|---|---|---|---|
| TCXO | Fast start, low power | Practical stability for cost/power-limited systems; minimal warm-up overhead. Suitable when jitter requirements are not extreme and holdover tolerance is relaxed. | Holdover time error grows faster due to higher sensitivity to temperature and aging. Less headroom for tight time-boundary guarantees during GNSS loss. | Power-first designs |
| OCXO | Short-term cleanliness | Low close-in phase noise and strong short-term stability once warmed up. Often the best choice when derived clocks must remain “clean” while still disciplining to GNSS long-term truth. | Warm-up time and higher power draw; thermal design matters. Predictable holdover requires temperature modeling and aging estimation. | Low jitter priority |
| CSAC | Mid-term holdover | More predictable mid-term rate stability, reducing time-error growth during GNSS outages. Excellent when holdover is critical but a larger atomic reference is impractical. | Cost and integration constraints; short-term phase noise may not beat a top-tier OCXO. Output frequency options and tuning interface may limit architecture choices. | Holdover-first designs |
| Rb (engineering) | Longer holdover | Strong holdover performance when long outage windows must be bounded. Most valuable when system-level safety cases require explicit time-error limits through extended GNSS loss. | Higher power, size, and cost; integration and qualification complexity. Use only when the holdover requirement cannot be met with OCXO/CSAC and modeling. | Strict holdover bounds |
Clear boundary statements (when each becomes “necessary”)
- Choose OCXO when short-term cleanliness is the gating requirement and warm-up power is acceptable.
- Choose CSAC when mid-term holdover predictability is the gating requirement and a higher-cost timebase is justified.
- Choose Rb when extended holdover time-error bounds are mandatory and power/size/cost can be allocated.
- Choose TCXO when power/startup dominates and the system can tolerate larger time-error growth during GNSS loss.
The map shows “win windows,” not absolute rankings. The disciplining loop (H2-4) must still be tuned to avoid injecting GNSS measurement noise into the timebase.
Disciplining loop deep dive: PLL/FLL, bandwidth, time constants, and “steering”
Core idea (what the loop is really doing)
Disciplining is not “locking everything tightly.” It is a controlled fusion of two truths: GNSS is accurate but noisy in the short term, while the local oscillator is clean in the short term but drifts long term. The loop must follow GNSS only where GNSS is trustworthy (slow time scales), and preserve the oscillator where the oscillator is superior (fast time scales).
Why GPSDOs commonly use FLL + PLL (engineering roles)
- FLL (frequency loop): corrects rate error. It is best at pulling a large initial frequency offset into a small, manageable range and stabilizing the long-term slope of time error.
- PLL (phase loop): corrects residual phase/time offset (e.g., 1PPS alignment). It is best at enforcing a stable epoch boundary once the rate is under control.
- Combined behavior: FLL prevents slow “runaway” during outages and reduces recovery shock; PLL cleans up phase alignment without demanding unrealistic correction speed.
The tuning knobs (what to set, what it changes)
| Knob | If increased | If decreased |
|---|---|---|
| Loop bandwidth | Faster convergence, but more GNSS measurement noise is injected into the oscillator (worse 1PPS jitter / phase noise). | Cleaner short-term output, but slower convergence and harder recovery from GNSS loss (longer time to re-discipline). |
| Time constant / averaging | More smoothing of measurement noise, but slower response to real drift changes. | More responsive, but more sensitive to short-term measurement noise and outliers. |
| Max steering rate | Quicker correction, but higher risk of phase steps and spur-like behavior (especially with quantized actuators). | Safer transitions, but longer time to correct large offsets after holdover. |
| Integrity gating | Better protection against bad GNSS; may hold timebase longer in holdover if gating is conservative. | Risk of “tracking bad reference,” which can silently corrupt time and invalidate logs or mission timing. |
Steering implementations (what matters in system behavior)
- DAC tuning: fine control, but DAC noise/quantization and update behavior can couple into phase noise. Requires clean reference voltages and careful filtering.
- DCO / digital tuning word: compact control interface, but quantization steps can create discrete phase/frequency jumps if not rate-limited and filtered.
- Fractional-N / synthesizer bias: flexible frequency generation, but spur management and additive noise of synthesis stages must be controlled.
- DDS offset: strong fine resolution, but spur and image control are central; apply only where the spur budget is acceptable.
Common failure patterns (symptom → likely cause → corrective action)
| Symptom | Likely cause | Corrective action |
|---|---|---|
| 1PPS jitter worsens after lock | Bandwidth too wide; GNSS measurement noise dominates; poor integrity gating. | Reduce bandwidth; increase averaging; gate corrections on reference health; verify phase measurement noise floor. |
| Very slow convergence to “good” state | Bandwidth too narrow; insufficient FLL pull-in; actuator range not centered. | Use a staged strategy (fast pull-in then slow discipline); ensure actuator has headroom; verify initial frequency estimate. |
| Phase steps during recovery | Actuator quantization or too aggressive steering; no maximum phase-step guard. | Limit steering rate; apply smooth re-entry (gradual re-discipline); add phase-step detection and controlled correction policy. |
| Silent time corruption during degraded GNSS | Tracking bad reference (integrity not enforced); outliers not rejected. | Implement integrity checks (health flags, outlier rejection); freeze or slow-steer under suspicion; report quality flags. |
Figure F2 highlights the three dominant noise-injection paths. Later sections quantify their impact using phase-noise/jitter metrics and holdover time-error growth.
Phase noise and jitter: from L(f) plots to a usable jitter budget
What the plot must become (the only output that budgets can use)
A phase-noise plot is not the goal. The engineering goal is a single, comparable number: RMS time jitter over a stated integration window and carrier frequency. Without the window and the carrier, a “jitter value” is not comparable across clocks or architectures.
Executable conversion workflow (L(f) → trms)
- Step 1 — Identify the carrier: choose the clock of interest at carrier frequency f0 (e.g., 10 MHz or a derived higher-rate clock).
- Step 2 — Choose the integration window: select [f1, f2] in offset frequency that matches what the downstream system converts into timing error.
- Step 3 — Integrate to phase jitter: integrate the phase-noise density over the window to obtain φrms (phase jitter).
- Step 4 — Convert to time jitter: trms = φrms / (2π f0).
How to choose the integration window (engineering rules, not a standards lecture)
| Window edge | What it represents | Engineering selection rule |
|---|---|---|
| f1 (low offset) | Where “slow wander” stops being treated as drift/steering behavior. | Set f1 high enough that very slow variations (disciplining/holdover effects) are not counted as “jitter.” If the system corrects or models slow drift, that energy belongs in stability/holdover analysis, not in the jitter number. |
| f2 (high offset) | Where downstream stops converting phase variation into timing error. | Set f2 to match the effective bandwidth of the clock distribution and the sensitivity of the receiving edge/measurement. Integrating far beyond what the receiving system “sees” inflates jitter without improving truth. |
Why different systems care about different offsets (kept generic to avoid scope creep)
- Edge/instant timing sensitivity: systems that convert clock edge uncertainty directly into measurement error are most sensitive to the offsets that dominate edge timing variance.
- Synthesis chain sensitivity: derived clocks may be dominated by mid-band noise and spur behavior from synthesis and division stages.
- Distribution sensitivity: a clock tree can dominate far-out noise through additive jitter from buffers and fanout.
Who dominates the jitter (a diagnostic view)
A clock system is rarely limited by “the oscillator” alone. Dominance shifts with offset region: near-in noise often comes from the local timebase, mid-band from synthesis stages, and far-out from distribution buffers and power coupling. A usable budget separates these contributors instead of chasing one expensive component.
| Offset region | Typical dominant block | What to do first (engineering action) |
|---|---|---|
| Near-in | Local oscillator intrinsic noise; how tightly the system follows reference at slow time scales. | Choose a timebase that fits the mission window; avoid bandwidth settings that import reference measurement noise into short-term output. |
| Mid-band | Synthesizer / divider / tuning path noise; spur management and loop interactions. | Reduce additive noise in synthesis stages; control tuning quantization and update behavior; keep spur energy out of sensitive regions. |
| Far-out | Clock distribution: fanout buffers, clock tree, power coupling, layout-induced additive jitter. | Budget additive jitter per buffer stage; minimize unnecessary fanout; improve power isolation and reduce coupling paths. |
A “good jitter number” is one that matches the receiving system’s sensitivity. Always publish f0 and the integration bounds with the RMS jitter result.
Allan deviation σy(τ): interpreting stability across time scales
What σy(τ) means (in engineering terms)
Allan deviation, σy(τ), is a stability map: it shows how fractional frequency stability changes when measurements are averaged over time τ. It is the most practical way to answer: “which time scale dominates the clock’s wandering, and what does that imply for holdover predictability?”
τ is not abstract: it maps to real engineering questions
- τ ≈ 1 s: short-term stability—how smoothly the clock rate behaves second-to-second.
- τ ≈ 10–100 s: mid-term wandering—how the rate drifts over tens of seconds to minutes (critical for short holdover windows).
- τ ≈ 1000 s and beyond: slow drift and aging—how temperature sensitivity and aging start to dominate long holdover behavior.
How to read the curve (dominant noise by region, explained without heavy math)
The curve shape tells which mechanisms dominate. When σy(τ) decreases with τ, short-term noise is being averaged out. A plateau often indicates flicker-like behavior or limits of the system’s control/modeling. An upward trend at large τ signals slow drift mechanisms that will dominate long holdover unless temperature and aging are managed.
| What the curve does | What it usually implies | Engineering interpretation |
|---|---|---|
| Falls with τ | Short-term noise averages down | Short holdover and second-to-second rate behavior are likely predictable; jitter and measurement noise are less dominant at longer τ. |
| Flattens (plateau) | Flicker-like or limit region | The clock hits a “floor” for that τ range; improving holdover in that window may require a better timebase or better modeling. |
| Rises at large τ | Slow drift dominates | Long holdover will be bounded by temperature/aging; error growth becomes harder to constrain without compensation and calibration. |
Using σy(τ) to judge holdover potential (without turning into a full holdover chapter)
- Match τ to the outage window: focus on σy(τ) near the holdover time scale that matters operationally.
- Look for stability floor: a low, stable region around that τ suggests more predictable time-error growth.
- Watch large-τ rise: if σy(τ) rises strongly at large τ, long holdover bounds will require temperature control, aging estimation, or a higher-grade timebase.
Allan vs TDEV vs MTIE (selection guidance only)
Use Allan deviation for “stability vs time scale” and dominance diagnosis. Use TDEV when the project needs a time-error flavored RMS view across τ. Use MTIE when the project needs an upper-bound style view of maximum time deviation within a window. This page uses them only to choose the right lens; pass/fail criteria belong in the validation chapter.
This figure intentionally stays generic. It teaches interpretation without turning into a standards tutorial or a full holdover error-budget chapter.
Holdover design: predicting time error when GNSS is lost
What holdover must deliver (system-facing outputs)
Holdover is not a binary “GNSS lost” mode. A usable design must output two items that let the system make decisions: (1) a predicted time-error bound over mission windows (e.g., 30 min / 2 h / 24 h), and (2) a confidence grade that states how trustworthy that bound is right now.
Provide TE upper bounds at fixed windows such as TE@30m, TE@2h, TE@24h. These bounds must remain consistent across state changes (lost → holdover → recovery).
Export a grade (A/B/C or 0–3) derived from reference integrity and model fit. A lower grade can trigger downstream fallback actions.
Time error is the integral of frequency error (a practical framework)
Time error grows when the clock frequency is wrong. The engineering model starts with fractional frequency error Δf/f. In the simplest case, a bounded frequency offset produces a time-error bound that grows with time: TE(t) ≈ |Δf/f| · t. When drift terms dominate, time error accelerates, so long holdover depends heavily on temperature and aging management.
Where holdover error comes from (layered, so the dominant term can be attacked)
| Error layer | What dominates it | Why it matters |
|---|---|---|
| Deterministic drift | Temperature sensitivity and its model error; aging rate and its estimation error. | Often dominates long holdover windows; it sets the time-error upper bound if not modeled and bounded. |
| Stochastic wandering | Noise types that dominate certain τ ranges (as seen in σy(τ)). | Limits predictability across minutes-scale windows and defines how “tight” a bound can be without false confidence. |
| Switching transient | State transitions: reference loss and recovery; control-policy changes. | Creates phase steps or rate jolts that can exceed steady-state bounds if not guarded. |
| Model uncertainty | Insufficient calibration coverage; stale parameters; mismatch between lab and field conditions. | Turns a computed bound into a guess. Confidence grading must penalize poor model fit. |
How to keep time error bounded (three hard tools + one safe recovery rule)
- Temperature compensation model: use temperature sensing plus a calibrated mapping from temperature to frequency correction; track residuals and widen bounds when residuals grow.
- Aging-rate estimation: update slow drift parameters using long observation windows; prevent short-term noise from corrupting aging estimates.
- Noise-aware policy: keep holdover tuned for the τ range that dominates the outage window; avoid importing reference noise into the holdover estimate.
- Recovery smoothing (phase-step guard): when GNSS returns, limit the maximum phase correction per second and ramp steering back gradually to avoid a discontinuous phase step.
This state machine emphasizes safety: do not keep steering from a suspicious reference; prefer a bounded holdover mode and an explicit confidence grade.
GNSS reference integrity in the timing domain (without an anti-jam deep dive)
The core safety problem
The most dangerous failure mode is not losing GNSS. It is following a bad GNSS reference and silently biasing the timebase. Timing integrity focuses on detecting suspicious timing behavior and switching policies that protect the disciplined oscillator.
Timing-domain detectors (what can be checked without signal-processing details)
| Detector group | What is observed | Typical interpretation |
|---|---|---|
| PPS phase residual | PPS residual magnitude; jump detection; residual rate-of-change. | Phase steps and inconsistent residuals suggest the reference is not safe to steer. |
| Frequency sanity checks | Estimated Δf/f and drift vs expected temperature/aging behavior. | If the rate change is physically implausible for the oscillator, treat GNSS as suspicious. |
| Receiver health flags | C/N0 thresholds (timing-gating use), satellite count, geometry degradation flags. | Use as gating signals. Poor geometry or weak signals raise suspicion and reduce trust in steering. |
| Time solution consistency | ToD step detection; internal consistency between PPS and time-of-day. | ToD discontinuities are a hard warning. Protect the timebase by freezing or entering holdover. |
Policy responses (graded actions that protect the timebase)
- Level 1 — Degraded trust: reduce steering bandwidth, increase averaging, tighten outlier rejection; continue monitoring residuals.
- Level 2 — Suspicious reference: freeze steering (hold last good), enter holdover, start TE-bound timers, and publish a lower confidence grade.
- Level 3 — Persistent anomaly: remain in holdover, widen bounds as needed, and publish explicit fault flags for downstream safe degradation.
Cross-checking sources (mention-only, timing-domain use)
Integrity improves when a suspect reference can be compared against an independent source: dual GNSS receivers, GNSS + CSAC behavior consistency, or an external 1PPS reference. Cross-checking is used here only as a timing-domain consistency gate, not as an anti-jam algorithm.
Timing integrity is a protection layer. It detects suspicious reference behavior, selects a safe policy, and exports explicit flags so downstream systems can degrade safely.
Timestamp units: time tagging architecture and error sources
Why a “good clock” can still produce “bad timestamps”
Timestamp accuracy is determined by the entire time-tagging chain, not only by the reference oscillator. A clean timebase can still yield poor timestamps when event edges are distorted, clock-domain crossings add uncertainty, or the UTC/ToD mapping is stale or invalid. A robust design therefore reports both timestamp values and timestamp quality.
Time-tagging data path (what each block actually does)
| Block | Role in the chain | Typical failure symptom |
|---|---|---|
| Edge conditioner | Shapes the event edge into a clean digital transition; defines the threshold and edge timing behavior. | Input noise/threshold drift becomes time-walk; timestamps “wander” with amplitude, temperature, or supply. |
| Coarse counter | Counts reference ticks; provides the coarse time base at a known tick period. | Quantization limits; off-by-one cycles appear when capture is near a tick boundary. |
| Fine interpolator (TDC) | Interpolates within one tick to achieve sub-tick resolution (fine bins). | Bin nonlinearity and calibration drift distort fine timing; “resolution” exists but accuracy does not. |
| Clock-domain crossing | Transfers event time from capture domain to processing domain safely and deterministically. | Variable cycle latency or metastability protection adds uncertainty; appears as random-looking jitter. |
| ToD/UTC mapper | Maps counter/TDC time to UTC/ToD using the disciplined timebase state and alignment updates. | Stale/invalid ToD lock causes step changes or growing bias; “timestamp is precise but wrong.” |
Error sources (classified so the dominant term can be attacked)
Coarse tick quantization and fine-bin quantization define a lower bound, but do not guarantee accuracy. Nonlinearity and temperature drift must be calibrated and monitored.
Threshold drift, edge slewing, and input noise convert into time-walk. This is often the dominant “bad timestamp” cause when event signals are not well conditioned.
Synchronizers protect against metastability but can add 1–2 cycle uncertainty if the architecture is not fully deterministic.
A stable oscillator still drifts with temperature/aging. If UTC/ToD mapping is not locked or not current, timestamps become biased.
Engineering techniques that make timestamps trustworthy
- Per-channel delay calibration: measure and store deterministic delays for each input path (cable + isolator/conditioner + I/O path) and apply correction.
- Deterministic capture domain: keep capture and counter in a controlled clock domain; reduce CDC ambiguity by design (not by hope).
- TDC calibration / linearization: correct bin nonlinearity and temperature drift using a calibration table; widen uncertainty when calibration is stale.
- Quality flags: publish lock validity, calibration status, and an uncertainty estimate so the system can reject or down-grade timestamps safely.
Outputs and distribution: 1PPS / 10MHz / IRIG-B / ToD and redundancy
Goal: deliver a clean reference without creating new jitter or failure modes
Outputs are not “free.” Conditioning, isolation, cabling, and fanout buffers can add jitter, distort edges, or introduce reflections. A high-quality GPSDO output stage therefore treats each output as a controlled interface with a defined impedance, edge rate, and explicit health flags.
Output types and electrical boundaries (practical choices)
| Signal | Typical electrical forms | Primary engineering risks |
|---|---|---|
| 10 MHz | Square or sine; LVTTL/LVDS; 50Ω drive where applicable; buffered fanout tree. | Edge sensitivity, reflections, additive jitter accumulation across buffers. |
| 1PPS | Edge-defined pulse; controlled threshold; isolation and clean return path. | Time-walk from thresholds, ringing on long lines, ground bounce coupling into timing edge. |
| IRIG-B | Digital-level or conditioned output; include validity/lock flags. | Misinterpretation when time validity changes; ensure clear “time_valid” semantics. |
| ToD | Serial/parallel time-of-day interface with explicit lock and holdover status. | Stale time mapping; unclear validity can propagate wrong time through the system. |
Distribution and fanout (where additive jitter accumulates)
- Keep fanout deterministic: use a planned distribution tree instead of ad-hoc buffering; limit cascade depth where possible.
- Control impedance and termination: treat 50Ω lines as transmission lines; avoid unterminated stubs that create reflections.
- Protect the reference from digital noise: isolation and clean power/ground segmentation prevent return currents from corrupting edges.
- Export output health: each output domain should publish status so downstream logic can degrade safely.
Redundancy A/B timebases (selection and switching semantics)
Redundancy is not only having two sources; it is having a defined switching meaning. Switching may be glitchless (no phase step), a controlled phase step (bounded and logged), or an alarmed cutover (fast switch with explicit flags). The correct choice depends on system tolerance, but the policy must be explicit and observable.
No output glitches and no phase step. Strongest continuity, highest implementation complexity.
Phase step allowed but bounded, scheduled, and reported. Enables predictable recovery behavior.
Fast switch prioritized. System receives explicit alarms and can degrade immediately.
Use hysteresis and holdoff timers so minor excursions do not cause A↔B oscillation.
Validation & production checklist: proving low noise and trustworthy holdover
“Done” is not a single plot. A GPSDO/atomic-clock subsystem is complete only when low noise, stable holdover, and safe recovery are demonstrated with repeatable setups, clear pass/fail gates, and an evidence pack that can be reproduced in the lab, on the production line, and in the field.
1) Phase noise & integrated jitter — measurement that is actually credible
Define output form (sine/square), impedance (50Ω or logic), and conditioning chain (attenuator/limiter/isolation). Use a low-noise supply and fixed cabling to avoid measurement-to-measurement drift.
Confirm whether the result is limited by the analyzer noise floor. If needed, use correlation/averaging modes or residual measurement techniques and document the mode in the report.
Jitter must specify an integration band. Record the exact lower/upper offsets used and show the integration region on the plot.
Deliver: L(f) plot, integrated jitter number with band, and a one-line summary of setup (output mode, impedance, analyzer mode).
2) Allan/stability planning — τ points, temperature chamber, and aging windows
- τ point selection: choose τ values that map to engineering use (e.g., short-term, mid-term, and long-term holdover relevance), and keep the same τ set across builds for comparability.
- Temperature chamber profile: define ramp rate, soak time per plateau, and logging cadence. Plateaus must be long enough for oscillator thermal settling before sampling stability metrics.
- Aging observation window: track frequency control effort and drift over a longer window so “slow wander” is visible (not just short snapshots).
- Artifacts: Allan (or stability proxy) vs τ curve, chamber profile, and a CSV of key flags/control outputs vs time.
3) Holdover drill — scripted GNSS loss + recovery, with phase-step accountability
Run repeatable GNSS loss windows (e.g., 30 min / 2 h / 24 h as applicable) and record: phase error trace, frequency steering/DAC code, holdover grade, time_valid/tod_locked, and alarm bits. Recovery must be evaluated for phase step and for how quickly noise returns to baseline (avoid “fast relock that injects GNSS noise”).
(a) holdover_TE_slope_max, (b) DAC_headroom_min (no rail-hitting), (c) recovery_phase_step_max, (d) time_valid semantics must match actual lock state, (e) recovery must log a switch/relock event.
4) Environmental robustness — stress only what impacts timing trust
- Temperature: verify no control saturation (steering pegged), no abnormal jitter modulation, and no invalid lock flags during ramps and soaks.
- Vibration (timing impact only): look for phase/jitter modulation correlated with the vibration state label; record the state label in logs.
- Supply disturbance: ensure output edge timing does not degrade under realistic supply ripple/step events; isolate and document any coupling paths.
5) Production screening — compress lab proof into 60–180 seconds
| Gate | What to measure | Why it catches real failures |
|---|---|---|
| Warm-up | Warm-up curve shape and time-to-stable window; record temperature if available. | Separates healthy oscillators from units with abnormal thermal behavior or poor assembly coupling. |
| Lock + headroom | Steering/DAC code range and margin; ensure control is not near rails. | Predicts holdover risk and prevents “works today, fails tomorrow” drift due to limited control authority. |
| Quick PPS stats | Short-window 1PPS time-interval stats (std dev / histogram) and validity flags. | Fast indicator for edge conditioning issues, noise coupling, or CDC problems without full phase-noise runs. |
| Mini holdover | Short GNSS-loss drill; evaluate TE slope proxy + holdover flags + recovery behavior. | Detects model/flag errors and unstable steering logic that only appears during state transitions. |
| Pack & label | Write an evidence summary into unit record: thresholds, results, flags, and key traces IDs. | Makes field RMAs actionable; “trustworthy holdover” becomes traceable per serial number. |
Required logs & quality flags (minimum set for field proof)
Example part numbers (instruments / fixtures / reference modules)
These are representative examples to make procurement and test planning concrete. Equivalent instruments/modules are acceptable if the same measurements and evidence artifacts are supported.
| Category | Example part number | Use in this checklist |
|---|---|---|
| Phase noise analyzer | R&S FSWP | Phase-noise L(f) verification and low-floor measurement configurations (document mode/settings in evidence pack). |
| Signal source analyzer | Keysight E5052B | Alternative phase-noise/jitter characterization approach, especially for comparing builds and verifying setup repeatability. |
| Phase noise + Allan test set | Microchip/Microsemi 5125A | Stability vs τ style verification workflow with a dedicated timing measurement toolchain. |
| Time interval counter | Keysight 53230A | Fast 1PPS time-interval statistics for production screening and regression checks. |
| Time interval analyzer | Pendulum CNT-91 / CNT-91R | High-resolution time interval measurement for PPS jitter histograms and short-window stability proxies. |
| Temperature chamber | ESPEC (example series) | Controlled temperature profiles for τ planning, drift vs temperature, and holdover robustness under thermal stress. |
| Timing GNSS module | u-blox ZED-F9T | Representative timing receiver module category for disciplined timing reference integration and drill scripting. |
| 50Ω attenuator | Fixed 6–20 dB, 50Ω (example) | Stabilizes analyzer input levels and reduces distortion/overload risk during phase-noise and jitter measurements. |
| Low-noise DC supply | Programmable bench PSU (example) | Repeatable supply conditions; supports supply-step tests and avoids measurement contamination from noisy adapters. |
FAQs: practical boundaries, debug logic, and validation signals
These answers focus on GPSDO/atomic-clock engineering tradeoffs and verification signals: oscillator choice, disciplining behavior, holdover time-error budgeting, output semantics, and production screening.
1) OCXO vs CSAC: when does holdover require a CSAC?
CSAC becomes “required” when the mission sets a tight time-error bound over long GNSS outages and temperature cannot be controlled. OCXOs can win on short-term phase noise and may meet holdover if thermal stability is excellent and aging is managed, but CSAC often provides stronger mid/long-term stability. Choose by TE@1h/24h targets and ensure control headroom.
2) Why does a wider GPSDO loop bandwidth sometimes make performance worse?
A wide disciplining bandwidth can inject GNSS measurement noise into the oscillator, degrading phase noise and increasing short-term jitter. Narrower bandwidth filters the reference noise but slows convergence and can complicate state transitions. A practical approach is a staged strategy (frequency pull-in first, then phase alignment) and bandwidth limits that protect near-carrier noise.
3) 10 MHz looks stable but 1PPS is still jittery—where is the problem likely located?
Frequency and time paths are not the same. A clean 10 MHz can coexist with a noisy 1PPS if edge conditioning, thresholds, ground bounce, reflections, or output buffering add timing uncertainty. It can also be a phase-alignment/measurement issue where the PPS comparator path is dominated by noise even while frequency is well disciplined. Treat PPS as a separate integrity chain.
4) How is system jitter estimated from a phase-noise plot, and how should the integration band be chosen?
Phase noise L(f) becomes jitter by integrating phase variance over a defined offset-frequency range and converting to time jitter using carrier frequency. The integration band must match what the downstream system is sensitive to: lower offsets capture near-carrier noise, while higher offsets can be dominated by synthesizers, buffers, and distribution. Always report the exact band; otherwise “jitter” numbers are not comparable.
5) Allan deviation: what do τ = 1 s / 10 s / 100 s mean in engineering terms?
Allan deviation σy(τ) expresses fractional frequency stability over the averaging time τ. Around τ=1 s it reflects short-term stability, τ=10 s often captures mid-term control and environmental coupling, and τ=100 s begins to expose slower wander that matters for holdover. The curve shape indicates which noise processes dominate at each time scale and guides which oscillator best fits outage requirements.
6) How is allowed time error for a 1-hour GNSS outage budgeted?
Time error accumulates from frequency error over the outage, so the budget starts with expected frequency offset and drift from temperature, aging, and noise processes (often informed by σy(τ)). Add uncertainty for model mismatch and the transition into holdover. The output should be an upper bound (or confidence interval) for TE@1h, not a single optimistic number.
7) After GNSS returns, how can recovery avoid a damaging time/phase step?
Recovery must prioritize controlled phase behavior. Common tactics include freezing or limiting steering initially, reducing bandwidth, and then re-engaging correction with a bounded phase ramp rather than an abrupt step. Some systems allow a controlled step with explicit alarm semantics, while others require glitchless continuity. The key is to make the recovery policy deterministic and observable.
8) How can GNSS reference “suspicion” be detected in the timing domain (without anti-jam algorithms)?
Timing-domain suspicion can be flagged by sudden PPS phase jumps, abnormal drift in steering effort, time-of-day discontinuities, sustained low C/N0, or degraded satellite geometry indicators. The safe response is policy-based: freeze steering, reduce bandwidth, enter holdover, and raise alarms with explicit quality flags so downstream systems can react predictably.
9) What determines timestamp accuracy: the counter or the TDC?
The coarse counter sets the base granularity, while the TDC can refine sub-tick timing—but only if it is calibrated and stable across temperature and voltage. In practice, major errors often come from edge conditioning, metastability/clock-domain crossings, and incorrect time-of-day mapping rather than raw TDC resolution. A good timestamp includes both a value and a quality field that reflects capture conditions.
10) How should A/B redundant timebases switch without uncontrolled jumps?
Switching needs a defined semantic: (a) glitchless continuity with phase alignment, (b) a controlled step with a bounded magnitude and explicit alarm, or (c) a hard cutover that is always alarmed. Add hysteresis and holdoff to prevent flapping. Downstream systems must receive the selected reference ID and the measured step/offset so “time trust” remains auditable.
11) In production, how can units with poor phase noise or excessive drift be screened quickly?
Production screening should compress lab proof into short gates: warm-up curve sanity, lock confirmation with healthy flags, steering headroom (no rail-hit), short-window PPS time-interval statistics, and a mini holdover drill that checks TE slope and recovery behavior. These gates correlate strongly with real field failures while staying practical for ATE timing.
12) Why is “ppm” not enough to judge a timebase, and what should be checked instead?
Ppm is a coarse long-term accuracy metric and does not capture near-carrier phase noise, integrated jitter, or time-scale dependent stability that drives holdover. A timebase must be judged across time scales: L(f) and integrated jitter for short-term behavior, σy(τ) for stability vs averaging time, and TE@outage for mission holdover. Quality flags and evidence traces complete the trust story.