Gas Analyzer Front-End (NDIR / TCD / EC)
This page explains how to build a stable gas analyzer analog front-end across NDIR, TCD, and electrochemical sensors—covering source drive, TIA + lock-in detection, temperature control, isolation, EMC hardening, and calibration. It also provides a deterministic evidence-chain for debugging drift, noise, and long-term accuracy in the field.
Mission & System Boundary
Mission: build a repeatable front-end blueprint that keeps readings stable across temperature gradients, long-term drift, and electrical noise—while making failures diagnosable from measurable evidence rather than guesswork.
What this page covers (strict boundary)
- Electrical chain only: source/excitation → sensor output domain → AFE (TIA/bridge/INA) → synchronous detection / filtering → ADC/MCU → isolation → comms I/O.
- Stability loops: temperature measurement + heater control, compensation models, baseline tracking, and drift management hooks.
- Evidence-first debug: which waveforms/fields to capture, where to tap them, and what “healthy vs failing” looks like.
- Design-for-noise: grounding, isolation partitioning, leakage control, EMI coupling paths, and coherent sampling constraints.
Evidence fields (used as a contract across chapters)
- Chain gain: sensor-domain input → ADC codes (include demod/filter gain). Required for repeatable calibration and for detecting hidden saturation.
- Noise floor (in-band): measured after the same bandwidth/decimation used for the final reading—otherwise noise numbers are misleading.
- Drift vs temperature: ppm/°C or LSB/°C after compensation; track both “before” and “after” to prove the model works.
- Baseline stability: zero condition variance over seconds/minutes/hours; use it to separate electrical drift from process variation.
- Recovery time: time to return from saturation/over-range/step disturbance back to spec (includes thermal settling and filter convergence).
- Calibration coefficients: versioned, checksum-protected, with validity flags and temperature segment tables to prevent silent corruption.
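The "drift vs temperature" field can be made concrete as a least-squares slope over a temperature sweep, computed once on the raw reading and once on the compensated reading. The sketch below is illustrative only: the function name `tempco_ppm_per_degc`, the sweep data, and the 24-bit full scale are assumptions, not values from this page.

```python
def tempco_ppm_per_degc(temps_c, readings, full_scale):
    """Least-squares slope of reading vs temperature, in ppm of full scale per °C."""
    n = len(temps_c)
    if n < 2 or n != len(readings):
        raise ValueError("need at least two paired samples")
    t_mean = sum(temps_c) / n
    r_mean = sum(readings) / n
    sxx = sum((t - t_mean) ** 2 for t in temps_c)
    if sxx == 0:
        raise ValueError("temperature sweep has no span")
    sxy = sum((t - t_mean) * (r - r_mean) for t, r in zip(temps_c, readings))
    return (sxy / sxx) / full_scale * 1e6


# Hypothetical zero-gas sweep, readings in ADC codes (LSB).
temps = [10.0, 20.0, 30.0, 40.0, 50.0]
raw = [1000.0, 1100.0, 1200.0, 1300.0, 1400.0]        # 10 LSB/°C before compensation
compensated = [1200.0, 1201.0, 1199.0, 1200.5, 1200.0]
FULL_SCALE = 2 ** 24                                    # assumed 24-bit converter

before = tempco_ppm_per_degc(temps, raw, FULL_SCALE)
after = tempco_ppm_per_degc(temps, compensated, FULL_SCALE)
print(f"before: {before:.4f} ppm/°C, after: {after:.6f} ppm/°C")
```

Logging both numbers for the same sweep is what shows whether the compensation model removes the slope or only shifts it.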
Modality Selection Map (NDIR vs TCD vs EC) — What Changes Electrically
NDIR, TCD, and electrochemical sensors often share the same downstream ADC/MCU and isolation strategy, but the electrical signal domain, the required excitation/reference, and the dominant error source change the entire front-end constraint set. This chapter maps each modality to a measurable budget so the architecture stays stable when the sensing element changes.
The 5 electrical questions that decide the front-end
- Output domain: current (pA–µA), voltage (µV–mV), or resistance change (mΩ–Ω).
- Reference requirement: synchronous reference (lock-in), ratiometric bridge excitation, or electrode bias control.
- Dominant noise: 1/f + leakage, resistor thermal noise, source/thermal drift, or mains/EMI coupling.
- Time constant: optical/thermal settling vs bridge thermal mass vs electrochemical diffusion—sets filter/decimation strategy and recovery behavior.
- Isolation sensitivity: common-mode and ground-loop risk increases with long cabling and high-impedance nodes; architecture must choose where the ADC sits.
Engineering comparison matrix (use as a routing table)
| Constraint | NDIR | TCD | Electrochemical (EC) |
|---|---|---|---|
| Output domain | Photodiode current or thermopile voltage | Resistive bridge imbalance (ΔR) | Cell current (nA–µA) with bias |
| Must-have excitation | Chopped/modulated source + phase reference | Stable bridge excitation (often ratiometric) | Reference/working electrode bias + compliance |
| Dominant errors | Source temperature/aging, phase error, ambient IR drift | Excitation drift, self-heating, thermal gradients | Leakage, bias drift, humidity/temp influence, aging |
| Best stabilization lever | Lock-in demod + coherent sampling + baseline tracking | Ratiometric readout + controlled excitation (CTA if used) | Guarding + ultra-low leakage + stable bias/reference |
| Isolation priority | Medium–High (long sensor head cables can inject CM noise) | Medium (bridge can be robust, but loops can pick up hum) | High (high impedance nodes are ground-loop sensitive) |
| First debug capture | Raw detector + reference phase + demod output | Excitation stability + bridge diff output + thermal response | Bias voltage + input leakage check + zero baseline trend |
| “What breaks first” | Phase/alignment or source thermal drift | Excitation drift or loop stability (if CTA) | Leakage paths or bias/reference instability |
NDIR Source Driver Architecture (Lamp / MEMS IR / LED) + Modulation Strategy
NDIR stability is constrained first by the source: modulation depth must be predictable, the reference must be phase-stable, and the drive noise must not fold into the demodulated band. The driver is treated as a measurable subsystem with explicit evidence taps, not as a black box.
Source type → electrical constraints (what changes in the driver)
- Lamp: strong warm-up drift and thermal inertia; modulation frequency is limited by heating/cooling time constants; overdrive trades lifetime for faster settling.
- MEMS IR emitter: supports higher modulation rates but can exhibit resonance-sensitive behavior; drive wave-shape and peak current limiting are critical.
- IR LED: fast response and easy modulation; optical output tracks junction temperature tightly, so temperature sensing + derating and/or power-aware control becomes the stability lever.
Constant-current vs constant-power drive (stability budget decision)
Constant-current makes electrical excitation repeatable, but optical output still drifts with temperature and aging. Constant-power attempts to reduce thermal drift but introduces new error sources from voltage sensing noise, switching ripple, and estimator bandwidth. The choice is judged by evidence fields, not by preference.
| Decision axis | Constant-current drive | Constant-power drive |
|---|---|---|
| Primary benefit | Repeatable modulation depth (electrical) | Reduced optical drift vs temperature (target) |
| Hidden risk | Optical output still temperature/aging dependent | Power estimate noise + ripple can leak into demod band |
| Key evidence | Drive current ripple, settling time, source temperature | V/I sensing noise, estimator bandwidth, EMI signature |
| First failure symptom | Baseline drift tracks temperature | Residual ripple or “breathing” at demod output |
Modulation waveform + frequency selection (rules that prevent false drift)
- Waveform: square waves maximize amplitude but add harmonics (EMI coupling risk); sine waves reduce harmonics but demand tighter amplitude control.
- Frequency: place modulation above low-frequency drift regions, avoid 50/60 Hz mixing paths, and avoid emitter/fixture resonance bands; keep sampling coherent with the modulation period.
- Sampling alignment: define explicit sample windows and integration windows so the demod result is repeatable across runs and temperature.
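The frequency and alignment rules above can be checked numerically before any hardware exists. The following is a minimal sketch, assuming a boxcar (plain average) integration window; `check_sampling_plan`, its dictionary keys, and the example frequencies are hypothetical. Exact fractions are used so that "integer number of cycles" is tested without floating-point rounding.

```python
from fractions import Fraction

MAINS_HZ = (50, 60)


def check_sampling_plan(fs_hz, f_mod_hz, window_s, n_harmonics=10):
    """Check a sample rate / modulation frequency / window combination."""
    fs, f_mod, win = Fraction(fs_hz), Fraction(f_mod_hz), Fraction(window_s)
    samples_per_period = fs / f_mod
    mod_cycles = f_mod * win
    # Smallest distance from the modulation frequency to any mains harmonic.
    guard_hz = min(abs(f_mod - m * k)
                   for m in MAINS_HZ for k in range(1, n_harmonics + 1))
    return {
        "samples_per_period": samples_per_period,
        # Coherent: whole samples per period and whole periods per window.
        "coherent": samples_per_period.denominator == 1
                    and mod_cycles.denominator == 1,
        # A boxcar window holding whole mains cycles has nulls at 50 and 60 Hz.
        "mains_nulled": all((m * win).denominator == 1 for m in MAINS_HZ),
        "mains_guard_hz": guard_hz,
    }


good = check_sampling_plan(fs_hz=4000, f_mod_hz=80, window_s=Fraction(1, 5))
bad = check_sampling_plan(fs_hz=4000, f_mod_hz=55, window_s=Fraction(1, 5))
print(good)
print(bad)
```

Pass the window as a `Fraction` (or a string such as `"0.2"`) rather than a float, otherwise the binary rounding of `0.2` makes every plan look non-coherent.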
Drive topology + EMI containment (control the coupling path)
- Low-side switching: simplest, but ground bounce can pollute high-impedance AFE references; return current paths must be short and predictable.
- H-bridge (when needed): enables symmetric drive but adds switching nodes; containment focuses on loop area, edge-rate control, and separation from AFE inputs.
- Regulation bandwidth: too slow reduces modulation depth; too fast can amplify switching noise. The loop is tuned against measured ripple and demod residuals.
Safety hooks (measured, not assumed)
- Open/short detection: protects against “silent failure” where algorithms chase a broken source.
- Over-temp derate: reduces drift and extends life; derate state should be logged as a first-class evidence field.
- Aging compensation concept: track run-time and temperature history to justify coefficient updates without hiding real faults.
NDIR Detector Front-End: TIA + Lock-In Detection Chain
NDIR detectors produce small signals that are easily buried by offset, leakage, and ambient drift. The front-end is built as a coherent signal path: the modulation reference defines what is “signal,” and everything else becomes rejectable noise—if phase, bandwidth, and sampling are engineered as a unit.
Detector choice changes the input model (and the failure modes)
- Photodiode (current): the input looks like a current source with junction capacitance. Stability is dominated by input capacitance, feedback compensation, and leakage paths.
- Thermopile (voltage): the input looks like a small voltage with high source resistance. Amplifier input noise and bias/leakage currents become first-order error sources.
TIA design knobs (each knob has a measurable consequence)
- Feedback R (gain): increases sensitivity but raises susceptibility to leakage and slows recovery from saturation; must be validated by recovery time evidence.
- Feedback C (stability): restores the phase margin lost to detector capacitance; too small causes ringing/oscillation, too large narrows bandwidth and attenuates the modulated signal, which reduces demod SNR.
- Leakage & bias current: at high feedback resistance, picoamp-level leakage becomes a large offset with temperature sensitivity; guarding and surface cleanliness become electrical requirements.
- Input protection: clamps must be chosen for ultra-low leakage; “safe but leaky” protection can silently destroy baseline stability.
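A quick budget makes these knobs comparable. The sketch below uses widely quoted first-order approximations: feedback capacitance C_f ≈ sqrt(C_in / (2π·R_f·GBW)), closed-loop bandwidth 1 / (2π·R_f·C_f), leakage offset I_leak·R_f, and resistor thermal noise sqrt(4·k·T·R_f). The function name `tia_budget` and all component values are assumptions chosen for illustration, not recommendations.

```python
import math

K_BOLTZMANN = 1.380649e-23  # J/K


def tia_budget(r_f, c_in, gbw_hz, i_leak, temp_k=298.15):
    """First-order TIA estimates; a starting point, not a substitute for measurement."""
    c_f = math.sqrt(c_in / (2 * math.pi * r_f * gbw_hz))
    return {
        "c_f_farad": c_f,
        "bandwidth_hz": 1 / (2 * math.pi * r_f * c_f),
        "leak_offset_v": i_leak * r_f,
        "rf_noise_v_per_rthz": math.sqrt(4 * K_BOLTZMANN * temp_k * r_f),
    }


# Assumed example: 100 MΩ feedback, 20 pF total input capacitance,
# 10 MHz gain-bandwidth op-amp, 5 pA of combined bias and leakage current.
budget = tia_budget(r_f=100e6, c_in=20e-12, gbw_hz=10e6, i_leak=5e-12)
for name, value in budget.items():
    print(f"{name}: {value:.3e}")
```

With these assumed values the 5 pA leakage alone produces a 0.5 mV output offset, and the computed feedback capacitance is a small fraction of a picofarad, so layout and resistor parasitics are likely to dominate the real value. Both results point back to guarding, cleanliness, and measured step response as the deciding evidence.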
Lock-in options: analog demod vs digital demod (choose by evidence needs)
| Axis | Analog demod (switch/mixer) | Digital demod (multiply + filter) |
|---|---|---|
| Where signal is compressed | Before ADC (reduces ADC dynamic range demand) | After ADC (requires clean sampling + headroom) |
| Common risk | Switch injection and ripple leaking into baseband | Clock jitter and non-coherent sampling → phase wander |
| Key evidence | Residual ripple at demod output, phase symmetry | Phase error, 50/60 fold-in, in-band noise after decimation |
Anti-alias + coherent sampling + decimation (where “false drift” is born)
- Coherent sampling: sampling windows must align to the modulation period; phase slip turns a stable signal into an apparent baseline drift.
- Anti-aliasing: keep switching harmonics and mains components from folding into the baseband after decimation.
- Decimation defines the real noise bandwidth: evaluate noise floor only after applying the same LPF/decimation used for the final reading.
- 50/60 Hz rejection: implement a notch (or an equivalent rejection strategy) before or during demod/decimation so mains interference cannot masquerade as concentration drift.
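A digital lock-in over a coherent window can be sketched in a few lines. The example below assumes a window that holds a whole number of modulation periods and a whole number of 50 Hz cycles; the signal amplitude, phase, offset, and hum level are synthetic test values, and `lock_in` is a hypothetical helper name.

```python
import math


def lock_in(samples, cycles):
    """Return (I, Q) against a reference with `cycles` whole periods in the window."""
    n = len(samples)
    w = 2 * math.pi * cycles / n
    i = 2 / n * sum(x * math.sin(w * k) for k, x in enumerate(samples))
    q = 2 / n * sum(x * math.cos(w * k) for k, x in enumerate(samples))
    return i, q


FS_HZ, F_MOD_HZ, CYCLES = 4000, 80, 16
N = FS_HZ // F_MOD_HZ * CYCLES          # 800 samples = 0.2 s = 10 cycles of 50 Hz

AMP, PHASE, OFFSET, HUM = 1e-3, 0.3, 0.5, 1e-2   # hum is 10x the signal
x = [OFFSET
     + AMP * math.sin(2 * math.pi * F_MOD_HZ * k / FS_HZ + PHASE)
     + HUM * math.sin(2 * math.pi * 50 * k / FS_HZ + 1.0)
     for k in range(N)]

i_out, q_out = lock_in(x, CYCLES)
amp_est = math.hypot(i_out, q_out)
phase_est = math.atan2(q_out, i_out)
print(f"amplitude {amp_est:.6e}, phase {phase_est:.4f} rad")
```

Because the window is coherent with both the modulation and the mains, the DC offset and the hum fall on other bins and cancel in the sums. If the window stops holding whole periods, part of that offset and hum leaks into I and Q, which is the "false drift" mechanism described above.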
Offset management (use only when the evidence justifies it)
- Chopping/auto-zero: reduces low-frequency offset but can introduce ripple and switching artifacts; verify residual ripple in the demod band.
- Correlated sampling: useful when the modulation permits paired samples; validate that it improves baseline stability without increasing recovery time.
TCD Excitation & Bridge Readout (Constant-Temp / Constant-Current) + AFE
TCD readout is a coupled electro-thermal system: excitation defines both the electrical reference and the sensor heating power. Instability usually appears as slow baseline drift, sensitivity changes with ambient conditions, or (in constant-temperature mode) loop oscillation. This chapter treats excitation, bridge, and readout as one measurable chain.
Bridge fundamentals (where drift is born)
- Excitation stability: any drift in excitation appears as an apparent bridge imbalance unless the measurement is ratiometric.
- Lead/contact resistance: wire and connector resistance changes with temperature and mechanical stress; it can dominate low-level drift in long cable assemblies.
- Self-heating: excitation is power; power changes modify element temperature, shifting the bridge in a way that looks like “real signal.”
Constant-current vs constant-temperature (control implication)
| Axis | Constant-current (CC) | Constant-temperature (CTA) |
|---|---|---|
| Main advantage | Simpler; fewer stability risks | Improves immunity to ambient & flow disturbances |
| Primary risk | Baseline drift vs ambient temperature / airflow | Loop instability (ringing/oscillation), thermal runaway |
| Key evidence | Excitation ppm, bridge drift, step response | Stability margin (or overshoot), step response, recovery |
| First debug capture | Excitation monitor + bridge diff + temperature | Error signal + drive power + bridge diff + temperature |
Readout chain (reduce excitation-induced drift at the root)
- Instrumentation amplifier: robust common-mode rejection for small differential signals; validate input bias/current noise vs bridge impedance.
- Differential ADC: enables direct digitization and cleaner ratiometric strategies; requires a deliberate anti-alias approach.
- Ratiometric measurement: measure bridge output relative to the same excitation reference so excitation drift becomes second-order.
- Chopping / auto-zero: reduces low-frequency offset but can introduce ripple; confirm with in-band noise after final filtering/decimation.
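The effect of a ratiometric readout is easy to show with an idealized quarter bridge (one active arm, three fixed arms). This is a sketch under that assumption; the resistances and the 0.1 % excitation drift are made-up numbers.

```python
def quarter_bridge_out(v_exc, r_nominal, delta_r):
    """Differential output of a quarter bridge: V_exc * dR / (4R + 2dR)."""
    active_half = (r_nominal + delta_r) / (2 * r_nominal + delta_r)
    return v_exc * (active_half - 0.5)


R_NOMINAL, DELTA_R = 100.0, 0.1          # ohms, assumed
v_exc_cold, v_exc_warm = 5.000, 5.005    # excitation drifts by 0.1 %

out_cold = quarter_bridge_out(v_exc_cold, R_NOMINAL, DELTA_R)
out_warm = quarter_bridge_out(v_exc_warm, R_NOMINAL, DELTA_R)

# Absolute readout: the excitation drift shows up as an apparent signal change.
abs_change = (out_warm - out_cold) / out_cold

# Ratiometric readout: dividing by the same excitation removes it.
ratio_cold = out_cold / v_exc_cold
ratio_warm = out_warm / v_exc_warm

print(f"absolute change: {abs_change:.4%}")
print(f"ratio cold/warm: {ratio_cold:.9e} / {ratio_warm:.9e}")
```

In hardware the division happens when the ADC reference is derived from the bridge excitation, so the cancellation holds only as far as the two share the same node and the same filtering.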
Protect & detect (make “quiet failures” visible)
- Bridge open/short: detect via out-of-range bridge node voltages and saturation patterns in the readout path.
- Overload & thermal runaway: detect power increasing without convergence; enter a safe derate/disable state and log the event.
- Recovery validation: after overload, measure step response and baseline return time to prevent “stuck drift” being mistaken for concentration change.
Electrochemical (EC) Sensor AFE: Biasing, TIA, and Interference Control
EC sensors often fail quietly: leakage, bias errors, and contamination shift the baseline without obvious alarms. A robust EC front-end is designed around bias stability, leakage budgeting, and controlled recovery after overload—using a potentiostat-like topology with explicit evidence taps.
2-electrode vs 3-electrode cells (electrical meaning)
- 2-electrode: simpler wiring, but the reference point is less controlled; bias errors can translate into drift and non-repeatable sensitivity.
- 3-electrode: separates working (WE), reference (RE), and counter (CE) roles; enables controlled bias but increases compliance and protection requirements.
Bias generation (the foundation of repeatability)
- Low-noise reference: bias noise becomes measurement noise after the TIA and filtering chain.
- Compliance voltage: ensure the loop can hold the intended electrode potential under expected current and temperature conditions.
- Transient protection: hot-plug, ESD, and surge events must not permanently shift the bias point or increase leakage.
TIA + input protection (leakage budget dominates)
- Picoamp bias & leakage paths: board contamination, humidity, and “safe but leaky” clamps can create offsets that dwarf the signal.
- Guarding: guard rings and driven guards are electrical requirements for high-impedance nodes; validate with humidity/handling sensitivity tests.
- Input protection: select protection components by leakage (not only by voltage rating); confirm zero-baseline drift after assembly and cleaning.
Interference control (hooks, not chemistry deep dive)
- Cross-sensitivity: treat as a monitoring and flagging problem at the AFE boundary (baseline anomaly, recovery anomaly) rather than chemistry modeling here.
- Humidity/temperature influence: capture T/RH fields and expose them to compensation tables and health diagnostics.
- Overload recovery: define a recovery timer and validity flag to prevent “polarization tail” from being interpreted as true concentration.
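The recovery timer and validity flag can be as simple as a gate that holds readings invalid for a fixed time after an overload clears. The class below is a minimal sketch: `RecoveryGate`, the 1 µA overload limit, and the 30 s hold time are illustrative assumptions, and a real sensor's recovery time would come from its measured polarization tail.

```python
class RecoveryGate:
    """Marks readings invalid during an overload and for hold_s seconds afterwards."""

    def __init__(self, overload_limit_a, hold_s):
        self.overload_limit_a = overload_limit_a
        self.hold_s = hold_s
        self._overloaded = False
        self._clear_time = None   # time at which the last overload ended

    def update(self, t_s, cell_current_a):
        """Return True when the reading taken at time t_s may be trusted."""
        if abs(cell_current_a) >= self.overload_limit_a:
            self._overloaded = True
            self._clear_time = None
            return False
        if self._overloaded:
            # First in-range sample after an overload starts the hold timer.
            self._overloaded = False
            self._clear_time = t_s
        if self._clear_time is not None and t_s - self._clear_time < self.hold_s:
            return False
        return True


gate = RecoveryGate(overload_limit_a=1e-6, hold_s=30.0)
samples = [(0, 1e-8), (1, 2e-6), (2, 5e-8), (20, 2e-8), (32, 1e-8)]
flags = [gate.update(t, i) for t, i in samples]
print(flags)
```

Baseline tracking and calibration updates should consume this flag, so that samples taken inside the hold window never move the zero point.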
Temperature Control & Compensation (Shared Core for NDIR / TCD / EC)
Temperature is a dominant error source across all three modalities. Treat it as a first-class subsystem with a measurable thermal map, explicit control loops (where required), and a compensation model that is validated by residual temperature coefficient rather than assumed “good enough.”
Where to measure temperature (make gradients observable)
- Source zone (NDIR): emitter temperature drives optical output drift and warm-up behavior; use it to gate validity and to stabilize modulation depth.
- Detector / AFE zone (NDIR/EC): offsets and leakage are strongly temperature dependent; measure near high-impedance nodes and low-noise references.
- Flow cell / sensor chamber (all): airflow coupling and enclosure thermal inertia set the baseline drift and step response seen in the measurement.
- Ambient / enclosure: provides boundary conditions for feedforward and for diagnosing unexpected gradients.
- PCB hot spots: isolate power and digital heat sources; track them as health signals to prevent compensation from masking thermal design faults.
Control options (feedback and feedforward as a system)
- Heater + thermistor/RTD + PID: validate with settling time and overshoot under step disturbances rather than only steady-state error.
- Feedforward: use ambient and power-state predictors to reduce PID workload, minimizing oscillation and reducing warm-up time.
- Combined strategy: feedforward handles predictable disturbances (ambient and power-state changes); feedback closes the residual error and rejects unmodeled disturbances (airflow, enclosure changes).
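The combined strategy can be explored in simulation before tuning hardware. The sketch below assumes a first-order thermal plant (one thermal resistance, one time constant), a PI controller with a simple conditional-integration anti-windup, and a feedforward term computed from ambient temperature. Every parameter value and the function name `simulate_heater` are assumptions for illustration.

```python
def simulate_heater(t_set=50.0, t_amb=25.0, r_th=20.0, tau_s=60.0,
                    kp=0.5, ki=0.02, p_max=3.0, dt=0.5, duration_s=600.0):
    """PI + feedforward on a first-order thermal plant; returns the temperature trace."""
    temp, integ, trace = t_amb, 0.0, []
    for _ in range(int(duration_s / dt)):
        err = t_set - temp
        p_ff = (t_set - t_amb) / r_th            # steady-state power predicted from ambient
        p_cmd = p_ff + kp * err + integ
        p_out = min(max(p_cmd, 0.0), p_max)      # heater delivers 0..p_max watts only
        if p_out == p_cmd:                       # anti-windup: integrate only when unsaturated
            integ += ki * err * dt
        # First-order plant: tau * dT/dt = P * R_th - (T - T_amb)
        temp += dt / tau_s * (p_out * r_th - (temp - t_amb))
        trace.append(temp)
    return trace


trace = simulate_heater()
overshoot_c = max(trace) - 50.0
final_error_c = trace[-1] - 50.0
print(f"overshoot {overshoot_c:.3f} °C, final error {final_error_c:.5f} °C")
```

Overshoot and final error from a step like this are the same quantities the text asks to validate on real hardware; the simulation only helps choose starting gains and shows how much of the steady-state power the feedforward term carries.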
Compensation model (engineered for maintainability)
- Piecewise linear: production-friendly, stable across operating regions; easy to update with calibration points.
- Polynomial: smooth but risky outside calibrated ranges; requires careful validation to avoid “good fit, bad physics.”
- Reference channel: NDIR dual-channel approaches can separate optical/ambient drift from absorption changes without relying on a single sensor temperature.
- Ratiometric strategies: in TCD, reference-based measurement reduces excitation drift so temperature compensation targets true thermal effects.
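A piecewise-linear table with explicit validity limits might look like the following. This is a sketch: `pwl_correction`, the node values, and the choice to clamp and flag outside the calibrated range (rather than extrapolate) are assumptions.

```python
from bisect import bisect_right


def pwl_correction(temp_c, nodes):
    """Interpolate a correction from sorted (temp_c, correction) nodes.

    Returns (correction, valid). Outside the table the nearest node value is
    returned and valid is False, so callers never extrapolate silently.
    Requires at least two nodes with strictly increasing temperatures.
    """
    temps = [t for t, _ in nodes]
    if temp_c < temps[0]:
        return nodes[0][1], False
    if temp_c > temps[-1]:
        return nodes[-1][1], False
    i = min(bisect_right(temps, temp_c), len(nodes) - 1)
    (t0, c0), (t1, c1) = nodes[i - 1], nodes[i]
    return c0 + (c1 - c0) * (temp_c - t0) / (t1 - t0), True


# Hypothetical offset-correction table: (temperature in °C, correction in LSB).
NODES = [(0.0, -12.0), (25.0, 0.0), (50.0, 15.0)]

print(pwl_correction(12.5, NODES))
print(pwl_correction(37.5, NODES))
print(pwl_correction(60.0, NODES))
```

Adding a calibration point is a matter of inserting one node, which is why this form is easier to maintain in production than refitting a polynomial.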
Thermal design hooks (reduce unmodeled coupling)
- Isolation barrier effects: isolators and isolated power can create local hot spots and gradients; treat them as thermal zones with sensors if needed.
- Self-heating: excitation and high-value feedback networks can introduce internal heating; include power state in the compensation context.
- Airflow coupling: enclosure and flow changes move gradients; compensation should use the right measurement zones, not a single “board temp.”
Isolation, Safety, and Grounding (Sensor Head vs Mainboard)
Gas analyzers often operate with long cables, metal enclosures, and harsh EMI/ESD environments. Isolation is not only for safety: it breaks ground loops, improves common-mode noise immunity, and prevents quiet baseline corruption in high-impedance analog front ends.
Why isolate (convert hidden coupling into controlled paths)
- Ground loops: low-frequency hum and drift appear as baseline movement, especially in bridge and TIA measurements.
- Common-mode noise: CM excursions can saturate front ends and fold into baseband after filtering/demodulation.
- Safety and touch/ESD: protects sensor head electronics and user-accessible surfaces in the presence of transients and unknown grounds.
Partitioning (two islands with clear responsibilities)
- Sensor head island: AFE, temperature control, and (often) ADC/clock live close to the sensor to minimize leakage and pickup.
- Processing island: MCU, storage, host interfaces, and high-noise digital subsystems remain on the mainboard side.
Isolation choices (pick the noise boundary on purpose)
- Digitize-then-isolate: common approach; keeps analog small and local, then isolates digital data across the barrier.
- Analog isolation: used only when necessary; validate linearity, drift, and bandwidth against the measurement chain requirements.
- Isolated power: required for sensor head autonomy; treat its switching noise and thermal impact as managed hot spots.
- Isolated comms: SPI/I²C/UART/RS-485 selections follow cable length and noise environment; confirm with CM noise measurements.
Creepage/clearance and leakage paths (barrier integrity under reality)
- Barrier leakage: humidity and contamination can create unexpected leakage across the isolation boundary.
- Parasitic capacitance: high-frequency CM currents can couple across the barrier and return through the AFE reference if the shield/earth strategy is wrong.
Cable shield termination patterns (avoid “shield becomes antenna”)
- Define return paths: route CM currents to a controlled chassis/earth path instead of through sensitive analog grounds.
- Single-point vs multi-point: select termination based on dominant frequency content; validate by measuring noise difference with and without shield changes.
Noise/EMC Hardening for µV/nA Signals (Layout + Filtering + Timing)
Ultra-low signals fail through hidden coupling paths: EMI, leakage, microphonics, and sampling jitter. Hardening is most effective when each aggressor is mapped to a victim node and a specific mitigation, then verified by PSD/FFT and leakage/jitter sensitivity tests.
Layout rules (protect the victim nodes)
- Guard rings and driven guards: surround high-impedance inputs and feedback networks to reduce humidity/contamination leakage and hand-touch sensitivity.
- High-impedance routing: keep sensitive nodes short and isolated from switching edges (DCDC, isolators, MCU) and from board edges/cable entry points.
- Split planes (done correctly): do not break return paths; instead, guide noisy return currents away from analog references and ADC grounds.
Filtering (match frequency budgeting to sampling)
- Input RC: reduces RF/ESD energy and prevents non-linear rectification that turns HF into baseband drift.
- Anti-alias: define the analog bandwidth contract against ADC sampling; prevent out-of-band pickup folding into the demodulated/decimated band.
- 50/60 Hz notch: suppress mains hum without breaking coherent lock-in constraints; validate with FFT peak reduction.
- Spread-spectrum vs coherent lock-in: spread-spectrum may be used for power/digital clocks, but lock-in reference integrity must remain coherent and phase-stable.
Clocking and jitter (preserve reference integrity)
- Lock-in reference integrity: phase/period stability determines demod gain and residual ripple; cross-barrier timing must be controlled.
- ADC aperture jitter: converts timing noise into amplitude noise near modulation/signal bands; quantify with jitter-to-noise sensitivity tests.
- Synchronous sampling: align sampling windows to the modulation reference and use coherent decimation to avoid spectral leakage.
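For a first estimate of jitter sensitivity, the standard full-scale sine-wave approximation SNR ≈ −20·log10(2π·f·σ_t) relates signal frequency and RMS sampling jitter to a jitter-limited SNR. The sketch below applies it; the function name and the example frequency and jitter values are assumptions.

```python
import math


def jitter_limited_snr_db(f_signal_hz, jitter_rms_s):
    """Upper bound on SNR set by sampling-clock jitter for a full-scale sine."""
    return -20 * math.log10(2 * math.pi * f_signal_hz * jitter_rms_s)


# Assumed cases: a 1 kHz modulated signal sampled with 1 ns and 10 ns RMS jitter.
snr_1ns = jitter_limited_snr_db(1e3, 1e-9)
snr_10ns = jitter_limited_snr_db(1e3, 10e-9)
print(f"1 ns: {snr_1ns:.2f} dB, 10 ns: {snr_10ns:.2f} dB")
```

Each tenfold increase in jitter or in signal frequency costs 20 dB, so low modulation frequencies are forgiving of ADC aperture jitter. Phase wander of the lock-in reference across the isolation barrier is a separate effect and still needs its own measurement.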
Mechanical/electrical coupling (make “mystery noise” repeatable)
- Vibration coupling: thermopile/optical assemblies can exhibit microphonics; correlate PSD changes with controlled vibration.
- Cable triboelectric noise: cable motion can inject low-frequency bursts into EC paths; fix routing and shielding to stabilize.
Calibration, Self-Test, and Drift Management (Design for Truth Over Time)
Long-term accuracy requires engineered hooks: zero/span workflows, on-board self-test injections, drift tracking by modality, and a data model that preserves coefficients, validity flags, and evidence logs. “Working today” is not sufficient without measurable truth over time.
Zero/span workflows (one concept, three implementations)
- NDIR: use a reference channel concept to separate optical/ambient drift from absorption changes; validate with calibration residuals and repeatability.
- TCD: use ratiometric strategies so excitation drift does not corrupt calibration; track zero/span residuals over time.
- EC: baseline tracking requires overload-recovery gating; prevent baseline updates during polarization tails.
On-board self-test (stimulus injection points)
- Injected test current/voltage: verifies TIA gain, linearity, saturation and recovery without external gas changes.
- Simulated modulation: validates the lock-in chain (gain/phase/filtering) as a digital/analog loop.
- Bridge test mode: inject known imbalance or switch in reference legs to validate the differential readout and protection logic.
Drift sources (tracked, separated, and managed)
- NDIR source aging: track modulation depth and reference ratios; trigger recalibration or derate when drift rate exceeds thresholds.
- EC contamination/leakage growth: track zero baseline drift and recovery time; surface health degradation and maintenance needs.
- TCD resistor drift: track long-term zero drift and excitation stability; rely on ratiometric design plus scheduled calibration.
Data model (coefficients + tables + validity flags)
- Coefficient storage: versioned, timestamped, and scoped by temperature region; store fit error and last residual.
- Temperature tables: piecewise nodes or polynomial ranges with validity limits to avoid unsafe extrapolation.
- Validity flags: warm-up valid, self-test valid, calibration valid, overload recovery valid, sensor health degraded.
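One possible shape for a versioned, checksum-protected coefficient record is sketched below using only the Python standard library. The byte layout (`<HHI` header, little-endian doubles, trailing CRC-32), the two-slot rollback helper, and all names are assumptions for illustration, not a format defined by this page.

```python
import struct
import zlib

_HEADER = struct.Struct("<HHI")   # version, coefficient count, validity flags


def pack_coeffs(version, flags, coeffs):
    """Serialize a record and append a CRC-32 over header + payload."""
    body = _HEADER.pack(version, len(coeffs), flags)
    body += struct.pack(f"<{len(coeffs)}d", *coeffs)
    return body + struct.pack("<I", zlib.crc32(body))


def unpack_coeffs(blob):
    """Return (version, flags, coeffs), or None if the record fails its checks."""
    if len(blob) < _HEADER.size + 4:
        return None
    body, (crc,) = blob[:-4], struct.unpack("<I", blob[-4:])
    if zlib.crc32(body) != crc:
        return None
    version, count, flags = _HEADER.unpack_from(body)
    if len(body) != _HEADER.size + 8 * count:
        return None
    return version, flags, list(struct.unpack_from(f"<{count}d", body, _HEADER.size))


def load_newest_valid(slots):
    """Pick the highest-version record that passes its checks (rollback on corruption)."""
    valid = [rec for rec in map(unpack_coeffs, slots) if rec is not None]
    return max(valid, key=lambda rec: rec[0]) if valid else None


slot_a = pack_coeffs(3, 0b0101, [1.0, -0.25, 0.5])
damaged = bytearray(pack_coeffs(4, 0b0101, [1.0, -0.25, 0.75]))
damaged[10] ^= 0x01               # simulate a bit flipped by an interrupted write
slot_b = bytes(damaged)

print(load_newest_valid([slot_a, slot_b]))
```

Here the newer record in slot B is corrupted, so the loader falls back to version 3 in slot A instead of silently using bad coefficients. Writing the new record to the inactive slot and verifying it before use is what makes an update survive a brownout.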
Diagnostics & Telemetry: What to Log to Debug Fast
Troubleshooting becomes deterministic when logs form an evidence chain: raw signals → computed metrics → fault flags → actionable conclusions. The goal is a minimal debug packet that can separate AFE issues, timing/lock-in errors, thermal instability, and isolation/power faults without requiring a scope on-site.
Logging layers (raw → health → quality → decision)
- Raw signals: ADC (pre-demod), demod outputs (I/Q or post-demod), temperatures, heater duty, excitation current/voltage, key rails.
- Fault flags: open/short, saturation, lock-in phase error, isolation fault, over-temp, end-of-life/health degraded.
- Quality metrics: SNR estimate, baseline stability, convergence status, calibration age/validity, recovery timers.
- Decision outputs: compact action codes (e.g., CHECK_SYNC, CHECK_LEAKAGE, CHECK_EXCITATION, RECAL_REQUIRED).
Minimum Debug Packet (field → unit → update rate → why it matters)
| Field | Unit | Rate | Why it matters (evidence chain) |
|---|---|---|---|
| timestamp, device_id, fw_ver, coeff_ver, log_schema_ver | — | per packet | Ensures traces are comparable; without versions, logs cannot explain drift across updates. |
| adc_raw_pre_demod[ch] | LSB | event window (50–500 Hz) | Reconstructs pickup/saturation and verifies whether noise is analog or introduced by DSP. |
| demod_I, demod_Q (or post-demod) | LSB or engineering | mid (10–50 Hz) | Shows lock-in gain/phase behavior; Q growth often indicates phase misalignment or coherence loss. |
| phase_err (lock-in) | deg | mid (1–10 Hz) | Direct evidence for reference integrity; correlates with residual ripple and SNR collapse. |
| temp_source, temp_det, temp_cell, temp_amb, temp_pcb_hot | °C | low (0.5–2 Hz) | Explains baseline drift and warm-up; enables gradient diagnosis vs compensation masking. |
| heater_duty (and optional PID e/u) | % | low (0.5–2 Hz) | Distinguishes thermal instability from sensor drift; validates settling time and disturbance rejection. |
| excitation_I/excitation_V (TCD/NDIR source) | mA / V | low (1–10 Hz) | Detects excitation drift or modulation depth loss; separates “sensor changed” vs “drive changed.” |
| rail_afe, rail_adc_ref, rail_iso, rail_main | V | low (1–10 Hz) | Captures brownout/UVLO events and noise coupling across islands; supports isolation diagnosis. |
| snr_est, baseline_rms, drift_slope | dB, LSB, LSB/s | low (0.5–2 Hz) | Quantifies “can trust output” and correlates with EMI/leakage/thermal disturbances. |
| converged, warmup_valid, overload_recovery | bool / s | low (0.5–2 Hz) | Prevents baseline updates during transient tails; explains “works then suddenly wrong” reports. |
| cal_age, cal_valid, last_residual | h / bool / %FS | low (0.1–1 Hz) | Links output accuracy to calibration truth; enables deterministic maintenance triggers. |
| fault_flags (open/short/sat/iso/ot/eol) | bitmask | event + low | Drives triggers for raw capture; provides immediate bucketization for fault trees. |
| action_code (CHECK_SYNC, CHECK_LEAKAGE, …) | enum | event | Turns evidence into a first step; reduces “random debugging” time. |
Hardware hooks (example MPNs) that make the logs trustworthy
Telemetry is only as good as the measurement chain. The following example parts are commonly used to implement stable sensing, reference integrity, isolation, and event-safe logging.
| Subsystem | Example MPNs | Why it helps diagnostics |
|---|---|---|
| Precision ADC | TI ADS124S08, ADS1220; ADI AD7124-4/AD7124-8; Microchip MCP3564 | Stable raw channels (pre/post demod) with low noise; enables meaningful PSD and drift tracking. |
| Low-noise op-amp (TIA/filters) | TI OPA188, OPA189, OPA140; ADI ADA4522-2, ADA4530-1; Microchip MCP6V51 | Reduces baseline drift and offset uncertainty so logs reflect environment faults rather than amplifier artifacts. |
| Instrumentation amp (bridge) | TI INA333, INA826; ADI AD8421; Maxim MAX4208 | Improves bridge differential integrity; makes excitation/bridge imbalance telemetry interpretable. |
| Voltage reference | TI REF5025 / REF5050; ADI ADR4525 / ADR4550; Maxim MAX6070 | Reference stability ties directly to raw ADC credibility, drift/day estimates, and calibration residuals. |
| Digital isolator | TI ISO7741 / ISO7842; ADI ADuM141E / ADuM1250; Silicon Labs Si86xx | Prevents ground-loop corruption; logs can separate sensor-head issues from mainboard noise. |
| Isolated DC-DC | Murata NXJ1S; RECOM R05P05S; TI SN6505 + transformer (driver) | Sensor-island supply stability; rail telemetry becomes meaningful and repeatable across installations. |
| High-side / eFuse (rail monitor) | TI TPS25940, TPS2660; ADI LTC4368; Infineon PROFET family (varies) | Captures surge/brownout events; supports deterministic “rail fault → recovery” logging. |
| TVS / protection | Littelfuse SMBJxx, SMAJxx; Nexperia PESD series; TI TPD1E10B06 | Enables ESD event classification; reduces “mystery resets” and protects high-impedance inputs. |
| Temp sensor | TI TMP117, TMP102; ADI ADT7420; Maxim MAX31865 (RTD interface) | Accurate thermal evidence (gradient + settling time) for compensation validation and drift separation. |
| Non-volatile storage | Winbond W25Qxx (SPI NOR); Microchip 24LCxx (I²C EEPROM); Cypress/Infineon FM24Cxx (FRAM) | Stores coefficients, validity flags, and debug snapshots safely across brownouts and field events. |
| RS-485 PHY (long cable, optional) | TI SN65HVD178x; ADI ADM485; Maxim MAX13487E | Robust comm telemetry over long cables; supports isolation diagnosis and CRC/err-rate logging. |
Evidence dashboard logic (signals → metrics → fault tree)
- If baseline drifts: check temp gradients + heater duty + leakage indicators + rail noise; only then adjust compensation.
- If SNR collapses: check phase_err + coherence + ADC saturation + excitation ripple; verify with event-window raw capture.
- If “random jumps” appear: correlate with cable/ESD flags, rail dips, and isolation fault counters.
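The dashboard rules can be expressed as a small triage function that maps one telemetry snapshot to action codes. The sketch below reuses field names from the debug packet (`snr_est`, `phase_err`, `drift_slope`, `temp_det`, `temp_amb`, `cal_valid`). The `excitation_ripple_pct` field, the `OK` code, every threshold, and the simple "small detector-to-ambient gradient means thermally stable" test are placeholders invented for illustration.

```python
from enum import Enum


class Action(Enum):
    OK = 0                 # hypothetical "nothing to do" code
    CHECK_SYNC = 1
    CHECK_LEAKAGE = 2
    CHECK_EXCITATION = 3
    RECAL_REQUIRED = 4


def triage(m, lim):
    """Map one telemetry snapshot to an ordered list of first-step actions."""
    actions = []
    if m["snr_est"] < lim["snr_min_db"]:
        if abs(m["phase_err"]) > lim["phase_err_max_deg"]:
            actions.append(Action.CHECK_SYNC)
        if m["excitation_ripple_pct"] > lim["ripple_max_pct"]:
            actions.append(Action.CHECK_EXCITATION)
    # Only blame leakage for drift once thermal causes look unlikely.
    thermally_stable = abs(m["temp_det"] - m["temp_amb"]) < lim["gradient_max_c"]
    if abs(m["drift_slope"]) > lim["drift_max_lsb_s"] and thermally_stable:
        actions.append(Action.CHECK_LEAKAGE)
    if not m["cal_valid"]:
        actions.append(Action.RECAL_REQUIRED)
    return actions or [Action.OK]


LIMITS = {"snr_min_db": 40.0, "phase_err_max_deg": 2.0, "ripple_max_pct": 0.5,
          "gradient_max_c": 5.0, "drift_max_lsb_s": 0.1}
healthy = {"snr_est": 62.0, "phase_err": 0.3, "excitation_ripple_pct": 0.1,
           "temp_det": 31.0, "temp_amb": 29.5, "drift_slope": 0.02, "cal_valid": True}
phase_slip = dict(healthy, snr_est=28.0, phase_err=6.5)
leaky = dict(healthy, drift_slope=0.4)

print(triage(healthy, LIMITS), triage(phase_slip, LIMITS), triage(leaky, LIMITS))
```

Keeping the rules in one function makes the fault tree reviewable and testable against recorded debug packets, which is harder when the same checks are scattered through firmware.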
FAQs (Troubleshooting Accordion)
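Each answer follows the same evidence-chain structure: 1) short answer, 2) what to measure, 3) first fix, 4) where to read (chapter link). The H2-n tags count this page's chapters in order, from H2-1 (Mission & System Boundary) through H2-11 (Diagnostics & Telemetry).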
1 NDIR reading drifts with ambient temp—source aging or detector/TIA drift? Maps to: H2-4 / H2-7 / H2-10
- Detector/TIA baseline vs temp_det and temp_amb (slope + repeatability).
- Source drive telemetry: excitation_I/mod depth proxy (or current ripple) vs time.
- Lock the measurement to a stable thermal state (warm-up valid) and re-check drift; then update compensation tables only after stability is proven.
2 Lock-in output unstable—phase misalignment or aliasing? Maps to: H2-4 / H2-9
- phase_err (or I/Q balance) while holding modulation frequency constant.
- FFT/PSD of adc_raw_pre_demod to identify strong out-of-band aggressors near harmonics.
- Enforce coherent sampling (reference-aligned windows) and tighten the anti-alias filter before changing demod math.
3 Works on bench, fails in enclosure—thermal gradient or ground loop? Maps to: H2-7 / H2-8
- Temperature map: temp_cell, temp_pcb_hot, temp_amb and heater duty during the failure.
- Noise delta with/without external cable/shield connection (CM sensitivity test).
- Stabilize thermal zones first; if the symptom persists, re-terminate shield/earth consistently and validate isolation partitioning.
4 TCD bridge saturates at startup—excitation loop or sensor overheat? Maps to: H2-5 / H2-7
- Excitation step response: excitation_V/I and bridge differential during the first seconds.
- Local temperature rise near the bridge element (temp_pcb_hot or sensor temp if available).
- Implement a soft-start (slew-limit) on excitation and verify loop stability margins before increasing drive.
5 TCD sensitivity varies with airflow—mechanical/thermal time constant issue? Maps to: H2-5 / H2-7
- Step response of bridge output vs airflow changes (time-to-settle and overshoot).
- Temperature gradient between sensor region and ambient (temp_cell vs temp_amb) under airflow.
- Re-tune the excitation/CTA control dynamics (or sampling/averaging) to match the dominant thermal time constant.
6 EC zero baseline won’t settle—leakage, bias error, or contamination? Maps to: H2-6 / H2-9 / H2-10
- Bias node stability (reference/bias voltage) and baseline_rms / drift slope over time.
- Leakage sensitivity: baseline shift vs humidity/board state (cleaning vs contaminated).
- Audit leakage paths (input protection, contamination, guarding) before changing the baseline algorithm.
7 EC signal noisy when cable moves—triboelectric noise or shielding termination? Maps to: H2-9 / H2-8
- Event-window raw capture while moving the cable (PSD/burst signature correlation).
- Noise delta vs shield termination strategy (single-point vs both-ends, earth vs floating).
- Apply strain relief and standardize shield termination at the isolation boundary (do not “guess” per install).
8 50/60 Hz hum dominates—filtering or grounding? Maps to: H2-9 / H2-8
- Hum peak amplitude in FFT (50/60 + harmonics) under different shield/earth states.
- Notch effectiveness and residual ripple after demod/decimation (post-demod metrics).
- Fix ground/shield termination first if hum is contact-sensitive; otherwise implement notch + anti-alias without breaking coherence.
9 Calibration fails after weeks—source power drift or coefficient storage corruption? Maps to: H2-3 / H2-10
- Drive telemetry trend: excitation_I stability and modulation depth proxy vs time.
- Coefficient integrity: version, CRC/pass flags, last residual, and write events (brownout history).
- Add atomic/verified writes (CRC + version + rollback) for coefficient storage before re-tuning calibration logic.
10 Self-test passes but readings wrong—test injection point missing a failure mode? Maps to: H2-10 / H2-11
- Compare self-test response and real-signal response at the same observation point (pre/post demod where applicable).
- Check whether the test stimulus excites phase/coherence and anti-alias behavior (not just DC gain).
- Move/add injection points so the test traverses the full chain (AFE → demod → ADC → DSP), not a shortcut.
11 After isolation added, noise improved but offset changed—bias path or barrier leakage? Maps to: H2-8 / H2-6
- Bias node and reference voltages on both sides of isolation (before/after change).
- Offset shift vs humidity/temperature to detect barrier leakage or board contamination effects.
- Define a single authoritative bias/reference domain and ensure the isolated island references are explicit, not accidental.
12 Heater control oscillates—PID tuning or sensor placement? Maps to: H2-7
- Heater duty and temperature step response: overshoot, settling time, and disturbance rejection.
- Gradient evidence: delta between temp_pcb_hot, temp_cell, and ambient during oscillation.
- Move the sensor closer to the controlled mass (or add a second sensor) before tightening PID gains.