123 Main Street, New York, NY 10001

Tankless Water Heater Control Board: Sensing, Power & Safety

← Back to: Smart Home & Appliances

Core Idea • Tankless Water Heater Control Board

A tankless water heater controller is won or lost on truth and safety—prove flow and temperature with clean evidence, then modulate SCR/IGBT power while enforcing hardware-level dry-fire, over-temperature, and RCD leakage protection.

When faults happen, correlate a few signals (Vrail/reset reason, gate/Vsw edges, flow pulses, and ΔT slope) to separate real hazards from sensor glitches or EMI quickly.

H2-1 — Definition & Boundary (what this page covers)

Intent: lock scope • prevent cross-topic overlap • set the evidence chain

A tankless water heater control board regulates outlet water temperature in real time by combining flow truth (pulse/pressure plausibility) with temperature truth (inlet/outlet sensing), then modulating heater power through an SCR/triac or IGBT stage. It must also enforce a hardware-first safety chain: leakage/RCD, over-temperature, and dry-fire/no-flow interlocks.

This chapter defines the board-level boundary: sensing AFEs, power modulation, interlocks, and the minimum evidence needed to debug field failures—without expanding into boiler/furnace, HVAC system design, or HEMS/platform architecture.

Typical board blocks (from mains to water safety):

  • Mains entry → fuse/NTC/inrush → EMI filter & surge clamps
  • Power stage (SCR/triac phase control or IGBT switching) → heating element
  • Sensing → flow pulses/pressure (optional) + inlet/outlet temperature → AFE/filter → ADC/capture
  • Controller → state machine + modulation command + fault latch + event logs
  • Safety interlocks → leakage/RCD trip, over-temp cutoff, dry-fire/no-flow inhibit
In scope: evidence, thresholds, probe points, interlocks Out of scope: boiler/furnace control, HVAC zoning, HEMS/cloud

What this page delivers (actionable outputs):

  • A board-level map that separates sensing, control, and power & safety chains.
  • Clear discriminators for real overheat vs sensor glitch, and dry-fire truth vs false no-flow.
  • SCR/IGBT modulation failure signatures tied to probe points and fault logs.
  • A hardware-first safety view: what must remain safe even during MCU reset/brownout.
  • A repeatable debug entry: the first measurements and first logs to capture on-site.
Figure F1 — Tankless Control Board Boundary ICNavigator Tankless Control Board — Boundary Chain Mains In AC entry EMI/Inrush filter + clamp Power Stage SCR / IGBT Heater thermal load Sensing Flow + Temp Controller MCU + Logs Safety RCD / OTP / Dry-Fire Scope lock: Board-level evidence chain only (signals • modulation • interlocks • logs). No boiler/furnace, HVAC system, or HEMS platform expansion.
Cite this figure Figure ID: F1 · Tankless Control Board Boundary · ICNavigator
Figure F1 compresses the page boundary into a single evidence-ready chain: energy path (mains→switch→heater) plus truth sources (flow/temp) and hardware-first interlocks (RCD/OTP/dry-fire).

H2-2 — System Block Diagram: Signal Chain + Power Chain + Safety Chain

Intent: a debug-first mental map • three lanes • probe/log entry points

A tankless heater is easiest to debug when the design is viewed as three parallel lanes that interact through a small number of measurable choke points. The sensing lane defines truth signals (flow and temperatures). The control lane defines decisions and records (state machine, modulation command, fault latches, event logs). The power & safety lane defines energy delivery and independent shutdown (gate/isolation, SCR/IGBT, current sense, thermal cutoffs, leakage/RCD trip).

The diagram below is intentionally drawn to match field workflow: verify truth sources → verify command → verify switching → verify safety latch, then compare with fault logs.

Lane meanings (what each lane must prove):

  • Sensing lane: proves water is moving and temperatures are credible (detect open/short/noise; enforce plausibility).
  • Control lane: proves the firmware commanded the correct power level and recorded why it changed (states, counters, resets).
  • Power & safety lane: proves the switch actually delivered the commanded power, and safety can shut down even during MCU faults.

Top 6 probe points (minimum set that separates most failures):

  • AC input / post-EMI node: line dip or surge correlation with trips and resets.
  • MCU rail (3.3V/5V): brownout vs watchdog vs noise-induced reset.
  • Flow pulse at MCU pin: integrity, missing edges, EFT/ESD susceptibility.
  • Temp AFE node (NTC/RTD divider): noise, intermittent open/short, reference coupling.
  • Gate command + switch node: drive timing, dv/dt overshoot, abnormal commutation.
  • RCD/leakage trip output: real leakage trip vs false trigger; latch behavior verification.

Top logs to capture (first-pass evidence for remote diagnosis):

  • Fault code + latch flags (RCD / over-temp / dry-fire / over-current).
  • Inlet/outlet temperatures (raw + filtered), plus sensor plausibility status.
  • Flow estimate (pulse count / period) and “no-flow” timer counters.
  • Commanded power level (phase angle or duty) and ramp/limit events.
  • Reset reason (BOR/UVLO/WDT) and last rail snapshot if available.
  • Trip timing markers (time from enable to trip; last stable state).
Figure F2 — Three-Lane Debug Map (Sensing • Control • Power & Safety) ICNavigator Debug-First System Map (3 Lanes) Sensing lane Control lane Power & safety lane Flow pulse truth Temp AFE inlet / outlet ADC/Capture filters MCU State enable • ramp • fault Modulation phase / duty Logs/Latches codes • snapshots Gate/ISO default-off SCR/IGBT switching Heater thermal RCD/OTP trip latch First probes AC in MCU rail Flow pin Temp node Gate RCD out
Cite this figure Figure ID: F2 · Three-Lane Debug Map · ICNavigator
Figure F2 is structured for diagnosis: validate truth sources (flow/temp) → validate command and state/logs → validate switching → validate independent safety trip/latch.

H2-3 — Temperature Sensing & AFE (accuracy + placement + fault detection)

Intent: build “temperature truth” • prevent false over-temp • keep control stable

Temperature sensing in a tankless heater is not just a sensor choice—it’s a truth pipeline. The pipeline must deliver a temperature value that stays credible under switching noise, wiring stress, and thermal lag. Two targets dominate: stable outlet regulation and hardware-safe over-temperature decisions.

Truth: inlet/outlet credibility Robust: noise + wiring faults Fast: minimal lag Safe: over-temp must not be fooled

Sensor types: NTC vs PT1000 (engineering tradeoffs)

  • NTC: high sensitivity and low cost, but accuracy depends strongly on tolerance, self-heating, and how the divider is biased. Long wiring can increase noise pickup and intermittent contact faults.
  • PT1000: more linear and repeatable over temperature, but needs a controlled excitation path; wiring resistance and reference stability must be managed to keep drift predictable.
  • Isolation needs: if the sensor is exposed to harsh wiring/harness conditions, treat the interface as a noise antenna; the AFE must protect ADC/reference integrity first, then chase accuracy.

Design rule: pick the sensor based on the failure you cannot tolerate (false over-temp vs slow response vs drift), then design the AFE so the ADC sees a low-impedance, filtered, and reference-stable node.

AFE fundamentals (divider / excitation / filtering / anti-alias)

  • Divider & excitation: keep the measurement node impedance low enough that ADC sampling does not pull it around; bias so the critical operating range uses the ADC resolution efficiently.
  • RC filter: filter the switching-coupled ripple, but avoid a time constant so large that outlet control becomes “late.” A filter that is too heavy hides real slope changes.
  • Anti-alias: if switching noise or phase-control ripple has energy near the sampling band, the ADC can fold it into a fake temperature wobble. Align filtering and sampling timing to avoid aliasing.
  • Reference & grounding: the temperature node is only as good as its reference. Poor return routing can turn power-stage current into apparent temperature movement.
Too high impedance: noisy ADC Too much RC: slow response Bad timing: periodic wobble Bad return: step jumps on heating

Placement strategy (inlet / outlet / heater block)

  • Inlet temperature: explains demand changes; a cold-water step should be visible before outlet errors accumulate.
  • Outlet temperature: primary control target; this channel must be stable and resistant to switching noise because it drives modulation.
  • Heater block / heat exchanger surface: safety-centric sensing; this channel helps catch localized overheating earlier than outlet sensing, especially at low flow.
  • Thermal lag: every placement introduces delay. The correct design makes lag explicit and uses it in fault discrimination (slope and timing), rather than pretending the sensor is instantaneous.

Fault patterns & detection (what to flag, what to trust)

  • Open/short: readings saturate to rail-like extremes or become immovable; flag immediately and inhibit heating when temperature truth is lost.
  • Intermittent contact: step-like spikes often synchronized with vibration or harness movement; use glitch counters and plausibility checks.
  • Slow response / thermal lag: temperature changes appear late compared to power changes; control may hunt if the loop expects faster physics.
  • Grounding noise coupling: periodic “texture” or step jumps appear when the power stage switches; the temperature node mirrors switching activity rather than water thermal dynamics.

Evidence hook: “real over-temp” vs “sensor glitch” (minimum proof)

  • Check waveform texture: a sensor glitch from coupling often shows ripple or spikes aligned with power switching/phase changes at the ADC node.
  • Check energy consistency: real overheating correlates with sustained power input and a physically plausible slope; random instant jumps without slope continuity are suspect.
  • Check multi-point agreement: inlet/outlet/heater-block channels should move with coherent timing. A single channel jumping alone suggests interface or reference issues.
  • Log snapshot: store raw ADC + filtered value + plausibility flags at trip time; the trip must be explainable after the fact.
Figure F3 — Temperature Truth Chain Temperature Truth Chain ICNavigator Sensor NTC / PT1000 AFE divider + filter ADC sampling Plausibility open/short/noise Over-Temp Decision trip / limit Log Snapshot raw + flags Noise coupling path power switching → ADC node texture Placement hints inlet outlet heater block
Cite this figure Figure ID: F3 · Temperature Truth Chain · ICNavigator
Figure F3 focuses on credibility: sensor→AFE→ADC→plausibility→trip/log, plus a simple coupling arrow showing how switching activity can imprint “texture” on the ADC node.

H2-4 — Flow Sensing & AFE (Hall/turbine/ΔP) + “dry-fire truth”

Intent: build “flow truth” • prevent dry-fire • reject false pulses

Dry-fire protection depends on a simple principle: heater power is only allowed when flow truth is credible. A robust design treats the flow channel as an EMI-exposed digital interface and validates it with plausibility checks. For safety, flow truth should be paired with a second key: temperature rise behavior under commanded power.

Key A: flow plausible Key B: temp slope plausible Result: enable vs trip latch

Flow sensor options (and how they fail)

  • Hall turbine pulse: common and inexpensive; failures include missing pulses, bursty noise pulses, or a stuck rotor that reports near-zero frequency.
  • Reed switch: simple interface but needs careful debouncing; contact bounce can look like flow spikes.
  • Differential pressure (ΔP): provides an analog truth channel; issues include drift, clog sensitivity, and AFE offset errors that mimic low-flow.

AFE & digital capture (debounce, pullups, EMI hardening)

  • Input conditioning: ensure a defined logic level in the presence of long harnesses; avoid floating inputs that self-trigger.
  • Debouncing / edge qualification: reject microsecond-scale spikes that do not match physical rotor behavior.
  • Pulse integrity: preserve clean edges into the capture pin; excessive RC can smear edges and create missed counts at higher flow.
  • EMI hardening: treat the flow line as an antenna; clamp and route it to avoid coupling from the power stage.

Flow plausibility (minimum checks that prevent unsafe heating)

  • Minimum flow threshold: below a safe threshold, inhibit heater enable (prevents localized overheating).
  • Pulse-rate sanity: reject physically impossible jumps (e.g., instant transitions from no pulses to extreme frequency bursts).
  • Stuck detection: detect long periods of constant or missing pulse behavior inconsistent with user demand.
  • No-flow timer: when pulse absence persists beyond a short window, force heater disable and record a dry-fire risk event.

Safety rule: when flow truth is uncertain, the safe action is to inhibit heating and request a re-check, not to “guess” flow from temperature alone.

Dry-fire truth: combining flow truth + temperature rise behavior

  • Dual-key gating: heater enable requires both a plausible flow estimate and a plausible temperature rise signature under commanded power.
  • Fast-local overheat: low/no flow causes heater-adjacent temperature to climb faster than outlet dynamics can explain; use slope checks to detect this early.
  • False pulse defense: if flow pulses exist but temperature behavior contradicts expected energy transfer, treat pulses as suspect and latch a safety event.

Evidence hook: minimum “two measurements”

  • Measurement 1 — Pulse train integrity: observe the flow pin waveform for clean edges and absence of narrow spikes; correlate pulse timing with real faucet behavior.
  • Measurement 2 — Inlet/outlet temperature slope: during a controlled power step, verify temperature rise timing and slope are physically consistent with the reported flow.
Figure F4 — Dry-Fire Decision Chain (Two-Key Gating) Dry-Fire Decision Chain ICNavigator Flow Hall / reed / ΔP Condition debounce Capture edges Plausibility min / sanity / stuck Temp Slope inlet vs outlet AND two keys Enable heater power Trip/Log dry-fire risk
Cite this figure Figure ID: F4 · Dry-Fire Decision Chain · ICNavigator
Figure F4 enforces two-key safety: flow plausibility and temperature rise behavior must agree before heater enable; otherwise the trip/latch path records a dry-fire risk event.

H2-5 — Sensor Fusion & Control Targets (stable outlet temperature)

Goal: stable outlet temperature • fast flow-step recovery • no hunting

Outlet temperature stability in a tankless heater is limited by two realities: flow steps change heat demand instantly, while the system reacts with measurement delay and thermal inertia. A practical controller treats the system as a two-channel problem: flow-based feedforward absorbs the big disturbance, while outlet feedback trims the residual error.

Target: low ripple Target: fast recovery Guard: safe ramp Guard: trust-aware

Why outlet temperature oscillates (the mechanisms)

  • Flow step response: a faucet change alters the required power immediately, but the commanded power updates later (sampling, compute, power modulation update).
  • Thermal inertia: the outlet sensor reports a delayed result of heat transfer, so the controller “sees the past” and can over-correct.
  • Sensor noise vs control gain: chasing small ADC noise with aggressive gain creates frequent power toggling and audible/visible hunting.
  • Quantized power modulation: phase-angle or PWM quantization can introduce a limit cycle if the loop tries to regulate within a narrower band than the actuator can resolve.

Practical control structure: feedforward + feedback

  • Feedforward (flow → baseline power): compute a baseline power estimate from flow and inlet temperature to preempt outlet error during flow steps.
  • Feedback (outlet error → trim power): use outlet temperature error to correct model mismatch, sensor drift, and unmodeled heat loss.
  • Final command: Power_cmd = Base(flow, inlet) + Trim(outlet_error), then apply safety limits and rate control.
  • Separation of roles: feedforward handles “big and fast,” feedback handles “small and accurate.”

Design rule: if stability relies only on outlet feedback, response becomes slow or oscillatory under thermal lag. Feedforward reduces the required feedback aggressiveness.

Tuning knobs tied to symptoms (what to change first)

  • Rate limiting (dP/dt): reduces rapid command swings. Use when power commands look “saw-tooth” and outlet temperature tracks the saw-tooth.
  • Anti-hunting (deadband / integral clamp): prevents the loop from chasing noise near the setpoint. Use when the outlet crosses the target repeatedly with small amplitude.
  • Setpoint ramp: avoids cold-start overshoot. Use when start-up produces a hot spike before settling.
  • Cold-start mode: temporarily cap maximum power, increase reliance on feedforward, and relax feedback gain until the heat exchanger warms.
  • Actuator resolution awareness: widen the regulation band if phase-angle/PWM steps are too coarse to hold a tighter band without limit cycling.

Fault-aware control: when one sensor becomes untrusted

  • Untrusted flow: treat as a dry-fire risk. Inhibit heater enable, latch a fault, and require re-validation of flow truth before re-enabling.
  • Untrusted outlet temperature: reduce allowed power, rely on conservative limits and secondary safety sensing, and log the trust loss for service action.
  • Untrusted inlet temperature: degrade feedforward to a conservative baseline and let outlet feedback correct slowly with reduced gain.
  • Trust as a state: sensor plausibility flags should drive a control state machine (normal → degraded → inhibited), not a single conditional statement.

Safety bias: when the sensor that prevents unsafe heating becomes uncertain (typically flow), the safe output is heater inhibit + latch + log snapshot.

Figure F5 — Feedforward + Feedback Control Loop Stable Outlet Control Loop ICNavigator Flow truth Inlet temp Feedforward base power Ramp setpoint Σ combine Knobs rate / hunt Power Cmd limited Heater + Water thermal inertia Outlet feedback Feedback trim Trust flags
Cite this figure Figure ID: F5 · Feedforward + Feedback Control Loop · ICNavigator
Figure F5 shows a practical loop: flow+inlet feedforward sets baseline power, outlet feedback trims error, and stability knobs (rate limit, anti-hunt, ramp) shape response under thermal inertia.

H2-6 — Power Stage Topologies: SCR/Triac vs IGBT Switching

Goal: choose topology • understand stressors • recognize failure signatures

Tankless heater power stages usually fall into two classes: phase-angle control using SCR/triac for AC resistive elements, and switching control using IGBT (or similar) when higher power control dynamics or specific implementations demand it. The topology choice changes the dominant stressors (dv/dt, di/dt, surge, line dips) and the observable failure signatures.

When SCR/triac phase-angle control is used

  • Best fit: AC resistive heater elements where cost and simplicity dominate and modulation is performed by firing angle per half-cycle.
  • Observable “texture”: line current is chopped; noise and conducted emissions often track firing angle changes.
  • Common pitfalls: dv/dt false triggering, noisy zero-cross references, and temperature/flow measurement interference that appears synchronized with phase control.
Modulation: phase angle Signature: chopped current Risk: dv/dt misfire

When an IGBT switching stage appears

  • Best fit: implementations that require high-power switching dynamics, finer control granularity, or a switching power path in the heater design.
  • Observable “texture”: a switching frequency exists; dv/dt edges are sharper and can couple into sensing/capture paths if layout and reference strategy are weak.
  • Common pitfalls: gate ringing, Miller turn-on, current-sense blanking errors, and control-rail resets during high di/dt transients.
Modulation: switching Signature: PWM edges Risk: di/dt injection

Key stressors (what they cause, what they look like)

  • dv/dt: false triggering and EMI; shows up as sharp node spikes and sensing “texture” aligned with switching or firing changes.
  • di/dt: ground bounce and reference shifts; shows up as sudden ADC shifts, MCU brownout/reset, or capture-pin glitches during switching transitions.
  • Surge current: cold element resistance and inrush create peak stress; shows up as early-life failures, fuse/NTC heating, or intermittent trips at turn-on.
  • Line dip: control rail droops while power devices may still be energized; requires default-off gating and safe restart sequencing to prevent unsafe states.

Design rule: every topology must define “default OFF” behavior across line dips. A safe system assumes the controller can reset at the worst moment.

Thermal design ties (heatsink, junction temperature, derating)

  • Junction temperature determines reliability and drift; rising temperature changes device behavior and can turn borderline EMI/false trip issues into repeatable faults.
  • Heatsink and airflow are part of the electrical design boundary: thermal resistance sets the maximum continuous power without triggering protective cutbacks.
  • Derating strategy: reduce allowed power as temperature climbs; log thermal limit events to distinguish “real thermal” from sensor glitches.
  • Failure signature: problems that appear only after warm-up often indicate thermal drift, insufficient heatsinking, or mounting/interface degradation.

Topology decision criteria + failure signatures (what to decide, what to watch)

  • Choose SCR/triac when AC resistive control is sufficient and the design can tolerate phase-angle EMI signatures; watch for dv/dt misfire, noisy zero-cross, and phase-synchronous sensing artifacts.
  • Choose IGBT switching when switching control is required for the implementation; watch for gate ringing, di/dt injection into control rails, and switching-synchronous capture/ADC corruption.
  • Field signature mapping: “EMI at certain power levels,” “random reset under load,” and “false dry-fire/over-temp trips” often align with topology-specific stressors.
Figure F6 — Topology Compare: SCR/Triac vs IGBT Power Topology Stress Map ICNavigator SCR / Triac phase-angle AC mains Phase control Heater resistive IGBT switching AC/DC entry Switch cell Heater load Stressors dv/dt di/dt surge line dip Failure signatures: EMI reset false trip
Cite this figure Figure ID: F6 · Topology Stress Map · ICNavigator
Figure F6 contrasts phase-angle and switching approaches and highlights stressors that commonly drive EMI, resets, and false protection trips in tankless heater controllers.

H2-7 — Gate Drive, Isolation, and Protection Around the Switch

Goal: robust switching • fast fault shutdown • avoid nuisance trips

The switch “survival zone” is the ring around the power device: gate drive and isolation set switching behavior, snubbers and surge parts control dv/dt stress, and current sensing determines whether protection trips are real or nuisance. A robust design treats fast shutdown as a hardware path and uses measurements (gate, switch node, current sense) to validate that a trip was justified.

Gate drive essentials (especially for IGBT switching)

  • Isolation strategy: opto / isolated gate driver / digital isolator must tolerate fast common-mode transitions and keep the driver reference stable.
  • Gate resistors: separate turn-on vs turn-off resistance to balance EMI/ringing against switching loss and fault turn-off speed.
  • Miller control: Miller clamp or negative gate bias prevents dv/dt-induced unintended turn-on during high-speed transitions.
  • UVLO default-off: under-voltage lockout must force a safe OFF state if the drive rail sags during line dips or transients.
Knob: Rg(on) Knob: Rg(off) Guard: UVLO Guard: Miller clamp

Snubber and surge coordination (RC/RCD + MOV/TVS)

  • Snubber goal: absorb leakage inductance energy and reduce switch-node overshoot and dv/dt.
  • MOV/TVS goal: clamp larger surge events and protect against line transients beyond the snubber’s intended energy range.
  • Coordination rule: snubber handles repetitive high-frequency edges; MOV handles infrequent high-energy surges—avoid overlapping stress that overheats either part.
  • Placement: snubber loop must be tight to the switch node and return; long loops increase radiated noise and can worsen nuisance trips.

Common trap: “Adding a snubber” without controlling its loop area can reduce overshoot on paper but increase coupling into sensing and capture paths.

Current sensing and fast shutdown

  • Sensor options: shunt (fast, simple), CT (isolated, very fast but can saturate), Hall (isolated, slower but robust for DC-like components).
  • Fast shutdown path: overcurrent comparator or desaturation detection should directly command gate-off (or inhibit trigger) before software reacts.
  • Timing discipline: use blanking windows only where switching artifacts dominate; do not extend blanking so far that real faults become unprotected.
  • Latch + log: latch a trip in hardware, then capture a short log snapshot (power level, trip reason, sensor trust flags) after shutdown.

Preferred behavior: shut down first, explain later. A fast hardware path prevents damage even when firmware is busy or unstable.

Nuisance trips vs real faults (how to discriminate)

  • Nuisance trip patterns: trips align with switch-node spikes or dv/dt edges while load current does not sustain a rise.
  • Real fault patterns: current rises and persists, desat/OC evidence remains asserted, and the device shows an abnormal Vce/Vsw behavior under load.
  • Root causes of nuisance: comparator reference shifts due to ground bounce, CT recovery artifacts, or poor separation of sense and power returns.
  • Root causes of real faults: shorted load, device degradation, snubber failure, or surge damage that reduces margin until a predictable trip occurs.

Evidence hooks: the “three-waveform minimum”

  • Gate waveform (Vge): check ringing, unintended re-turn-on, and turn-off speed during trips.
  • Switch node (Vsw / Vce): check overshoot, dv/dt edges, and whether spikes line up with the trip instant.
  • Current sense timing: check whether OC/desat asserts on a brief artifact or on sustained overcurrent.

Fast verdict: if trips coincide with Vsw spikes but current does not persist → likely nuisance. If current persists and protection asserts cleanly → likely real fault.

Figure F7 — Switch Protection Ring Switch Protection Ring ICNavigator Switch IGBT / Triac gate loop power loop sense loop Gate Drive ISO + UVLO Miller Rg Snubber RC / RCD RC MOV Current Sense OC / DESAT shunt CT Hall fast OFF Evidence minimum 3 Vge Vsw Isense
Cite this figure Figure ID: F7 · Switch Protection Ring · ICNavigator
Figure F7 groups the “survival ring” around the switch: gate drive isolation and Miller control, snubber/MOV coordination, current-sense to fast shutdown, and loop discipline. Evidence targets are Vge, Vsw, and Isense timing.

H2-8 — RCD / Leakage Safety Chain + Over-Temp Hardware Interlocks

Goal: one-fault safe • hardware-first safety • clear trip paths

This chapter maps the safety chain that must remain effective even if firmware becomes unresponsive. Leakage protection (RCD/GFCI) relies on differential current detection, while over-temperature protection requires at least one independent hardware cutoff. A robust tankless controller treats these as hardware interlocks that can inhibit heating and latch faults without requiring MCU intervention.

Leakage current paths (where the differential current comes from)

  • Heater to water: insulation aging or contamination can create a path from the heating element into the water volume.
  • Moisture ingress: condensation, splash, or seepage creates surface leakage across insulating materials.
  • Creepage paths: insufficient creepage distance or polluted surfaces allow small leakage currents that can grow with humidity and temperature.
  • Harness and reference issues: poor returns and coupling can inject interference into sensing, creating apparent leakage events if the detection front-end is not hardened.

RCD/GFCI concept (differential current detection)

  • Normal: outgoing and return currents are equal; their difference is near zero.
  • Leakage: part of the current returns through an unintended path (water/moisture/ground), so the difference becomes non-zero.
  • Implication: the safety chain focuses on the difference, not the absolute load current.

Design implication: switching dv/dt and di/dt can couple into the detection front-end. The detection path must be hardened so “noise difference” does not look like true leakage.

Practical implementation blocks (control-board viewpoint)

  • Sensing coil / CT: measures differential current component.
  • Amplifier + filtering: raises and shapes the signal for robust decision-making under noise.
  • Comparator + latch: hardware threshold + latch to ensure the trip is retained and cannot be cleared by transient firmware behavior.
  • Trip interface: drives a cutoff path (relay/contactor inhibit, gate disable) and forces a safe state.

Hardware over-temperature interlocks (independent cutoff)

  • Thermal fuse: one-time cutoff for extreme conditions; ensures permanent safe failure in severe overheating.
  • Bimetal thermostat: independent, resettable cutoff; provides a hardware limit without MCU dependence.
  • Independent path: the cutoff must disable heating even if the MCU output is stuck high or the firmware loop is stalled.

Non-negotiable: software-only over-temp limits do not satisfy one-fault safety. At least one independent hardware cutoff is required.

One-fault safe thinking: what still protects the system if MCU locks up

  • Leakage trip is hardware-latched: comparator+latch forces heater inhibit regardless of firmware state.
  • Driver default-off: UVLO and gate disable paths ensure loss of control power results in OFF, not ON.
  • Hardware over-temp cutoff: thermal fuse/bimetal opens the power path independently.
  • Trip logging is secondary: record after shutdown, never before.

Must-be-hardware list: leakage differential detection + latch, independent over-temp cutoff, and default-off gating during undervoltage.

Figure F8 — RCD/Leakage + Over-Temp Hardware Safety Chain Safety Chain Map (One-Fault Safe) ICNavigator Leakage paths H→W . Creep CT coil AMP filter COMP latch TRIP Trip I/F heater inhibit Over-Temp Hardware independent cutoff Fuse Bimetal MCU lockup hardware still trips default OFF
Cite this figure Figure ID: F8 · Safety Chain Map · ICNavigator
Figure F8 maps leakage detection (CT → amplifier/filter → comparator+latch → trip interface) and independent over-temp cutoffs (fuse/bimetal). The chain remains protective even if the MCU becomes unresponsive.

H2-9 — Power Supply & Brownout Behavior (why it resets or mis-triggers)

Goal: keep control safe during line dips • default-off power stage

Brownout failures are rarely “just a reset.” The most damaging pattern is a line dip that collapses the control rails while the power stage is still capable of conducting. A robust tankless controller designs the auxiliary PSU for safe shutdown time, enforces default-off gating, and latches fault states so recovery cannot re-enable heating without explicit safe conditions.

Auxiliary PSU for control: isolation choice, UVLO, and hold-up

  • Isolation choice: isolated auxiliary rails reduce unintended ground paths and limit noise injection into ADC reference and digital thresholds.
  • UVLO discipline: define “valid control” rails; below UVLO, drive must be disabled and the power stage must be inhibited.
  • Hold-up target: size hold-up so rails remain valid long enough to complete gate disable and trip latch actions during line dips.
Constraint: UVLO Constraint: hold-up Constraint: default-off

Engineering objective: “hold-up time” is measured against safe shutdown actions, not against keeping the UI alive.

Brownout patterns: why line dips cause resets and mis-triggers

  • Short, shallow dips: rails stay near UVLO but comparators/ADC references drift, producing false trips or sensor “glitches.”
  • Deep dips: MCU resets while the power device may still be biased or partially enabled, creating a half-on risk.
  • Recovery bounce: rails oscillate around UVLO and repeatedly reset logic, causing repeated enable/disable edges unless latched safe states are enforced.

Critical risk: MCU reset is not equal to power stage OFF. The design must explicitly force OFF on undervoltage.

Sequencing: safe enable, default-off gates, watchdog + latch behavior

  • Safe enable: allow heating only after control rails are above UVLO, watchdog is healthy, and safety interlocks are clear.
  • Default-off gating: gate enable (or triac inhibit) must be pulled to OFF by hardware if rails fall, watchdog fails, or reset is imminent.
  • Latch behavior: after a brownout-related shutdown, keep heating inhibited until explicit recovery conditions are met (sensor trust, stable rails, no latched safety trips).
  • Reset reason logging: record the reset source (brownout/watchdog/external) so repeated field incidents can be separated from real load faults.

Preferred pattern: inhibit first, then log. Latch prevents “auto-reheat” loops after unstable mains events.

Noise paths during brownout: switching noise into ADC ground / reference

  • Ground bounce: high di/dt return currents shift local ground potential, moving comparator thresholds and ADC readings.
  • Reference contamination: ADC reference and analog ground must not share return paths with switch currents or snubber currents.
  • Sampling sensitivity: near-UVLO operation increases sensitivity; thresholds narrow and logic edges become ambiguous under noise.

Practical mitigation: separate power return, sense return, and reference return; treat ADC ref as a “quiet island” with controlled connection points.

Evidence hooks: the “minimum three” for brownout diagnosis

  • Rail droop capture: capture the control rail minimum and duration at the moment of reset/trip.
  • Reset reason logs: confirm whether the event is brownout, watchdog, or external reset.
  • Gate disable timing: measure time from rail droop start to gate disable/inhibit assertion to verify no half-on window exists.

Fast verdict: droop + brownout reset + delayed gate disable implies a dangerous sequencing gap that must be closed with hardware default-off gating.

Figure F9 — Brownout Safety Timing Map Brownout Safety Timing Map ICNavigator Aux PSU control rails UVLO Hold-up MCU reset reason Watchdog Default-OFF gate disable Gate OFF Power Stage SCR / IGBT Trip Latch inhibit persists across reset Latch ON Evidence timing Vrail Gate_EN Latch dip rec
Cite this figure Figure ID: F9 · Brownout Safety Timing Map · ICNavigator
Figure F9 ties brownout behavior to safety timing: control rail droop must trigger hardware gate disable and a latched inhibit path, while reset reason logging classifies the event for field analysis.

H2-10 — EMC/Surge/EFT/ESD Ruggedization (only what matters for this board)

Goal: immunity tied to symptoms • clear coupling paths • checkable layout rules

Ruggedization is not a generic checklist. For a tankless control board, the most costly field failures present as false trips, flow pulse loss, temperature ADC jumps, or random resets. This chapter maps the coupling paths that cause those symptoms and provides a layout checklist that can be verified by inspection.

Mains surge protection coordination (board-level chain)

  • MOV/TVS: clamp high-energy surges at the entry and limit stress seen by downstream rectifiers and auxiliary PSU.
  • Fuse: isolates sustained faults and prevents protective parts from overheating during abnormal events.
  • NTC / inrush: reduces turn-on surge that can otherwise create brownout-like behavior in the control domain.
  • Common-mode choke: reduces common-mode noise flow across the mains interface and improves immunity and emissions margin.

Coordination principle: no single part “wins.” Each part handles a different energy/time domain, and placement/return paths determine effectiveness.

EFT/ESD coupling into flow pulse and temp ADC lines

  • Flow pulse line: fast transients can appear as false edges or can mask real pulses, causing minimum-flow logic to trigger dry-fire protection.
  • Temp ADC line: injected charge or reference shifts can create a sudden temperature spike or drop, leading to false over-temp decisions.
  • Where coupling happens: long harnesses, high-impedance nodes, and inputs that share return paths with switching currents.

Field symptom mapping: “false dry-fire” often starts as pulse integrity failure, while “false over-temp” often starts as ADC reference/ground disturbance.

Layout rules: the minimum that matters

  • Minimize high di/dt loops: keep switch currents and snubber currents in tight local loops away from sensing and control returns.
  • Star return discipline: separate power return, sense return, and reference return; define a controlled single-point connection.
  • Isolation barrier clarity: maintain a clean isolation boundary and keep noisy copper away from the barrier edge.
  • External line entry protection: place protection parts near the connector and keep their return path short and well-defined.
  • ADC reference island: treat ADC ref/AGND as a quiet island; avoid routing switching edges or high current returns through it.
Check: di/dt loop Check: star return Check: barrier Check: connector I/O Check: ADC ref

What to validate: immunity tests tied to real symptoms

  • Surge: confirm the control rail droop stays within safe limits and that gate disable asserts immediately under undervoltage.
  • EFT: monitor flow pulse capture error rates (false edges, missing edges) and verify dry-fire logic does not false-trip.
  • ESD: monitor for resets, stuck inputs, and sudden ADC jumps; verify recovery does not auto-enable heating without safe conditions.
  • Combined stress: repeat tests at worst-case temperature/humidity where leakage paths and creepage sensitivity increase.

EMC symptom → likely coupling path → first fix

  • False trip during switching edges → ground bounce into comparator/ADC thresholds → separate returns; shorten sense loop; add controlled filtering at the decision point.
  • Flow pulse loss / extra pulses under EFT → signal-line injection and edge integrity failure → harden input conditioning, improve return, place protection at connector.
  • Temp ADC sudden spike → ADC reference island disturbed by switching current or ESD injection → protect ADC input, isolate reference routing, schedule sampling away from edges.
  • Random reset under surge → power injection into aux PSU / insufficient hold-up → improve entry coordination, increase hold-up to complete safe shutdown, enforce default-off gating.

Deliverable: treat this list as a field triage: symptom first, then coupling path, then the shortest correction with the highest probability of impact.

Figure F10 — EMC Coupling Map (Board-Only) EMC Coupling Map (Tankless Control Board) ICNavigator Mains entry MOV Fuse NTC CM choke Control domain Control PSU MCU ADC ref quiet island Power stage di/dt source Inputs Flow Temp power ground signal Symptoms (what shows up in the field) false trip pulse loss ADC jump random reset
Cite this figure Figure ID: F10 · EMC Coupling Map · ICNavigator
Figure F10 connects board-level protection and layout to field symptoms. Power injection, ground bounce, and signal-line hits explain why surge/EFT/ESD can show up as false trips, pulse capture errors, ADC jumps, or resets.

H2-11 — Validation & Field Debug Playbook (SOP)

Format: Symptom → first 2 measurements → discriminator → first fix

This SOP compresses diagnosis into repeatable steps that match how tankless controllers fail in the field: temperature hunting, false safety trips, nuisance RCD trips, and brownout resets. Each symptom bucket starts with two measurements that can be captured with minimal tools, then uses a discriminator to separate root-cause families, and ends with the highest-probability first fix.

Minimal-tool approach: DMM + 2-ch scope (or logic analyzer) + reset/fault logs. Optional: clamp meter for leakage/line current trend.

MPN examples Concrete parts to anchor “first fix” actions (examples, not mandatory)
  • Supervisors / UVLO reset: TPS3839 TPS3890 MCP1316 STM811
  • Watchdog timers: TPS3430 TPS3431 MCP131 MAX6369
  • Hardware latch / “default-OFF” gating: SN74LVC1G74 SN74HC279 74LVC1G00
  • Schmitt input hardening (flow pulses): SN74LVC1G17 74HC14  • ESD at connector: TPD1E10B06 TPD2E2U06
  • Comparators / fast fault detect: TLV3201 LMV331 TLV1701
  • Current-sense amplifiers (noise-robust): INA240 INA181
  • Voltage reference / analog stability: REF3330 LM4040
  • IGBT isolated gate drivers: UCC21520 Si8233 ACPL-332J
  • Triac/SCR opto drivers (mains resistive heater control): MOC3063 MOC3052 VO3062
  • RCD/GFCI controller IC (legacy option, availability varies): RV4145A
  • Entry protection examples: MOV: B72214 (EPCOS/TDK series example) • TVS: SMBJ series • Common-mode choke: 744821 (Würth series example)

Note: Part numbers above are examples to make the SOP actionable. Final selection depends on mains voltage, isolation class, thermal, and compliance constraints.

Symptom

Outlet temperature hunts / oscillates (visible “up-down” cycles)

Goal: separate control hunting vs sensor lag vs flow disturbance.

First 2 measurements

  • M1: outlet temperature slope and oscillation period from logs (ΔT/Δt and cycle time).
  • M2: commanded power vs time (phase angle or PWM duty) and whether it “saws” with the same period.

Discriminator

  • If the power command oscillates strongly and leads the temperature swing → control hunting (gain too high, no rate limiting, noisy feedback).
  • If power command is smooth but temperature oscillates → sensor lag / placement or flow steps dominating the thermal inertia.

First fix (highest hit-rate first)

  • Add rate limiting and deadband before changing sensors; verify stability improves without sacrificing warm-up time.
  • Align sampling away from switching edges; stabilize ADC reference island (REF3330, LM4040).
  • If still unstable, validate sensor dynamics and placement; reduce noise pickup into ADC ground/REF.
Symptom

Trips on over-temperature immediately at start

Goal: distinguish real thermal event vs ADC glitch / open-short fault.

First 2 measurements

  • M1: does measured temperature rise physically (heater block / outlet) or “jump” instantly?
  • M2: capture ADC input or reference behavior during the first power edges (look for synchronous spikes/steps).

Discriminator

  • Instant step without real heating → sensor/ADC artifact (reference shift, ground bounce, input injection, open/short).
  • Fast monotonic rise with confirmed power conduction → real overheating (insufficient flow, excessive initial power, thermal path issue).

First fix

  • For “glitch” cases: harden analog thresholds with a clean reference (REF3330) and comparator filtering (TLV3201); improve AGND/REF return.
  • For “real” cases: limit cold-start power ramp and require flow-verified enable; verify hardware cutoff interlock is independent.
Symptom

Dry-fire false trip / or heats with “no flow detected”

Goal: establish “flow truth” and enforce fail-safe inhibit.

First 2 measurements

  • M1: flow sensor pulse train integrity at the controller pin (amplitude, noise, missing/extra edges).
  • M2: inlet/outlet temperature slope after power enable (does ΔT rise consistent with heating and flow?).

Discriminator

  • Pulses exist but counter shows zero / random → input conditioning/capture failure (EMI, weak pull-up, no Schmitt, ESD hits).
  • Pulses are truly absent yet heater conducts → safety chain defect (flow-bad does not force default-OFF).
  • Pulses are present but dry-fire trips anyway → plausibility thresholds too aggressive or filtered pulses are delayed.

First fix

  • Harden flow input: Schmitt trigger (SN74LVC1G17) + ESD at connector (TPD1E10B06) + defined pull-up and short return path.
  • Enforce flow-bad → inhibit in hardware: latch + gate inhibit (SN74HC279, SN74LVC1G74).
  • Validate plausibility logic with a minimum-flow window and temperature-rise sanity check (two-sensor truth).

Safety note: “heater ON while flow unknown” must be prevented by default-off gating, not only by firmware.

Symptom

RCD nuisance trips during heating ramp

Goal: separate true leakage growth vs dv/dt/EMI injection into the RCD sense chain.

First 2 measurements

  • M1: RCD sense output / comparator output vs time during ramp (does it trigger on specific switching edges?).
  • M2: correlate trip timing with switch node dv/dt or phase-angle edge (capture Vsw or gate timing).

Discriminator

  • Trips are edge-synchronous and repeatable → nuisance coupling (dv/dt pickup, ground reference movement, poor filtering).
  • Trips correlate with humidity/temperature/time-in-service, not with edges → real leakage path (heater-to-water, moisture ingress, creepage contamination).

First fix

  • For nuisance coupling: tighten ground/reference control for the RCD amplifier/comparator (TLV3201, LMV331), add controlled filtering, and separate returns from power di/dt loops.
  • For true leakage: inspect creepage/clearance on the board, contamination tracks, and connector sealing; validate with controlled humidity test.
  • Legacy GFCI controller option (design-specific): RV4145A (availability varies).
Symptom

Random reset / display flicker under load

Goal: prove brownout vs EMI reset, and confirm gate disable timing is safe.

First 2 measurements

  • M1: capture control rail droop (min voltage + duration) at the reset moment.
  • M2: read reset reason (brownout/watchdog/external) and capture gate disable/inhibit timing.

Discriminator

  • Droop + brownout reset → hold-up/UVLO/sequencing problem.
  • No significant droop but reset occurs → EMI injection into reset/clock/watchdog or ground bounce.

First fix

  • Brownout class: supervisor + clean UVLO gating (TPS3839, MCP1316) + watchdog (TPS3430) + default-OFF latch (SN74LVC1G74).
  • EMI class: improve entry chain coordination (MOV/TVS/CM choke), fix reference island (REF3330), and harden critical lines with ESD parts (TPD2E2U06).

Minimum evidence pack: rail droop waveform + reset reason + gate OFF timing (no “half-on window”).

Symptom

Works on bench, fails in field (EMC / moisture / grounding)

Goal: compress environment variables into a reproducible triage path.

First 2 measurements

  • M1: capture a “fault snapshot”: Vrail + fault flag + flow count + temperature slope at the moment of failure.
  • M2: reproduce one variable at a time: longer harness, controlled humidity, or controlled EFT/ESD stress; log which variable re-triggers the symptom.

Discriminator

  • Humidity-driven failures → leakage/creepage and RCD path sensitivity.
  • Harness/EMC-driven failures → signal-line injection (flow/temp) and reference/ground coupling.
  • Load/ramp-driven failures → inrush and brownout sequencing.

First fix

  • Start at connector entry: ESD/TVS placement, return path control, and input conditioning (SN74LVC1G17, TPD1E10B06).
  • Then stabilize control rails and default-OFF gating (TPS3839, TPS3430, SN74HC279).
  • Finally tune control parameters only after the hardware evidence chain is clean.
Deliverable: compact decision tree + minimal-probe checklist

The decision tree below prioritizes safety first (flow/RCD/over-temp), then separates brownout vs EMC vs sensor-truth issues using three measurements: Vrail droop + reset reason, gate/Vsw timing, and flow pulses + temperature slope.

Probe: Vrail Probe: Gate_EN / Vsw Probe: Flow pulses Probe: Temp slope Log: reset reason Log: fault flags
Figure F11 — Field Debug Decision Tree (Minimal Tools) Field Debug Decision Tree (Minimal Tools) ICNavigator Start from symptom temp hunting over-temp at start dry-fire / flow RCD nuisance reset First 2 measurements (pick the lane) Lane A: Vrail + Reset droop + reason flag Lane B: Gate / Vsw edge timing & dv/dt Lane C: Flow + Temp pulse integrity + slope Discriminator (1 question that splits root cause) brownout? droop + BOR reset edge-synchronous trip? dv/dt coupling sensor truth? pulse + ΔT sanity First fix (hardware-first, then tuning) UVLO + WDT + latch input hardening ground / ref island tuning TPS3839 • TPS3430 • SN74HC279 SN74LVC1G17 • TPD1E10B06 REF3330 • INA240
Cite this figure Figure ID: F11 · Field Debug Decision Tree · Tankless Control Board · ICNavigator
Figure F11 is a compact field triage map: pick the symptom, capture two measurements, apply one discriminator question, then apply the first fix with concrete parts (supervisor/watchdog/latch, input Schmitt+ESD, reference island & current-sense).

Request a Quote

Accepted Formats

pdf, csv, xls, xlsx, zip

Attachment

Drag & drop files here or use the button below.

H2-12 — FAQs ×12 (Evidence-locked)

Collapsible Q&A • Each answer: 2 signals/logs → discriminator → first fix

These FAQs are scoped to the tankless control board only: flow truth, temperature truth, switching evidence (gate/Vsw), RCD/leakage chain, over-temp & dry-fire protection, brownout/reset evidence, and EMC coupling paths. Answers include concrete probes and example MPNs.

Safety: When “flow is unknown” or “over-temp is plausible,” the correct first action is default-OFF gating and latch behavior—never rely on firmware alone.

Q1Has flow, but still dry-fire trips. Which two signals prove sensor vs wiring vs EMI?
Capture (1) the flow pulse train at the MCU pin and (2) the ΔT slope (inlet→outlet) during commanded heating. If pulses at the pin are clean and counted but ΔT rises abnormally fast, the sensor is not the issue—dry-fire logic or power gating is. If pulses show missing edges/bursts during switching edges, EMI or wiring/return path is the likely cause. First fix: add a Schmitt conditioner SN74LVC1G17 plus connector ESD TPD1E10B06, and verify pulse integrity under full-power edges.
Q2Heats even when flow reads 0. How to prove “fail-safe inhibit missing” vs “pulse capture broken”?
Capture (1) Gate_EN (or triac inhibit) timing and (2) the raw flow input waveform before any firmware filtering. If Gate_EN stays active while raw flow is absent/invalid, the hardware chain is not default-OFF. If raw flow exists but is not recognized, pulse conditioning/capture is broken. First fix: enforce a hardware inhibit latch so “flow-bad → heater OFF” is guaranteed (SN74HC279 or SN74LVC1G74) and then harden the pulse input with a Schmitt stage.
Q3Trips over-temp 5–10s after start. How to separate real overheating vs NTC/ADC glitch?
Capture (1) heater-block temperature (or the closest thermal sensor) and (2) ADC reference/AGND movement at the same moment. Real overheating shows a monotonic rise consistent with power conduction. A glitch shows a step/spike synchronized to switching edges while the heater block does not track physically. First fix: stabilize the analog reference island (REF3330), add comparator filtering for fast trips (TLV3201), and align sampling away from gate edges before changing thermal thresholds.
Q4Outlet temperature jumps instantly when switching starts. What proves reference/ground bounce vs real temp change?
Capture (1) ADC input node (NTC divider midpoint) and (2) ADC reference (Vref) or AGND-to-DGND delta across a switching edge. If the divider node moves with Vref/ground bounce and not with a plausible thermal time constant, it is measurement corruption. First fix: isolate the analog return, give the divider a controlled RC, and use a clean reference (LM4040 or REF3330). Then verify the “jump” disappears without altering the actual heating power.
Q5Temperature oscillates when faucet flow changes. Feedforward first or filtering first?
Use two captures: (1) flow change (pulse frequency) vs time and (2) power command vs time. If power command lags flow steps and then over-corrects, the priority is flow-based feedforward and a controlled ramp. If power command is stable but outlet temperature is noisy, then modest filtering and sampling alignment matter more. First fix: implement a bounded feedforward map plus rate limit; then add minimal filtering only to remove edge-synchronous noise.
Q6Stable at low power, hunts at high power. Which evidence splits control hunting vs switching-coupled sensing?
Capture (1) power command waveform and (2) temperature/flow raw readings synchronized to switching edges. If raw readings show edge-synchronous spikes at high power, the loop is chasing measurement artifacts—this is not “tuning-only.” If raw readings stay clean but command oscillates, it is genuine control hunting. First fix: for sensing corruption, fix reference/returns and add a clean current-sense path (INA240) plus sampling alignment; for hunting, lower aggressiveness and add deadband/rate limit.
Q7RCD or over-temp triggers exactly on switching edges. How to prove dv/dt coupling into the sense chain?
Correlate (1) trip comparator output with (2) Vsw dv/dt (or gate edge timing). If the trip is edge-synchronous and repeatable at a specific dv/dt magnitude, it is coupling/ground reference movement rather than true leakage/overheat. First fix: reduce edge energy (snubber/layout), and harden the trip front-end with proper filtering and reference control. For IGBT drive, verify clean gate control using an isolated driver such as UCC21520 or Si8233.
Q8False fault flags appear during ramp. What proves gate/Vsw ringing vs comparator threshold shift?
Capture (1) Vsw ringing waveform and (2) fault comparator input vs threshold around the event. If faults occur only when Vsw rings beyond a repeatable peak, ringing is the trigger. If comparator input shifts with Vref/ground movement, it is threshold corruption. First fix: add/retune snubber and minimize loop inductance; or stabilize Vref and use a fast, well-behaved comparator front-end (TLV3201, LMV331) with a clean return.
Q9RCD trips only at high power. What are common leakage paths and how to isolate safely with evidence?
Use (1) RCD differential sense output amplitude and (2) temperature/humidity condition as the discriminator. True leakage usually grows with moisture, contamination, or heater insulation degradation (heater-to-water path). Nuisance trips often correlate tightly with dv/dt edges. First fix: if moisture-correlated, inspect creepage/contamination and sealing. If edge-correlated, harden the sense chain and reduce dv/dt. Where a dedicated controller is used, a legacy option is RV4145A (availability varies by region).
Q10RCD trips more on humid days. How to prove moisture/creepage vs control-board nuisance coupling?
Compare (1) trip probability vs humidity/condensation and (2) edge-synchrony to Vsw. Moisture/creepage issues track humidity and may persist independent of exact switching edge timing. Nuisance coupling tracks dv/dt edges and is highly repeatable at specific operating points. First fix: for moisture—clean/coat/restore creepage integrity and connector sealing; for nuisance—separate returns, filter the sense chain, and control dv/dt via layout/snubber.
Q11Random reset under load. Rail droop or dv/dt injection—what two measurements decide fast?
Capture (1) control rail droop minimum + duration and (2) reset reason flag (brownout vs watchdog vs external). Droop plus brownout indicates hold-up/UVLO/sequencing weakness; minimal droop with resets suggests dv/dt/EMI injection into reset/clock. First fix: for brownout, add a supervisor TPS3839 (or MCP1316) plus watchdog TPS3430 and a default-OFF latch. For EMI, improve reference/return integrity and reduce edge energy at the power stage.
Q12Works on bench, fails in field. What minimal “fault snapshot” separates harness/EMC vs moisture vs line dip?
Record a snapshot containing (1) Vrail + reset reason, (2) fault flags + trip source, and (3) flow count + ΔT slope at the moment of failure. If failures correlate with harness length/route and edge timing, EMC injection is likely; if correlated with humidity, leakage/creepage dominates; if correlated with heavy loads or mains dips, brownout/sequencing dominates. First fix: harden connector entry (ESD + Schmitt), enforce default-OFF gating, then address layout return paths.

Fast triage rule: If a fault is edge-synchronous, suspect dv/dt coupling. If it is humidity-correlated, suspect leakage/creepage. If it is droop + BOR, suspect brownout/hold-up.

Figure F12 • FAQ Evidence Map (what to capture first)

A compact visual map that forces every FAQ back onto evidence: Vrail/reset, gate/Vsw edges, flow pulses, temperature slope, and the RCD/leakage chain.

Figure F12 — FAQ Evidence Map (Tankless Control Board) FAQ Evidence Map (Capture These First) Vrail + Reset reason droop, BOR/WDT flags safe gate-OFF timing Gate_EN + Vsw dv/dt edge-synchronous faults ringing & snubber clues Flow pulse integrity clean edges, no bursts count matches reality Temperature truth (ΔT slope) real thermal time constant vs glitches inlet/outlet + heater-block sensor RCD / Leakage chain evidence CT output + comparator trip timing edge-synchronous vs humidity-correlated Symptom groups these blocks resolve random reset / flicker edge-synchronous trips dry-fire / flow disputes over-temp at start temperature hunting RCD nuisance / leakage ICNavigator
Cite this figure Figure ID: F12 · FAQ Evidence Map · Tankless Control Board · ICNavigator
Figure F12 ties every FAQ back to evidence that can be captured quickly: Vrail/reset, gate/Vsw edges, flow pulses, ΔT slope, and RCD chain timing.