Liquid Cooling Manifold & Pump Control
← Back to: Data Center & Servers
A rack/server liquid-cooling manifold & pump controller keeps coolant circulation stable by combining a BLDC drive, clean Flow/ΔP/Temp sensing, and a fault-aware state machine. It prevents dry-run/leak events and maintains continuity through redundant power-domain switchover, while recording the minimum telemetry and fault logs needed for fast root-cause analysis.
H2-1 | Page Boundary & System Role: What Manifold & Pump Control Owns
A liquid-cooling manifold and pump control module sits inside the server/rack coolant loop and owns the execution layer: driving the BLDC pump, conditioning flow/ΔP/temperature inputs, enforcing safety interlocks (including leak response), managing redundant power domains for continuity, and emitting actionable telemetry plus time-ordered fault events for diagnosis.
What this page covers
- Pump drive + protection: BLDC/PMSM drive chain, startup/priming ownership, derating and safe shutdown triggers.
- Signal front-end contract: where flow, differential pressure (ΔP), and coolant temperatures enter, and what “control-grade” means.
- Safety enforcement: leak and interlock latching, graded response (warn → derate → shutdown) and anti-oscillation rules.
- Redundant power domains: A/B feed, OR-ing behavior, continuity of control rail vs power rail, and switchover evidence in logs.
- Diagnosability: minimal telemetry fields and event codes that make field failures reproducible.
Explicit non-goals (to avoid cross-page overlap)
- No facility CDU / chiller plant design: only the local loop and its execution controls are in scope.
- No BMC protocol deep dive: transport (IPMI/Redfish details) is out-of-scope; only “what must be observable” is defined.
- No PSU topology (PFC/LLC) and no rack PDU metering: this page stays at the pump module domain.
- No general thermal policy: system-level fan curves and workload scheduling are handled elsewhere; this page provides reliable actuation + evidence.
H2-2 | What Must Be Measured: Flow, ΔP, Temperature, Bubbles & Leaks
Reliable pump control is not “one-sensor control.” The execution layer needs a minimal observability set that survives real coolant loops: flow (delivery), differential pressure ΔP (loop impedance / blockage proxy), and temperature gradient (heat transport result). A leak signal is safety-critical and must be treated with graded certainty to prevent false shutdowns.
The engineering meaning of each signal
- Flow (delivery capacity): confirms coolant is actually reaching the load. Most useful to confirm “flow established” after priming and to detect intermittent dropouts.
- ΔP (impedance proxy): reacts strongly to blockage, filter loading, kinked lines, or cold-plate restriction changes; often more control-stable than raw flow in noisy regimes.
- Temperature (Tin/Tout, ΔT): validates heat transport and provides hard safety limits; it can detect “pump spinning but not transporting heat” when paired with flow/ΔP.
- Leak (safety latch): must support warning/derate/shutdown tiers; condensation and service events demand anti-false-positive logic.
Common “false reality” traps (and how to avoid being fooled)
- Bubbles / poor priming: flow readings can oscillate; ΔP can look inconsistent; current ripple often increases. Use cross-checks rather than a single threshold.
- Low-flow quantization: many flow sensors become jumpy near minimum measurable flow. Treat flow as diagnostic in that region, not as a tight control target.
- ΔP zero drift: temperature and mounting stress shift the baseline. A control loop must include drift tolerance (deadband/hysteresis) and plausibility checks.
- Slow temperature response: Tin/Tout can lag fast transients. Use temperature primarily for protection and slow validation, not for fast torque decisions.
- Condensation vs leak: moisture sensors may trigger during cold starts. Grade the leak signal (suspect vs confirmed) and require persistence or multi-sensor corroboration.
A practical cross-check triangle (minimal but powerful)
- Low flow + high ΔP → likely restriction/blockage (filter/cold plate) rather than a drive failure.
- Low flow + normal/low ΔP → likely bubbles, dry-run, or loss of prime (not enough head built).
- Flow looks “OK” but ΔT stays high → sensor placement error, bypass/short-circuit path, or insufficient contact at the load.
H2-3 | BLDC Pump Drive Architecture: 6-Step vs FOC (Why Servers Often Prefer FOC)
Data center liquid-cooling pumps commonly use 3-phase BLDC/PMSM motors driven by a three-phase half-bridge (integrated or discrete MOSFET stages). The practical decision between 6-step trapezoidal commutation and field-oriented control (FOC) should be made using execution-layer constraints: low-speed torque for priming, acoustic/vibration limits, and diagnostic visibility under bubbles or partial dry-run conditions.
- Strength: simpler implementation and fewer tuning dependencies; robust for fixed-speed or moderate dynamic requirements.
- Trade-off: commutation torque ripple can amplify mechanical resonance (pump + tubing), raising acoustic noise in some operating points.
- Low-speed caveat: sensorless operation becomes less stable when back-EMF is weak; open-loop alignment/forced commutation can reduce priming success rate under bubbles.
- Diagnosability: fewer internal observables; abnormal conditions are often detected later (e.g., “no-flow” rather than root-cause signatures).
- Strength: smoother torque (lower ripple) and better low-speed authority; improves startup reliability and reduces vibration-sensitive behavior.
- Strength: current-loop control makes “torque intent vs outcome” measurable, enabling stronger abnormal detection (bubbles, partial dry-run, restriction).
- Trade-off: requires reliable current measurement and parameter/tuning management; control complexity increases validation effort.
- Failure containment: when sensors degrade (noisy flow, drifting ΔP), the controller can degrade to safer modes while preserving evidence in logs.
- Priming sensitivity: frequent cold starts, trapped air risk, or low NPSH conditions favor smoother low-speed torque control.
- Noise budget: if torque ripple excites tubing/manifold resonance, prioritize torque smoothness over simplest commutation.
- Fault evidence: if root-cause isolation matters (bubble vs restriction vs dry-run), prioritize current-sense observability and event logs.
- Complexity risk: if calibration/tuning cannot be controlled across units, keep control architecture conservative and enforce strict degradation rules.
H2-4 | Startup & Priming Are the Hard Part: Low-Speed Torque, Bubbles, Dry-Run, Cavitation
A pump “spinning” does not guarantee coolant transport. Priming is the execution-layer reliability bottleneck because trapped air can prevent head buildup, and abnormal regimes (bubbles, partial dry-run, incipient cavitation) can look acceptable if only one signal is trusted. A robust implementation treats priming as a bounded state machine with graded evidence, power limits, and explicit fault codes.
- Air lock: the impeller moves air rather than liquid; speed may rise while ΔP/flow stay low or unstable.
- Partial dry-run: rotation exists but coolant contact is insufficient; current/estimator stability may change before flow confirms failure.
- Incipient cavitation: pressure fluctuations reduce effective flow and can cause ΔP oscillation; prolonged operation risks damage and noise spikes.
- Soft-start: limit acceleration (dRPM/dt) and/or electrical power (Pmax) during START/PRIME to prevent supply dips and mechanical shock.
- Low-speed torque margin: during PRIME, allow controlled torque boost within a strict time window; avoid infinite retries that become destructive dry-run.
- Evidence-based detection: treat “no-flow” as a conclusion from multi-signal mismatch (flow vs ΔP trend vs electrical signatures), not a single threshold.
- Anti-oscillation: enforce minimum dwell time and hysteresis between states to prevent rapid start/stop cycling.
- Prime outcome: success / timeout / unstable signals / safety trigger.
- Key snapshots: RPM target, current proxy (bus/phase), flow, ΔP, and a brief window statistic (mean + ripple).
- Counts & timing: prime duration, retry count, time since last successful prime.
H2-5 | Sensor AFE & Sampling: Turning “Dirty Signals” into Controllable Metrics
Pump control quality is limited by signal quality. Flow, differential pressure (ΔP), and coolant temperature are “dirty” in the field: drift, noise coupling, quantization at low flow, and transient artifacts during priming can destabilize control and inflate false alarms. A production-grade implementation separates control-grade signals (stable, low bandwidth) from diagnostic features (ripple, slope, plausibility), and records the evidence that explains state transitions.
- Dominant failure mode: slow offset/temperature drift that masks real restriction changes or fabricates apparent ΔP.
- Execution-layer rule: allow zero/offset re-baselining only in an explicitly safe window (stable low activity), never as a continuous hidden correction.
- Noise coupling: reference and supply disturbances can appear as ΔP movement; treat ΔP credibility as a scored signal, not a single number.
- Low-flow limit: quantization and jitter dominate near the minimum measurable range; avoid using raw flow as a fast control variable in that region.
- Air/bubbles: transient spikes or dropouts are common during priming; classify flow as “unstable” when ripple and persistence criteria fail.
- Engineering output: publish both Flow and a Flow stability flag so the state machine can degrade rather than oscillate.
- Safety: enforce over-temperature and abnormal temperature rise rate (dT/dt) limits.
- Transport check: use inlet/outlet trends as evidence that coolant transport is effective, especially when flow sensing is unstable.
- Placement awareness: interpret temperature in context (inlet vs outlet) to avoid false conclusions during transients.
- Window averaging: best for slow, stable control metrics (temperature trends, long-term ΔP).
- IIR filters: common choice for control loops—stable output with adjustable responsiveness via time constant.
- Kalman-style estimators: use for fusion and credibility (e.g., combining noisy flow with electrical/ΔP evidence), not as a default filter everywhere.
H2-6 | Leak Detection: Conductivity, Humidity, Optical, Inference — and False-Alarm Control
Leak handling must be evidence-based. The difficult part is not “detecting something,” but preventing condensation and service artifacts from causing disruptive shutdowns. A robust pump-control implementation uses graded evidence and tiered actions: suspect → confirmed → emergency, with explicit debounce/persistence rules and a clear lock-and-log policy.
- Conductivity (rope/point): fast local detection near manifold fittings and pump area; requires fluid-compatibility and aging-aware thresholds.
- Humidity / moisture: useful as a suspect indicator; high condensation risk demands strong debounce and environmental gating.
- Optical / local level: provides direct “fluid present” evidence when mechanically feasible; strong candidate for confirmation.
- Inference: flow drop + ΔP anomaly + temperature pattern is auxiliary only; do not confirm a leak from inference alone.
- Time persistence: require a minimum duration above threshold (debounce/persistence) before changing severity state.
- Context gating: during priming, maintenance windows, or known condensation risk, prevent moisture-only inputs from jumping directly to confirmation.
- Cross-check: confirm using at least one “strong” sensor class (conductivity/optical), plus an optional inference score for confidence.
- Severity transitions: Suspect → Confirmed → Emergency, including the exact trigger class (conductivity / humidity / optical / combined).
- Snapshot: flow, ΔP, temperature, electrical effort proxy, and current pump state (START/PRIME/RUN/DEGRADED) at confirmation time.
- Latch policy: whether the condition is self-clearing or requires explicit clearing after inspection.
H2-7 | Redundant Power Domains & Fault Tolerance: A/B Rails, OR-ing, Switchover Without Speed Drop
“Redundancy” in a pump control board is not a single backup wire—it is a power-domain architecture that keeps the control domain alive while the high-current motor domain can switch sources. The practical goal is to prevent a brownout reset, avoid reverse-current propagation, limit DC-link sag, and record a traceable event with timestamp and reason code.
- Power-stage domain: the high-current path feeding the inverter DC-link (A/B → OR-ing → Vbus).
- Control always-on domain: MCU, gate-driver logic, and critical sensor rails remain stable through switchover.
- Sense integrity: ΔP/flow/temp credibility is preserved by keeping references and ADC rails out of brownout.
- Reverse-current control: prevent backfeed from DC-link into a failing input rail so the fault does not propagate.
- Low drop: ideal-diode OR-ing minimizes voltage loss and heat compared with diode drops in high-current paths.
- Switchover transients: treat Vbus sag and dV/dt as first-class signals; avoid trip-chains caused by brief dips.
- Control continuity: keep state machine and sampling stable; do not allow a reset to re-enter startup logic.
- Vbus protection: enforce sag limits and UVLO margin by managing DC-link energy and short transient handling.
- Soft transition: temporarily cap acceleration/power during switchover to prevent overcurrent trips and oscillation.
- Power cap: limit maximum electrical effort when only one domain is healthy.
- Target reduction: reduce flow/ΔP targets to stay away from critical rail margins.
- Anti-flap rule: avoid repeated A↔B bouncing; lock preference until rails are stable for a defined window.
H2-8 | Protections & Fault Modes: Stall, Overcurrent, Overtemp, Dry-Run, Sensor Distortion
Effective protection is evidence-driven and stable. The goal is to map field symptoms (flow, ΔP, current/power proxy, temperature) to fault classes, apply tiered actions (warn → derate → shutdown), and avoid oscillation caused by repeated start-stop cycles. Sensor faults must be handled as credibility problems first, not as immediate shutdown triggers.
- Electrical effort (current/power proxy): the drive is “pushing” vs limited.
- Hydraulic result (flow + ΔP): whether pressure head and transport are established.
- Thermal outcome (temperature + dT/dt): whether heat is being removed as expected.
- Stall / blockage: electrical effort rises while flow remains near zero and ΔP does not build as expected.
- Overcurrent: current exceeds limit (transient or sustained); Vbus sag can amplify trips if not handled with margin.
- Overtemperature: device/winding temperature rises; derating is preferred before escalation to shutdown.
- Dry-run: flow fails to establish while temperature rise pattern becomes abnormal; electrical effort may not be extreme.
- Sensor distortion: stuck-at flow, drifting ΔP, open/short temperature—treat as credibility loss and degrade conservatively.
- Warn: log evidence, tighten sampling, and maintain safe limits.
- Derate / Degraded: cap power/acceleration, reduce targets, and continue operation when safe.
- Shutdown / Latched: stop and latch only when safety thresholds or repeated failures demand it.
H2-9 | Control Strategy: From Speed Control to ΔP / Flow Control (Practical Boundaries)
Pump control targets are chosen by signal credibility and operating phase, not by algorithm complexity. Speed control is the most robust baseline, while ΔP control tracks loop impedance changes more directly. Flow control can be effective only when flow sensing is stable and trustworthy; otherwise, flow is better used for diagnostics and cross-checks.
- Speed control: simplest and most stable; does not guarantee flow or ΔP under changing impedance.
- Flow control: targets transport directly; depends heavily on flow sensor quality and bubble sensitivity.
- ΔP control: targets pressure head / impedance behavior; requires a stable ΔP signal (offset + drift managed).
- Reliable ΔP available: prefer ΔP-control and apply speed / power clamps to protect margins during transients.
- Flow is noisy or bubble-prone: keep flow as diagnostic-grade (credibility scoring, cross-check, alarms), not the primary loop target.
- Low temperature / degas / bubble period: fall back to a conservative mode (typically speed control) until signals stabilize.
- Control-grade: bounded noise, stable offset, reasonable rate-of-change, and consistent with physics.
- Diagnostic-grade: helpful for triage but unsafe to close the loop directly when it can chase noise.
- Cross-check: compare trends across {ΔP, flow, electrical effort, temperature} before enabling aggressive control modes.
H2-10 | Telemetry & Fault Logs: What to Record to Diagnose Problems
This section focuses on what the pump control board / manifold controller should record locally. Continuous telemetry provides trends, while event logs provide evidence. The diagnostic minimum is an event code plus a snapshot of key rails, hydraulic signals, and active limits—captured with a consistent relative timestamp.
- Telemetry: periodic values (rails, RPM, flow, ΔP, temperatures) used for trending and cross-check.
- Event log: discrete records (start, prime failure, dry-run, leak, domain switch, derate, sensor fault).
- Snapshot: a “freeze frame” of key telemetry taken at the event moment; the snapshot is what makes logs actionable.
MVP field set (minimum to localize root cause)
- Time: uptime_ms (relative time), monotonic counter
- Power rails: Vin_A / Vin_B, Vbus, active_domain, brownout_flag
- Motor effort: Idc (or power proxy), RPM, speed_cmd, limit_active
- Hydraulic: Flow, ΔP, Tin/Tout (or the closest available equivalents)
- State & validity: state_machine_state, sensor_valid_flags, fault_code, action_taken
Advanced field set (faster diagnosis, fewer reproductions)
- Trends: Vbus sag rate, ΔP slope, flow ripple (feature stats)
- Counters: retry_counter, backoff_level, domain_switch_count
- Thermal detail: driver_temp, winding_temp (if available)
- History: last-N event ring buffer for correlation
H2-11 | Validation & Production Test: Proving Reliability, No False Trips, and No-Drop Switchover
This section defines a practical evidence plan for a rack/server liquid-cooling manifold & pump controller: development validation (design correctness), production test (unit-to-unit consistency), and in-field self-test (diagnosability). Scope is limited to pump-control behaviors (priming, dry-run, leak discrimination, redundant power-domain switchover, and sensor fault handling).
1) Three-layer evidence model (DVT/EVT → Production → In-field)
- Goal: prove control/protection/state-machine logic is correct under worst-case boundary conditions.
- Method: repeatable fault injection + scripted runs + time-aligned evidence (event + snapshot + trend).
- Coverage: bubbles/unprimed start, dry-run windows, condensation vs leak, power-domain switchover profiles, sensor drift & disconnections.
- Goal: guarantee unit-level consistency in minutes, with low fixture cost.
- Method: testable “proxies” (open/short injection, threshold checks, controlled brownout pulses, logging integrity).
- Output: pass/fail + traceable calibration constants + key counters (e.g., switchover count).
- Goal: detect degradation early without disrupting service (prefer derate/alert over unnecessary shutdown).
- Method: lightweight plausibility checks (sensor sanity, domain health, log space, anomaly counters).
- Output: diagnostic flags + a minimal snapshot for remote triage.
2) Definition of Done (DoD): measurable acceptance criteria
Protection correctness (safety + detection):
- Dry-run detection time: T_detect_dryrun ≤ X s, followed by safe stop or safe derate within a defined window.
- Leak confirmation time: T_confirm_leak ≤ Y s with graded actions (Warn → Derate → Emergency shutdown).
- Sensor fault detection: open/short and stuck readings detected within T_detect_sensor ≤ Z s, entering a defined fallback mode.
- No “protection oscillation”: bounded retry count and cooldown windows to prevent repeated start/stop instability.
Service continuity (no-drop switchover):
- Power-domain switchover continuity: ΔRPM_max ≤ A% and/or ΔFlow_min ≥ B% during A→B transfer profiles.
- Control-rail survival: no controller reset during micro-interruptions within the specified tolerance.
- Mode switching stability: hysteresis and rate limits prevent “mode flapping” in marginal sensor conditions.
3) Must-run validation use-cases (fault injection → expected action → evidence)
Each use-case below should be written and executed with a consistent template: purpose → injection method → observables → expected actions → logging evidence → pass/fail criteria.
- Bubbles/priming variability: multiple orientations + different initial fill levels; verify RUN entry requires credible Flow/ΔP establishment.
- Dry-run window: run without liquid for a bounded time; verify detection time and safe stop/derate without false leak triggers.
- Leak vs condensation discrimination: compare true leak injection vs controlled humidity/condensation; quantify false-positive rate.
- Power-domain switchover: A hard drop, B micro-interruption, and brownout ramps; verify continuity metrics and event stamping.
- Sensor disconnect/short injection: Flow/ΔP/Temp open/short/stuck; verify fallback mode and stable limits (no uncontrolled acceleration).
4) Production test: fast proxies that still catch real failures
- Power rails: always-on rail, gate-driver rail, ADC references within limits.
- Sensor integrity: open/short detection paths + stuck-at plausibility checks.
- Switchover detection: controlled brownout pulse and verify domain status + event logging.
- Logging integrity: event buffer write/read, monotonic counters, CRC if applicable.
- Full priming & bubble matrix across orientations.
- Extended dry-run robustness with controlled thermal rise.
- Condensation chamber correlation vs leak sensor thresholds.
5) Example material part numbers (MPNs) that support testability
The following are example MPNs commonly used as building blocks for pump-control boards. Selection must be verified against electrical ratings, thermal design, and wetted-material requirements (for sensors).
- 3-phase smart gate driver (BLDC/PMSM): DRV8323RS / DRV8323RH (SPI or HW interface, diagnostics).
- Current-sense amplifier (PWM rejection): INA240A3 / INA240A4 (shunt sensing).
- Hot-swap / inrush & power limiting (domain input): LM5069 (9–80V class hot-swap controller).
- Ideal-diode OR-ing (A/B domain combine): LTC4359 (ideal diode controller with reverse protection).
- ΔP sensor (digital differential pressure example): SSCDRRN002ND2A3 (differential pressure sensor family example).
- Liquid flow sensor (I²C example): SLF3S-1300F (liquid flow sensor example).
- Coolant/board temperature sensor (digital): TMP117 (0.1°C class digital temperature sensor).
- Humidity sensor (condensation context): HDC3022 (RH sensor example for condensation correlation).
- Capacitive sensing for leak / liquid presence (front-end): FDC1004 (capacitance-to-digital converter).
- EEPROM (calibration constants / identifiers): 24AA02 (2Kb I²C serial EEPROM).
- Supervisor / watchdog (anti-hang & reset discipline): TPS3828 (voltage supervisor with watchdog variants).
6) Figure V11 — Fault-injection validation matrix (visual checklist)
Rows: injected faults / boundary conditions. Columns: expected actions, continuity constraints, and logging evidence. Text in the figure is kept minimal (mobile-readable).
H2-12 | FAQs (Liquid Cooling Manifold & Pump Control)
Each FAQ stays within the pump-controller scope: BLDC drive behavior, Flow/ΔP/Temp signal conditioning, priming/dry-run/leak logic, redundant power-domain switchover, and local telemetry & fault logs (no CDU/plant, no BMC protocol stack, no PSU topology).
Why can the pump spin but flow never establishes?
→ H2-4 / H2-2
Separate “motor rotation” from “hydraulic circulation.” If RPM rises but Flow≈0 and ΔP≈0, priming/air pockets or dry-run is likely.
If ΔP rises while flow stays low, loop restriction is more likely. Use a PRIME state with limited acceleration, time windows, and retries with cooldown.
Log PRIME failures with snapshots (RPM/Flow/ΔP/Idc/Tin/Tout) for repeatable diagnosis.
If Flow readings jump, is it bubbles or sensor noise—and how to tell?
→ H2-5 / H2-4
Use cross-checks instead of Flow alone. Bubble-driven jitter often aligns with ΔP oscillations and priming phases, while pure electrical noise may not correlate with
ΔP, RPM, or motor power. Apply a control-grade filter (window/IIR) plus a “confidence flag.” When confidence drops, treat Flow as diagnostic-only
and fall back to conservative control (speed or ΔP) until stability returns.
Speed control or ΔP control—which is more stable, and what are the common traps?
→ H2-9
Speed control is the safest baseline but does not guarantee flow under changing loop impedance. ΔP control can track impedance changes better, but only if ΔP is control-grade (low drift, low noise, correct placement). Common traps: speed control under-delivers flow after restrictions change; ΔP control “chases noise” when bubbles or drift corrupt ΔP. Use mode switching with hysteresis and minimum dwell time to prevent flapping.
How does ΔP sensor zero drift mislead control, and what online compensation works?
→ H2-5 / H2-9
A positive drift makes the controller believe pressure is “already high,” reducing pump output and starving flow; a negative drift can over-drive the pump, raising noise and cavitation risk. Practical online compensation: update a slow offset only in known low-energy windows (e.g., pump stopped or very low speed with stable conditions), clamp offset rate-of-change, and invalidate ΔP control when plausibility checks fail (Flow/Idc/Tin–Tout patterns disagree).
Why does dry-run detection false-trip, and when are delay + cross-check required?
→ H2-4 / H2-8 / H2-6
False trips often happen during priming (air, bubbles), at low temperature (viscosity changes), or when a single sensor glitches. Use a delay so PRIME transients are not treated as dry-run, then require cross-checks: persistent low Flow plus low/unstable ΔP plus abnormal power/temperature slope (at least two independent cues). Apply graded actions (Warn → Derate → Stop) and log the exact rule that triggered.
How to distinguish stall vs cavitation using current, ΔP, and flow signatures?
→ H2-8 / H2-4
Stall tends to show high motor current/power with poor RPM and no flow establishment; driver temperature may rise quickly. Cavitation more often shows oscillatory ΔP and Flow,
intermittent flow collapse, and unstable operating points; current may not spike as hard as a stall. Response differs: stall favors shutdown and bounded retries; cavitation favors derate,
softer targets, and extended priming until stability returns.
Why can protections cause repeated start/stop oscillation, and how to fix it?
→ H2-8 / H2-4
Oscillation usually comes from tight thresholds without hysteresis, no minimum dwell time, and aggressive retries that re-enter PRIME before the loop stabilizes. Fix with a clear state machine (START → PRIME → RUN → DEGRADED → FAULT), hysteresis on key conditions, bounded retry budget, and cooldown windows. Severe faults should latch until a safe reset condition. Always log “reason code” and the last two snapshots to reveal the loop.
How to prevent leak-probe false alarms in condensation-heavy environments?
→ H2-6
Treat leak sensing as a graded decision, not a single threshold. Use debounce time and require spatial/temporal consistency.
Condensation can be managed by correlating the leak signal with humidity/temperature context (dew-risk) and hydraulic plausibility (Flow/ΔP/temperature trends).
For Leak_Suspect, derate and keep monitoring; for Leak_Confirmed, escalate actions. Example building blocks include humidity sensors (e.g., HDC3022) and
liquid-presence front-ends (e.g., FDC1004) depending on the probe method.
During redundant power-domain switchover, how to minimize speed/flow disturbance?
→ H2-7
Keep the control rail alive while switching only the power stage domain. Use OR-ing/ideal-diode paths and controlled hot-swap behavior, plus enough DC-link energy to bridge brief sags. During switchover, temporarily clamp torque/acceleration and hold integrators to avoid over-correction. Detect sag early (domain status + Vbus trend), then log the switchover event with a before/after snapshot. Example parts: ideal-diode controllers (e.g., LTC4359) and hot-swap controllers (e.g., LM5069).
What minimum log fields are needed to replay an intermittent flow drop?
→ H2-10
Minimum fields must explain “what the controller decided” and “what the loop did.” Log: state/mode, target (speed/ΔP), clamps (torque/power), RPM, Flow, ΔP, Tin/Tout, driver temperature, Vbus and domain A/B status, DC current (Idc), fault flags, and a relative timestamp. For intermittent issues, store a short snapshot window around the trigger (pre/post) plus counters (retry count, switchover count). Keep calibration IDs in EEPROM for traceability (e.g., 24AA02).
How can production quickly verify sensors are not swapped or drifting?
→ H2-11 / H2-5
Use fast proxies that validate sign, scaling, and plausibility. For ΔP: apply a known differential (or a controlled electrical injection for the AFE path) and verify polarity and range. For Flow: verify direction/response consistency (expected pulse/I²C change) and confirm plausibility against a simple pump step. For temperature: apply a short thermal step and verify response time and slope. Store calibration constants and a pass/fail digest into the unit record for traceability.
Trapezoidal or FOC drive—when can trapezoidal be more reliable?
→ H2-3 / H2-11
Trapezoidal commutation can be more reliable when sensing resources are limited, EMI is harsh, or deterministic protection behavior is prioritized over acoustic performance. FOC typically needs accurate current sensing, stable parameters, and robust tuning across temperature and aging; if validation coverage is incomplete, field behavior can be unpredictable. In contrast, a simpler trapezoidal drive with strong stall/priming logic and bounded retries can be easier to test and certify. Example gate-driver building blocks include DRV8323-family devices.