Building Intercom Master Hardware Guide
← Back to: Security & Surveillance
Multi-zone talk/listen and paging hub with mic-array AEC, multichannel A/V codec, internal Ethernet switching, PoE PSE ports, and time-stamped diagnostics.
H2-1. Definition & Page Boundary
A Building Intercom Master is the central audio routing and supervision console that terminates multiple indoor/outdoor stations, provides full-duplex talk/listen and paging, and keeps the system stable by controlling audio quality, network reliability, port power policy, and time-stamped event evidence.
- Audio quality at scale: keep duplex voice usable across multiple endpoints using mic-array front-end + AEC/NS/AGC, with measurable headroom and clipping control.
- Deterministic delivery: route voice/control over the internal switch with jitter awareness, prioritization concepts, and clear health counters (no protocol tutorials).
- Power + fault isolation: supply and supervise stations via PoE PSE ports with per-port allocation, thermal/over-current behavior, and recoverable retry policy.
- Door-station imaging, door relays, access readers, or endpoint sensor/AI details.
- Standalone PoE switch architecture or building-wide network design.
- Cloud/app ecosystems, recording platforms, or deep security key management.
| Evidence class | What to measure first | What it proves (fast discriminator) |
|---|---|---|
| Audio path | Per-mic noise floor & gain match, codec input level, DSP headroom/clipping counters, end-to-end latency. | Separates acoustic/AEC issues from gain staging or CPU overload before touching network settings. |
| Network path | Per-port link state, error frames, queue drops (if exposed), jitter/late-packet counters, endpoint disconnect events. | Proves whether audio breakup is transport jitter/loss vs codec/DSP artifacts. |
| PoE & rails | Per-port power draw/allocation, over-current/thermal flags, brownout/UVLO events, rail ripple during port inrush. | Distinguishes port power cycling from endpoint faults and prevents “random reboot” misdiagnosis. |
| Time & logs | Timestamp monotonicity after reboot, RTC holdover, sync reacquisition time, event reason codes (port off, retry, DSP overload). | Creates a replayable incident timeline for field service without requiring external capture tools. |
H2-2. System Roles & Topologies (What connects to what)
This chapter locks the deployment boundary and turns “topology talk” into a computable design checklist: endpoint scale, concurrent audio load, uplink capacity, PoE budget, latency/jitter targets, and failure domains.
- Star (most common): the master directly powers/links multiple stations. Strong fault isolation, but higher port count and concentrated power/thermal design.
- Logical chain / tiered extension: used when physical constraints exist. Saves ports, but increases sensitivity to jitter, intermediate outages, and broadcast storms (do not treat it like a free port multiplier).
| Checklist item | Why it matters (what it constrains) | Evidence to log/monitor |
|---|---|---|
| Endpoint count (indoor/outdoor) | Determines switch port count, PoE budget, and aggregate control traffic. | Per-port link uptime, disconnect reason codes, endpoint heartbeat failures. |
| Concurrent calls (duplex streams) | Sets DSP/codec throughput and the latency headroom for AEC + jitter buffering. | DSP load/headroom counters, late-packet counters, audio underrun/overrun flags. |
| Paging zones (priority levels) | Forces predictable mixing/routing and prevents paging from starving talk/listen paths. | Mixing path selection logs, zone activation timestamps, queue drops (if exposed). |
| PoE power budget (per-port / total) | Controls startup sequencing, port priority shedding, and thermal design. | Per-port power draw, allocation decisions, OCP/OTP events, retry counts. |
| Latency target (end-to-end) | Constrains capture/DSP/buffer choices; too much buffering breaks duplex “feel.” | Measured audio round-trip delay, buffer depth stats, CPU/DSP overload events. |
| Jitter tolerance | Defines how aggressive prioritization must be and how robust the playback path should be. | Jitter estimate stats, reorder/late counters, per-port error frames. |
- Power domain coupling: PoE inrush or overload can corrupt local audio rails unless isolation and policy are defined (shows up as brownout/UVLO + audio artifacts).
- Switch domain coupling: a single noisy port (errors/storms) can raise jitter for multiple streams unless containment counters and policies exist.
- DSP domain coupling: AEC/NS overload affects all calls simultaneously; overload must be observable and tied to a controlled degradation mode.
H2-3. A/V Pipeline Overview (Mic → Network → Speaker)
A building intercom master is best debugged as a signal-flow system, not a feature list. This chapter defines where audio can degrade (capture / DSP / transport / playback), and the minimum measurement tap points that prove which stage is responsible—before any tuning or network changes.
- Capture: Mic Array → AFE / ADC → (TP1: Mic PCM) → DSP prep.
- DSP: AEC / NS / AGC → (TP2: Post-AEC PCM) → Encode / Packetize.
- Transport inside the box: Queueing + jitter tolerance (concept) → Depacketize → Jitter Buffer.
- Playback: Decode → Mix → (TP3: Post-Decode PCM) → DAC → Class-D Amp → Speaker/Handset.
- Echo loop alignment: the speaker reference must match the acoustic return path (wrong tap point or delay mismatch yields residual echo).
- Latency budget: excessive buffering improves dropouts but degrades duplex “feel” and can destabilize AEC under double-talk.
- Double-talk handling: when both ends speak, aggressive suppression can erase near-end speech unless headroom and reference routing are correct.
- TP1 — Mic PCM: proves front-end noise floor, gain mismatch, and clipping before DSP touches the signal.
- TP2 — Post-AEC PCM: proves whether residual echo is created by AEC/reference alignment vs later transport artifacts.
- TP3 — Post-Decode PCM: proves whether breakup originates from buffering/late packets vs the analog amplifier/speaker chain.
| Stage | What sets it | Typical contributor | Evidence to record |
|---|---|---|---|
| Capture framing | Frame size / hop size at the input; ADC/DMA chunking. | Small but non-zero; grows with longer frames. | Input frame duration, DMA period, “input overrun” counters. |
| DSP processing | AEC filter/partitioning, NS/AGC windowing, CPU/DSP headroom. | Can be stable or spike under overload. | DSP load/headroom, clipping/limiter flags, ERLE/residual indicators (if exposed). |
| Encode / packetize | Codec frame duration and packet aggregation policy. | Often quantized by codec frame size. | Encoder frame size, queue depth, “encoder underrun/overrun” counters. |
| Jitter buffer | Target jitter tolerance and late-packet behavior. | Largest controllable latency bucket. | Buffer depth stats, late/reorder counters, playout adjustments. |
| Decode + mix | Stream count, mixing paths, resampling avoidance. | Usually bounded unless resampling thrashes. | Decode time stats, mix path selection logs, “audio underrun” events. |
| DAC + amp | Output filters, limiter/thermal behavior, rail stability. | Small; issues manifest as distortion/pumping. | Limiter/thermal flags, rail brownout events during high volume. |
H2-4. Mic Arrays & Acoustic Front-End (Designing for AEC Success)
AEC performance is capped by geometry, front-end noise/headroom, and mechanical/acoustic coupling. When duplex audio fails, the fastest path is to prove whether the master is limited by mic mismatch, phase delay, or an uncontrolled echo path—before any DSP parameter changes.
- 2 mics: lowest cost and simplest; limited spatial discrimination; most sensitive to enclosure resonances.
- 4 mics: common baseline; workable coverage with moderate calibration effort; good for wall/desk consoles.
- 6–8 mics: best far-field and robustness; higher channel count and tighter manufacturing/bring-up discipline.
- EIN / noise floor: sets intelligibility in quiet rooms; excessive EIN manifests as constant hiss at far-end.
- Dynamic range & clipping margin: prevents “crackling” on loud speech or paging; insufficient headroom triggers AEC instability.
- Mic bias & input impedance: poor biasing shows up as unit-to-unit drift or channel imbalance.
- Anti-alias bandwidth: too aggressive reduces clarity; too wide risks folding noise into voice band.
- Porting & leakage: uncontrolled vents and seams create unpredictable echo paths and wind/handling noise.
- Vibration: speaker vibration couples into mic capsules and creates non-acoustic “echo-like” artifacts.
- Speaker-to-mic path: distance and obstruction matter more than DSP claims—treat as a first-class design parameter.
- Sensitivity mismatch: channel gain differences must be bounded so beamforming/AEC does not chase moving targets.
- Phase delay mismatch: stable per-channel delay is required for consistent spatial response and echo alignment.
- Step 1 — Per-channel noise floor: measure silent-room PCM RMS at TP1 to isolate AFE noise and bias issues.
- Step 2 — Gain match: play a fixed tone/speech at fixed distance; compare channel levels; flag outliers before DSP.
- Step 3 — Impulse/latency alignment: run a short impulse test; estimate inter-channel delay spread; verify stability across reboots.
- Step 4 — Echo path sanity: increase playback volume; confirm the AEC reference path reduces residual echo at TP2 without pumping.
- Step 5 — Mechanical sensitivity: tap/handle the enclosure; if artifacts appear at TP1, address vibration/porting before tuning DSP.
H2-5. Echo Cancellation / Noise Reduction / AGC (Evidence-Based Tuning)
This chapter turns “sounds bad” into provable causes: reference pick-off errors, headroom/level control issues, or real-time overload. It focuses on what to verify (tap points + counters + trends) and the first fix that restores stable full-duplex—without algorithm lectures.
- Correct reference: a digital copy of the actual speaker drive signal in the playback path, close to the final output queue/FIFO.
- Avoid reference pollution: UI tones/alerts should not “appear/disappear” relative to the reference. Keep routing consistent.
- Delay alignment matters: a valid reference with the wrong delay behaves like a wrong reference (residual echo persists at TP2).
- Keep it in one timebase: reference and capture must remain coherent (avoid implicit resampling in the critical path).
- Residual echo: reference tap/delay mismatch, or a playback path that differs from the reference (mix/volume/limiter applied elsewhere).
- Pumping AGC: aggressive gain swings amplify background noise between speech, or fight with noise reduction ordering.
- Double-talk distortion: near-end speech is suppressed or clipped when both sides speak; headroom and double-talk gating are suspect.
- Intermittent collapse: real-time overload causes buffer underruns/late processing, making tuning appear inconsistent.
- ERLE trend: should improve and stabilize when reference + delay are correct; flat/erratic ERLE points to mismatch or pollution.
- Clipping/limiter counters: spikes imply insufficient headroom (TP1/TP2) or aggressive output limiting (playback path).
- Far/near level stats: a stable near-end speech level with bounded AGC gain indicates healthy dynamics.
- CPU/DSP headroom: low headroom correlates with sporadic audio underruns, broken duplex, and “random” artifacts.
| Symptom | Evidence to capture | Discriminator | First fix |
|---|---|---|---|
| Far-end hears strong echo | TP2 residual energy, ERLE trend, reference tap location + delay setting. | ERLE does not improve when speaker level changes. | Move reference pick-off closer to speaker drive; re-align delay. |
| Noise “breathes” between words | AGC gain trace, near-end level stats in silence, NS on/off compare. | Gain rises rapidly during gaps and lifts noise floor. | Slow release / raise gate; stabilize NS→AGC ordering. |
| Double-talk becomes thin / choppy | TP1/TP2 peaks, clipping flags, double-talk indicator (if available). | Near-end energy drops at TP2 while TP1 is healthy. | Increase headroom; reduce over-aggressive suppression; verify reference is not polluted by UI sounds. |
| Occasional “robotic” artifacts | Underrun/overrun counters, CPU/DSP load spikes, buffer depth stats. | Artifacts correlate with headroom dips or underruns. | Fix real-time margin (load partitioning/buffer policy) before tuning filters. |
| Echo only at high volume | Output limiter/thermal flags, TP2 residual vs volume sweep. | Residual echo rises sharply past a limiter threshold. | Align reference after volume/limiter stage (digital), or reduce distortion in the output path. |
H2-6. Multichannel Codec & Audio Routing (Selection and Partitioning)
This chapter translates system requirements into a channel map, a clock plan, and a routing architecture that keeps the AEC critical path coherent. The focus is on avoiding hidden resampling and reference pollution, while keeping analog and digital domains separated at a block level.
- Inputs: Mic1..MicN (array), optional line-in, optional handset/headset mic.
- Outputs: Speaker/handset out, optional line-out/PA, optional monitor/record tap.
- Critical path: mic capture → AEC/NR/AGC → encode/packetize; playback decode → mix → speaker drive with a clean reference tap.
- PDM: easy wiring for many mics, but clock quality and edge noise can leak into the voice band if margin is poor.
- I²S/TDM: explicit slot planning scales well; coherent clocks reduce the need for resampling in multi-stream systems.
- Control (I²C/SPI): configuration only; does not define audio timing. Timing is defined by Fs + MCLK + slot plan.
- Keep the AEC path isolated: avoid mixing UI sounds into the reference inconsistently across modes.
- One-rate design: aim for a single primary Fs across capture and playback; treat any secondary Fs as a deliberate, documented exception.
- Bounded mixing: mixing multiple zones is fine; mixing multiple sample-rate domains without a plan is not.
- Analog codec/AFE domain: sensitive to noise and bias stability; weakness shows as higher TP1 noise floor and channel mismatch.
- Digital SoC/DSP domain: workload spikes show up as underruns and intermittent artifacts.
- Amp/speaker domain: large current swings; instability manifests as distortion, limiter events, or duplex collapse at high volume.
| Signal | Dir | Interface | Fs | Word / Slot | Clock master | Notes |
|---|---|---|---|---|---|---|
| Mic1..MicN | In | PDM or TDM | Primary Fs | Bit depth / slots | Codec or SoC | AEC critical path; avoid SRC. |
| Speaker Out | Out | I²S/TDM | Primary Fs | Word / slot | Codec or SoC | Place ref tap near final drive. |
| Ref Tap | Out | Internal | Primary Fs | N/A | Same as speaker | Must match speaker drive timing. |
| UI Tone | In/Logic | Internal | Primary Fs | N/A | Same domain | Inject consistently to avoid mismatch. |
| Line-Out / PA | Out | I²S/TDM | Primary Fs | Word / slot | Codec | Keep separate from AEC ref if routed differently. |
| Optional handset | In/Out | I²S or analog | Primary Fs | Word / slot | Codec | Document mode switches and mixing policy. |
H2-7. Ethernet Switch Inside the Master (QoS/VLAN/Multicast—Only What Matters)
In a Building Intercom Master, the internal switch is not “just networking”—it is the traffic junction between uplink (building LAN) and multiple PoE downlinks (stations/endpoints). Audio reliability depends on keeping voice/control resilient under microbursts from video and bulk transfers, and on having counters that prove where loss or jitter originates.
- Uplink port: aggregation point. Microbursts happen when multiple downlinks transmit concurrently (common cause of jitter without full bandwidth saturation).
- Downlink ports (PoE): endpoint-facing. Link flaps and CRC/FCS errors are common indicators of cabling, EMI, or power-coupled issues.
- Traffic classes: Voice/Control (small packets, jitter-sensitive), Video (high bitrate, continuous), Bulk (firmware/logs, bursty).
- Priority: voice/control must map to a higher-priority queue than video/bulk.
- Jitter vs buffer: shallow buffers drop under bursts; deep buffers raise latency (“slow talkback”). Prefer predictable queueing.
- Consistency: classification must be stable across modes; inconsistent marking causes “intermittent” voice artifacts.
- Segmentation: separate endpoint traffic domains to limit fault propagation (voice/control vs bulk).
- Port isolation: prevent endpoint-to-endpoint flooding when not required by topology.
- Operational goal: a bad endpoint should not destabilize the whole master via broadcast/multicast spillover.
- Broadcast storms: a single misbehaving device or loop can raise broadcast/unknown-multicast counters and starve voice queues.
- Containment goal: unknown multicast/broadcast should not be flooded across all PoE downlinks indiscriminately.
- Storm detect flags: if available, treat storm detection as a first-class alarm tied to link/queue telemetry.
| What to log | Threshold style | What it indicates | First response |
|---|---|---|---|
| Per-port link up/down | Any repeat pattern (minutes/hours), or bursts correlated with voice drops | Cable/connector issues, EMI coupling, or power-related instability | Check link partner, cabling, and power events (see H2-8). |
| CRC/FCS errors | Rising trend rate (not just absolute count) | Physical-layer quality degradation (noise, impedance, ground issues) | Inspect cable/termination; correlate with PoE load steps. |
| Queue drops (per-queue) | Any drops in voice/control queue during calls | QoS misclassification, insufficient priority, or microburst congestion | Fix traffic class mapping; reduce bulk bursts; adjust queue policy. |
| Rx/Tx drops (port) | Spikes during firmware/log transfers | Buffer exhaustion or congestion at uplink/downlink | Stagger bulk tasks; enforce rate limits; verify uplink margin. |
| Broadcast/multicast counters | Sudden step increase or sustained high rate | Storm/loop/misbehaving endpoint | Isolate suspect port; confirm storm detect flag; restore segmentation. |
| Storm detect flag | Any assertion event | Broadcast/multicast containment failure | Immediate containment: isolate port class; preserve logs for root cause. |
H2-8. PoE PSE Subsystem & Power Tree (Budgeting, Protection, Graceful Behavior)
The master is a power distributor as well as an A/V hub. PoE port events (detection, inrush, overload, thermal) can couple into local rails and destabilize audio or switching unless the system enforces a clear budget, priority shedding policy, and observable power telemetry.
- Detection/classification: identify a valid PoE load before enabling power, to avoid wasted retries and false faults.
- Per-port allocation: each port has a limit; the system must also respect a total power ceiling.
- Priority shedding: when total power approaches limit, shed lower-priority ports first to keep core services stable.
- Protection: short/overcurrent/overtemp must produce a clear reason code and a controlled retry schedule.
- Audio symptom coupling: buffer underruns, “robotic” voice, or link renegotiations can be downstream of rail dips.
- What to prove: correlate PoE total current steps and per-port power changes with local-rail monitors and underrun counters.
- Graceful response: reduce or shed low-priority ports before core rails cross a brownout threshold.
- Primary input feeds a high-voltage PoE rail for PSE and separate regulated rails for local compute and audio.
- PSE rail(s) supply PoE ports; total power monitoring and per-port current sense are essential.
- Local rails (codec/SoC/switch/UI) must be protected from PoE load steps via regulation and control policy.
| Item | Priority tier | Limit / class | Typical | Peak / inrush | Overload policy |
|---|---|---|---|---|---|
| Port 1 | Tier-0 | Per-port cap | Typical W | Peak W / inrush note | Protect first; shed last. |
| Port 2 | Tier-1 | Per-port cap | Typical W | Peak W / inrush note | Shed before Tier-0 if needed. |
| Port 3 | Tier-2 | Per-port cap | Typical W | Peak W / inrush note | Shed first on overload. |
| Local: SoC/DSP | Core | Rail budget | Typical W | Peak W | Never brown out; protect with policy. |
| Local: Codec/AFE | Core | Rail budget | Typical W | Peak W | Noise-sensitive; avoid dips/steps. |
| Local: Switch/PHY | Core | Rail budget | Typical W | Peak W | Prevent link flaps under load steps. |
| Total PoE | System | Total cap | Typical W | Peak W | Triggers priority shedding stages. |
| Fault type | Immediate action | Retry schedule | Latch-off rule | Event log |
|---|---|---|---|---|
| Short / severe overcurrent | Fast shutdown + fault code | Backoff retries (increasing delay) | After repeated failures, latch until manual/maintenance window | Port ID + reason + timestamp + peak current |
| Overload (near limit) | Current limit or controlled shutdown | Retry after cool-down | Latch if thermal or repeated overload persists | Port power + duration + shedding stage |
| Overtemperature | Reduce/disable port, protect PSE | Retry after temperature falls | Latch if temperature re-trips quickly | Temp + port + PSE state |
| Undervoltage / brownout | Enter shedding stage, preserve core rails | Recover in stages (avoid all-ports restart) | Latch low-priority ports if repeated brownouts occur | Total power + rail minima + reboot reason |
- Trigger: brownout monitor asserts a flag when key rails approach minimum.
- Action: flush event logs and mark a last-known-good state before rails collapse.
- Goal: preserve timestamps, fault reasons, and avoid corrupted storage/state after power events.
H2-9. Time Sync & Clocking (Logs, Correlation, Scheduled Audio)
Time in a Building Intercom Master is an engineering dependency: it makes event logs trustworthy, enables cross-device correlation during incidents, and keeps scheduled paging/call records consistent. This chapter focuses on what time must prove and how to measure it, without turning into a timing-hub page.
- Incident correlation: access events, alarms, link faults, PoE port faults, and call actions must align on a single timeline.
- Call & paging records: start/answer/end times, retries, and dropouts are only actionable when timestamps are stable.
- Distributed paging alignment: scheduled announcements and zone paging should not drift into audible misalignment.
- RTC + NTP (common default): stable wall-time for logs, reasonable alignment across devices, and survivable behavior after reboot.
- PTP-capable (optional): tighter inter-device correlation when supported; treat as a feature level, not a deployment tutorial.
- Monotonic timer (always): guarantees ordering and durations even if wall-time shifts.
- Audio sampling clock: drives ADC/DAC timing and full-duplex stability (audio quality & long-call drift behavior).
- System wall-time: used for logs, records, and incident timelines (needs sync + persistence).
- RTC holdover: preserves time across power loss/offline windows; quality shows up as drift rate.
- Offset: wall-time difference to a reference (NTP/PTP stats).
- Jitter: short-term variability of offset—often worsens under CPU/network load.
- Holdover drift: offset growth while offline (RTC quality & temperature sensitivity).
- Timestamp monotonicity: detect backwards jumps or repeated timestamps in event streams.
- Reboot continuity: confirm timestamps remain plausible across reboot boundaries (paired with boot/session identifiers).
- Offset (avg/max): record average and worst-case offset against the reference over representative load conditions.
- Jitter (p95/p99 or peak-to-peak): verify stability; spikes indicate contention, queueing, or time-client instability.
- Holdover drift rate: disconnect network time input and log offset growth over time (hours) to characterize RTC holdover.
- Timestamp monotonicity: flag any negative time jumps; detect duplicate timestamps for high-rate events.
- Reboot continuity: after reboot, confirm wall-time remains plausible and the system marks a new boot/session ID.
- Sync reacquisition time: measure time-to-stable-offset after link restoration.
H2-10. Validation Plan (Bench Tests That Prove It Works)
This validation plan turns the master station into an engineering-grade deliverable: each test is defined by setup, metric, pass criteria, and the log fields required to prove outcomes. It covers worst-case behaviors across audio, networking, PoE power, and timing.
- Worst-case first: validate under load, temperature, and port power events—then measure normal conditions.
- Evidence-driven: every failure mode must map to a measurable counter, flag, or timestamped record.
- Cross-domain correlation: align audio artifacts with network counters and power events using the time evidence checklist.
| Test item | Setup (bench) | Metric | Pass criteria | Required log fields |
|---|---|---|---|---|
| End-to-end talk latency | Call path active; measure from near-end mic stimulus to far-end speaker output at multiple endpoints | One-way latency + jitter | Stable within target; no step changes during background load | ts_wall, ts_mono, call_id, jitter buffer stats, underrun counters |
| Double-talk robustness | Near-end and far-end speak simultaneously; vary levels and distance | Echo audibility + clipping/AGC events | No persistent echo; no severe distortion/pumping under double-talk | ERLE trend (or equivalent), clip counters, AGC state, CPU headroom |
| Max volume echo stress | Speaker volume swept to worst case; steady near-end voice | Residual echo level vs volume | Residual echo remains bounded; no runaway oscillation | AEC state, reference pickoff status, clip/limiter flags, ts_mono |
| Noise / NR stability | Inject HVAC/ambient noise; vary SNR; log over long duration | Noise floor + speech clarity proxy | No “pumping”; speech remains intelligible; no periodic artifacts | NR mode, gain stats, noise estimate, CPU load, ts_wall |
| Network jitter under load | Generate concurrent video streams + bulk transfers while voice call runs | Voice packet jitter/loss + queue drops | Voice queue drops stay at zero (or within policy); voice remains stable | per-queue drops, per-port errors, voice stream stats, ts_mono |
| QoS classification proof | Toggle voice/control traffic and verify counter movement in intended queues | Queue mapping correctness | Voice/control always lands in priority queue; video/bulk never starves it | queue counters by class, rule-hit counters (if available), ts_wall |
| Broadcast storm containment | Inject abnormal broadcast/multicast rates from one port | Storm flags + impact on voice | Storm is detected/contained; voice stability maintained where expected | broadcast/multicast counters, storm flag, isolated port ID, ts_mono |
| PoE port startup sequencing | Power multiple endpoints; vary order and simultaneous plug-in events | Rail dip + reboot/underrun correlation | No core rail brownout; audio does not collapse during port power ramps | Total PoE power, per-port power, rail minima flags, underrun, reboot reason |
| PoE overload & shedding | Increase load to total cap; verify shedding tiers | Which ports drop + recovery behavior | Tier-2 sheds first; recovery is staged; no oscillatory restart loop | shedding stage, port priority, fault reason, retry schedule, ts_wall |
| Thermal worst-case (PoE + audio) | Elevated ambient; sustained PoE load + active calls | Thermal flags + performance stability | No uncontrolled shutdown; clear throttling/fault reporting if limits hit | temps, throttling flags, port faults, audio stats, ts_mono |
| Timestamp monotonicity | High-rate event generation; long run; induce load spikes | Backward jumps / duplicates | No negative time jumps; duplicates handled with sequence fields | ts_wall, ts_mono, event_seq, boot_id, source_id |
| Reboot continuity & sync reacquisition | Reboot during normal operation; then restore network time after offline window | Time plausibility + reacquisition time | Wall-time remains plausible; boot/session boundary is explicit; reacquisition logged | boot_id, sync_state, offset/jitter, holdover drift, ts_wall |
H2-11. Field Debug Playbook (Symptom → Evidence → Isolate → Fix)
This chapter offers a streamlined playbook for diagnosing common issues in a master station using minimal tools. The focus is on providing a decision tree for high-frequency symptoms, accompanied by the first two measurements, the discriminator, and the first fix for each.
- First 2 Measurements:
- Codec input level: Measure the audio signal level on the input of the codec.
- ERLE (Echo Return Loss Enhancement) statistic: Monitor the ERLE values to detect the amount of echo suppression.
- Discriminator: If ERLE is poor but network errors (such as jitter or packet loss) are not significant, the issue is likely with the acoustic coupling or incorrect AEC (Acoustic Echo Cancellation) reference routing.
- First Fix:
- AEC reference routing: Adjust the tap points for the echo path to optimize suppression.
- Gain staging: Ensure the mic gain is correctly set to avoid audio distortion and feedback.
- First 2 Measurements:
- Codec input/output level: Check for input clipping and ensure the output signal is consistent.
- Jitter buffer underrun: Measure the buffer underrun rate to ensure smooth audio transmission.
- Discriminator: If the jitter buffer underrun is high, it indicates network congestion or insufficient bandwidth for stable transmission, causing audio to break up.
- First Fix:
- QoS (Quality of Service) configuration: Ensure that voice traffic is prioritized over bulk data traffic.
- Network routing optimization: Check if the network is properly routed for low latency and no packet loss.
- First 2 Measurements:
- Port power and link status: Verify if the PoE (Power over Ethernet) is supplying power to both the device and the network link.
- Packet loss: Check for packet loss on the network.
- Discriminator: If there is no packet loss and PoE power is stable, but only one side of the call can hear, it indicates a codec misconfiguration or a failure in the audio path.
- First Fix:
- Port power policy: Recheck the power supply to ensure proper PoE allocation.
- Codec configuration: Verify both transmit and receive channels are configured correctly in the codec.
- First 2 Measurements:
- Codec input/output level: Check for clipping or signal distortion at high volumes.
- Signal-to-noise ratio (SNR): Measure the SNR to determine if the distortion is caused by noise.
- Discriminator: If clipping occurs at high volumes but the SNR is acceptable, the issue is likely due to insufficient dynamic range or gain setting in the codec.
- First Fix:
- Gain staging: Adjust the codec input/output gain to prevent clipping.
- Audio pipeline check: Ensure the signal path is not overdriven at any stage.
- First 2 Measurements:
- PoE power status: Monitor the power-up sequence for each PoE port.
- Link errors: Check for link status changes when the PoE ports are power-cycled.
- Discriminator: If call drops coincide with PoE port cycling and link errors occur, the issue is likely power-related.
- First Fix:
- Power sequencing: Ensure that PoE ports are correctly sequenced for startup and shutdown.
- Hold-up strategy: Implement a buffer to hold PoE power for a few milliseconds to avoid abrupt power changes during reboot.
H2-12. FAQs ×12
Answer: If echo is heard only in one room, it may be caused by either improper AEC reference routing or a sound reflection issue in the room. Check the AEC reference tap points and inspect room acoustics (microphone-to-speaker path).
Answer: When echo worsens with volume, check for clipping at the input, AGC pumping, or insufficient DSP headroom. Increase DSP headroom and fine-tune AGC settings to prevent excessive gain adjustments.
Answer: If voice breaks up under heavy traffic, check if QoS is properly configured to prioritize voice packets. If not, increase the jitter buffer size to handle network delay fluctuations better.
Answer: Frequent disconnections could be due to faulty cabling, electromagnetic interference (EMI), or switch port errors. Inspect the cables and check for link errors or port configuration issues in the switch.
Answer: PoE port reboots may be caused by power allocation issues or thermal limits. Ensure that PoE power is properly distributed and that thermal protection is functioning as expected.
Answer: Paging delays could be due to latency budget overflow between zones. Review the latency budget for audio and data transmission to ensure that all zones are in sync.
Answer: If only some microphones sound noisy, check the mic bias or AFE noise level. Mechanical leakage from mic ports or poor sealing could also cause interference.
Answer: If AEC stops working after a firmware update, check if the channel map or clocking configuration was altered. Restoring the previous settings can fix this issue.
Answer: If timestamps jump after reboot, check for RTC holdover or time synchronization reacquisition issues. Ensure that the RTC is correctly maintaining time during power cycles.
Answer: Low volume despite clean audio could be due to improper gain staging or amp power rail sag. Check the gain settings and ensure that the amplifier is receiving adequate power.
Answer: Intermittent howling could be due to a feedback loop or a mismatch in beamforming steering. Adjust the microphone array configuration and verify the beamforming settings.
Answer: If event logs indicate that the call has ended but the user reports ongoing audio, check for packet path failures. Inspect the event logs for packet loss or jitter issues.