PoE Switch (802.3af/at/bt): PSE Control, Metering & Protection
← Back to: Telecom & Networking Equipment
This page focuses on the PoE power subsystem inside a multi-port Ethernet switch: PSE detection/classification, per-port power control, power budgeting, metering, protections, and thermal derating. It does not expand into timing/PTP, routing features, Wi-Fi, or optical transport.
H2-1 · What PoE switching really includes (boundary + quick spec map)
A “PoE switch” is two systems sharing one chassis: the Ethernet data plane and the PoE power plane. For engineering decisions (and for field troubleshooting), the power plane must be treated as a first-class subsystem: it has its own state machine (detect → classify → power → monitor), its own limits (per-port current/thermal), and its own allocation policy (who gets power when the total budget is tight).
- Boundary (what this page covers) PSE behavior: detection/classification, per-port power path, budget/allocation, metering/telemetry, OC/SC/surge protection, and thermal derating.
- What “af/at/bt” means in practice Not just a label—each step changes pair usage (2-pair vs 4-pair), negotiation expectations, and how much headroom is needed to deliver stable PD power.
- Why “90 W” is rarely “90 W at the PD” Power disappears across the chain: budget reserves → per-port limits → cable I²R loss → thermal derating and protection behaviors.
| Standard | Pairs / Type | Negotiation signals | Engineering focus | Common “gotcha” |
|---|---|---|---|---|
| 802.3af Type 1 |
Primarily 2-pair | Detect + Class (basic) | Compatibility, clean detection, stable MPS | False detection caused by port leakage / protection parts |
| 802.3at Type 2 |
2-pair with higher power | May use 2-event classification | Inrush handling, per-port current limits, thermal headroom | PD inrush triggers foldback → “power-on then drop” |
| 802.3bt Type 3/4 |
Often 4-pair for higher power | Enhanced classing + optional LLDP power (policy) | Budgeting across many ports, 4-pair delivery, derating strategy | Total budget seems enough, yet ports drop due to heat/priority policy |
- Class ≠ guaranteed delivered wattage. Class is primarily a negotiation/management signal; delivery depends on port limits, cable loss, and thermal policy.
- “Nameplate PoE++ power” is not a field promise. Under dense loads, temperature and protection behavior often become the real limiting factors.
Practical rule-of-thumb
When targeting high continuous PD power on many ports, plan for headroom at three places: (1) total supply budget (reserve for worst-case ports), (2) per-port current limiting (avoid nuisance foldback), and (3) thermal derating (adjacent ports and airflow determine sustained power). If any one of these is tight, “works on the bench” can become “flaps in the field.”
H2-2 · System architecture: data plane vs PoE power plane (where PSE lives)
A multi-port PoE switch behaves like a dual-plane machine. The data plane moves packets through switch silicon and PHYs. The PoE power plane negotiates and delivers DC power per port through the magnetics injection path. A third layer—the control plane—ties them together by applying policies (priority, power limits, retry rules) and exporting telemetry (V/I/P/T and faults).
- Data plane (kept PoE-relevant) Switch ASIC → PHY bank → magnetics → RJ45. Only the PoE coupling points matter here: injection, EMC, and how protection parts influence detection/link.
- PoE power plane (PSE subsystem) 54 V PSU/backplane → multi-channel PSE controller bank → per-port high-side switches + sensing → injection at magnetics → PD.
- Control plane (why products differ) MCU/SoC configures PSE policy and reads per-port telemetry/logs. This is where “stable under load” vs “random flaps” is decided.
- Port density + airflow: adjacent ports at high power create hotspots that force derating or protection trips.
- Total supply budget: a shared pool must cover worst-case simultaneous loads and startup bursts; policy reserve prevents cascade failures.
- Per-port protection tuning: inrush, foldback, and retry behavior decide whether PDs boot cleanly or keep rebooting.
Every later chapter maps back to this diagram: detection/classification (state machine), budgeting (allocation), metering (telemetry), protection (fault behavior), and thermal derating (sustained power). If the PSE is placed and managed as a coherent subsystem, PoE becomes predictable.
H2-3 · Per-port power path (PSE channel anatomy) — the stuff that actually burns
A PoE switch becomes predictable only when each port is treated as a power channel with a known loss budget, known thermal hotspots, and known telemetry points. A “working” design can still fail in the field when a port’s heat sources (MOSFET, sense resistor, protection parts, copper) stack up under sustained load or when protection-side leakage distorts what the PSE measures during detection/classification.
- Typical port path (left → right) 54 V bus → ORing/fuse/limit → (optional) hot-swap/eFuse → PSE high-side switch → current sense → injection at magnetics → RJ45/cable.
- Where heat concentrates High-side MOSFET (I²·Rds(on)), sense resistor (I²·Rsense), and nearby copper/connector resistance under high continuous current.
- What must be observable Per-port V/I/P, temperature near the switch element, and fault flags (OC/SC/thermal). Without these, “random flaps” remain guesswork.
- Switch loss: MOSFET conduction loss rises with current; sustained high power is often limited by junction-to-ambient thermal path.
- Sense loss + accuracy: Rsense dissipates heat and its tolerance/temperature drift directly affects metering and protection thresholds.
- Copper & connectors: layout and port density determine whether heat spreads or accumulates; adjacent ports can reduce each other’s headroom.
If a port powers on reliably but drops after minutes to hours under steady load, the first suspects are the MOSFET/sense/copper thermal path and the derating/protection rules—not the PD itself. The same channel map also explains “detect/class glitches” when leakage or protection capacitance alters what the PSE measures.
H2-4 · Detection & classification sequence (why ports flap or never turn on)
PoE power-up is a measured handshake, not a blind “turn-on.” The PSE evaluates what it sees at the port and only then decides whether to apply power and under what limits. Most “mysterious” behaviors—ports that never power, power briefly then drop, or repeatedly reboot—can be explained by locating which state in the sequence fails and what measurement is being distorted (leakage, noise, inrush, or low-power MPS patterns).
- Detection: “Is a valid PD present?” False results often come from port leakage (protection parts, contamination) or coupling/noise near the injection/magnetics region.
- Classification: “What level should be prepared?” 2-event / higher classing can fail when pulses are distorted by leakage/capacitance or measurement noise, leading to wrong limits and immediate drop.
- MPS: “Is the PD still drawing power?” Ultra-low-power or intermittent PD loads can look like “disconnect,” causing shutoff and repetitive reboot cycles.
- Never powers: usually fails in Detect/Class (leakage, noise, wrong port-side protection behavior).
- Powers then drops quickly: often Power-On limit/inrush interaction (port foldback/retry behavior).
- Flaps every seconds/minutes: commonly MPS drop (PD duty-cycles too low or policy window is too strict).
Start from the visible symptom, map it to the earliest failing stage, then verify using port telemetry (V/I/fault flags) and controlled tests (swap cable, swap port, cap-load/inrush test, low-duty MPS test). This avoids “guess-and-change” tuning.
H2-5 · Power allocation & budgeting (the real differentiator in multi-port)
In a multi-port PoE switch, the customer-visible quality is defined by predictable power behavior when the system is tight: how many ports can be powered simultaneously, which ports are protected during overload, and whether the switch degrades gracefully instead of causing random dropouts. The core is a budgeting model (system vs port limits vs concurrency) plus an allocator that turns policy and measurements into per-port grants.
- System budget (the pool) Bounded by PSU capability, thermal headroom, and backplane delivery. This is the real “total available PoE.”
- Per-port cap (the ceiling) Bounded by port channel thermal limits, switch element losses, and protection rules. A port can be capped below its standard label.
- Concurrency (the reality) How many ports can sustain high power at once, considering port density and heat coupling. This decides “all-ports busy” behavior.
- Static (class-based): reserve power from classification results; simple but can waste headroom or mis-handle bursts.
- Dynamic (LLDP policy-level): adjust grants based on negotiated needs and observed usage; improves utilization but needs strong telemetry.
- Priority-aware: preserve critical ports first; prevents “everyone flaps” by making degradation intentional and explainable.
- Deny: reject new power requests when the pool is low.
- Limit: cap a port grant to keep the system stable.
- Borrow: reclaim budget from low-priority ports to satisfy critical ports.
- Drop: power-off selected ports under emergency conditions (budget/thermal/fault escalation) with logs and controlled retry.
Every deny/limit/drop must be triggered by a measurable condition (budget waterline, thermal headroom, fault counters) and must produce an explainable record. If actions are invisible, field behavior looks random even when it is deterministic.
H2-6 · Per-port metering & telemetry (voltage/current/energy, alarms, logs)
Metering is not a “nice-to-have.” It is the evidence layer that makes power policies safe and field debugging fast. The goal is to observe what the PSE believes (its internal voltage/current/temperature estimates and thresholds), and to keep enough event records to separate cable issues, PD behavior, and PSE protection/thermal policy.
- Core electrical Vport, Iport, Pport. Used for budget allocation, limit decisions, and protection verification.
- Energy & trend Energy counters (Wh) and time windows. Used to profile ports and prevent oversubscription surprises.
- Thermal & limits Temperature near the switch element/copper plus derating flags. Explains “drops after minutes.”
- State + faults Detect/class results, MPS status, OC/SC/thermal flags, retry/latch behavior. Turns symptoms into a stage-by-stage explanation.
- Bandwidth vs transients: inrush and burst loads can be averaged out, hiding short overloads that trigger protection.
- Rsense tolerance + drift: temperature shifts the gain; ports can “read low” while internally exceeding a limit.
- ADC/front-end offsets: low-current regions are most sensitive to offset and reference drift.
- Calibration variance: per-port mismatch can cause inconsistent behavior unless calibration/policy margins are aligned.
- Detect/Class failures: counters and last-failed stage.
- Overload/short: threshold hit, duration, and retry count.
- Thermal derate: start/end markers and current cap applied.
- MPS drop: last observed load pattern and shutdown reason.
- Reboot loop: repeated on/off cycles indicating unstable policy or PD behavior.
With V/I/T + event counters, a port problem can be classified quickly: cable-related (voltage sag patterns), PD-related (inrush or low-duty MPS behavior), or PSE-policy/thermal-related (budget waterline and derating correlation). This closes the loop back to the allocator in H2-5.
H2-7 · Protection behaviors (OC/SC/inrush/foldback/surge) — what saves you, what kills you
PoE protection is judged in the field by what the user sees: instant drop, periodic cycling, or a port that looks dead. The difference between “saved the hardware” and “killed the experience” is usually set by a few knobs: current limit threshold, foldback behavior, retry timing/count, and whether faults latch or auto-recover.
- Over-current / short-circuit Instant trip, foldback, hiccup, or latch-off — each creates a different “feels broken” pattern.
- Inrush (startup surge) Large PD input capacitance can look like a fault. Tight limits or short soft-start windows cause false trips.
- Surge / ESD at the RJ45 PoE port Protection is not only the device (TVS) but also the return path and placement that control stress and heating.
- Policy knobs that define behavior Threshold, curve, retries, and reset rules decide whether a port degrades gracefully or flaps endlessly.
- Instant drop on plug-in: hard trip or foldback too aggressive versus inrush; check startup window and limit.
- Periodic power cycling: hiccup retry loop (often inrush or overload). Logs and retry cadence reveal the policy.
- Port looks dead: latch-off with a strict reset condition; requires explicit recovery rules and clear reporting.
Define the threshold that triggers a response, the foldback profile (if used), retry timer and retry count, and latch/auto-recover policy. Without these being visible through counters or logs, deterministic behavior appears random to operators.
H2-8 · Thermal design & derating (why “90W” becomes “60W” in real chassis)
The gap between a label and real deliverable power is often a thermal story. High port density creates heat coupling: adjacent ports at high load push local hotspots over safe limits, and a well-designed switch responds with predictable derating rather than chaotic port dropouts. Derating should be part of the same closed loop as budgeting and allocation.
- Main heat sources (PoE plane) PSE switch element losses, sense resistor dissipation, copper delivery losses, and protection devices under stress.
- Port density & coupling Hotspots form when neighboring ports run high power simultaneously, especially in the same airflow shadow.
- Airflow reality Inlet temperature, fan status, and chassis impedance decide the sustainable power pool more than the standard label.
- Derating as a feature Temperature-driven power limits prevent runaway. Good policies are smooth, logged, and predictable.
- Measure: board / device temperature near the PoE channels plus airflow/fan state.
- Act: reduce per-port grant and/or concurrency before reaching shutdown conditions.
- Explain: log derate entry/exit and applied caps so operators understand why power changed.
- Recover: use hysteresis and stable timers to avoid oscillation (“thermo flapping”).
When thermal headroom shrinks, predictable derating keeps services running at reduced power. Unpredictable behavior usually comes from missing thermal observability or a policy that only reacts at the shutdown edge.
H2-9 · PHY & magnetics coupling pitfalls (when PoE breaks data or detection)
PoE is injected through the magnetics, so the same region that carries Ethernet signals also becomes the entry point for common-mode energy. When coupling is uncontrolled, PoE can destabilize the link, corrupt detection/classification measurements, or leave a port “half-alive” after ESD events. The practical fixes are layout rules: keepout zones, shortest return paths, and protecting the RJ45 region without loading the signal path.
- Center-tap injection Transparent to differential data in theory, but a direct path for common-mode disturbance in practice.
- Common-mode noise + return path Return path geometry decides whether disturbance stays local or contaminates detection and PHY behavior.
- Protection device parasitics TVS capacitance and placement can degrade link margin, especially at higher speeds.
- ESD “half-alive” ports Post-ESD parasitic drift (capacitance/leakage) can create intermittent link or flaky detection.
- Link unstable only when PoE is on: center-tap/common-mode coupling near magnetics and return path control issues.
- Detection/classification flaps: noise/leakage/parasitics contaminating the measurement window near the injection region.
- After ESD the port becomes random: protection network parasitics changed; verify before/after behavior and placement/return.
P1: keep PoE high di/dt loops away from magnetics/PHY region and minimize loop area. P2: place RJ45 protection with the shortest, most direct return path (avoid long “ground” detours). P3: keep detection/measurement traces quiet around the injection region and avoid shared noisy returns.
H2-10 · Validation & compliance checklist (bench + production)
Validation must prove three things: reliable bring-up (detect/class/power), stable sustain (MPS and fault handling), and predictable behavior across cable, temperature, and concurrency. A good plan is matrix-driven and produces artifacts: pass/fail results, counters, and before/after comparisons for stress events.
- Bench setup DUT + patch panel + cable matrix + PD emulators + logging to reproduce field behaviors deterministically.
- Stress matrix Load levels, long/short cables, temperature, multi-port concurrency, and fault injection.
- Port consistency Compare port-to-port results under identical conditions to expose tolerance and thermal coupling issues.
- Production test Fast sequence to verify detection/classification/power path plus targeted fault injection and logging.
- Matrix: cable length/quality × load levels × temperature × concurrency.
- Faults: short/overload/open-cable events to verify response and recovery policies.
- Outputs: success rate, retry counts, latch reasons, power telemetry snapshots, and pass/fail codes per port.
- Before/after: repeat the exact same sequence before and after surge/ESD stress to catch “half-alive” drift.
Under the test matrix, ports behave predictably: clean bring-up, stable sustain, and controlled derating or fault recovery with traceable logs. Random behavior usually indicates gaps in the matrix, missing observability, or uncontrolled coupling/thermal hotspots.
H2-11 · Field failures & troubleshooting map (symptom → root cause → fix)
A PoE “bad port” is rarely random. Most field issues can be closed quickly by forcing a disciplined loop: symptom → telemetry snapshot → one isolation test → targeted fix. This section provides a practical map that ties user-visible behavior to what the PSE is measuring and which protection/budget decisions are firing.
1) Fast “golden signals” to capture (per affected port)
- State: Idle / Detect / Class / Power-On / MPS-Monitor / Fault (and which fault code).
- Electrical: Vport, Iport, Pport (or peak I), and whether current limit/foldback is active.
- Negotiation: detected class (0–8), 2-event flag, and any LLDP-derived requested power (if enabled).
- Protection history: short-circuit/overload count, retry cadence (hiccup) vs latch-off, and last clear time.
- Thermal: PSE die temp (if available) + board hotspot sensor nearest the port bank.
2) Field isolation tests (no rework, high hit-rate)
- Swap test: same PD + same cable → different port (separates PD/cable vs port hardware/policy).
- Limit test: cap the port to a lower power (e.g., force Type 2 / lower class) and see if flapping disappears (thermal/budget).
- Cable stress: try known-good short cable vs long/thin cable (brownout + inrush false trips).
- PD behavior: test with a PD emulator (fixed class + controllable load) to remove “weird PD firmware” variables.
3) Symptom map (what users see → likely roots → fix actions)
| Symptom (field) | Fast check (telemetry) | Most likely root cause | Fix action (fastest first) |
|---|---|---|---|
| No power (PD never turns on) | Stuck at Detect/Class; “invalid signature”; class reads as 0 repeatedly | Cable pairs open/miswire; PD signature masked by front-end TVS/leakage; noise at detection | Try short known-good cable; reduce detection sensitivity/noise; review leakage paths; re-check center-tap injection wiring |
| Powers on then drops (seconds) | Trips right after power-on; inrush/OC flag; high peak I | Large PD input capacitance; inrush limit too aggressive; MOSFET/Sense heating quickly | Raise inrush limit/time (within spec); soften turn-on; reduce port limit; verify FET SOA & Rsense power |
| Port flapping (retries forever) | Hiccup retry counter increments; MPS drop events; periodic resets | MPS not met (PD low-power bursty); policy timer too strict; borderline cable drop | Adjust MPS window/threshold; enable LLDP/dynamic budgeting; cap max power; validate with PD emulator |
| Some ports drop at peak load | Total budget limit hit; allocator denies/steals power; priority triggers | Oversubscription policy; PSU limit; thermal derating coupled to budget manager | Set explicit priorities; reserve budget for critical ports; tune “shed order”; improve PSU headroom |
| High-temp shutdown | PSE die temp or board sensor crosses threshold; derate ladder active | Port density hotspots; inadequate airflow; FET/Rsense dissipation too high | Lower sustained per-port power; spread high-power ports; tighten derate ladder; add copper/heat path/airflow margin |
| Link unstable / detection becomes erratic | Errors after surge/ESD; detection misreads; common-mode noise signs | ESD damage (partial); TVS capacitance/placement; poor return path; magnetics coupling | Inspect RJ45 area placement; confirm low-cap TVS on data pairs; enforce return path rules; replace suspect port components |
H2-12 · BOM / IC selection checklist (criteria-based) + internal links
This checklist avoids “part-number dumping” by tying each block to measurable criteria (power, heat, protection behavior, and observability). Example parts are provided to make the BOM actionable; final choices should still be validated against the exact port count, Type (2-pair/4-pair), and thermal envelope.
A) PSE controller / manager ICs (multi-port)
- Criteria: IEEE 802.3af/at/bt coverage, 2-pair vs 4-pair topology, port count scaling, programmability of detection/class/MPS and fault policy, and telemetry granularity.
- Example parts (specific):
- TI TPS23880 — 4-pair, Type-4, 8-channel PoE PSE with programmable SRAM. :contentReference[oaicite:0]{index=0}
- TI TPS23882 / TPS23882B — 8-channel IEEE 802.3bt PSE family variants (Type-3, 2-pair emphasis depending on variant). :contentReference[oaicite:1]{index=1}
- Microchip PD69210 / PD69220 — Generation 6 PoE PSE controller family; datasheet explicitly notes PD69210/PD69220 are recommended for new designs. :contentReference[oaicite:2]{index=2}
- ADI LTC4291-1 + LTC4292 chipset — 4-port IEEE 802.3bt Type 3/4 PSE controller chipset. :contentReference[oaicite:3]{index=3}
B) Per-port power MOSFET (80–100 V class)
- Criteria: VDS margin (surge + ringing), RDS(on) vs thermal resistance, SOA under inrush + fault, package thermal path, and gate drive constraints from the PSE controller.
- Example parts (specific):
- Infineon BSC340N08NS3-G — 80 V N-channel MOSFET (SuperSO8 5×6). :contentReference[oaicite:4]{index=4}
- Vishay SiR880DP — 80 V N-channel MOSFET (PowerPAK). :contentReference[oaicite:5]{index=5}
- Vishay Si7456CDP — 100 V N-channel MOSFET (PowerPAK SO-8). :contentReference[oaicite:6]{index=6}
- Quick sanity rule: if a port repeatedly “works cold, fails hot,” MOSFET+Rsense dissipation and copper spreading are often the limiting factors before the controller is.
C) 54 V bus protection / hot-swap / eFuse (system input and/or zone feeds)
- Criteria: operating voltage headroom, programmable current limit, fault timing, reverse protection needs, and whether PMBus/I²C telemetry is required at the bus level.
- Example parts (specific):
- TI TPS2663 — 4.5-V to 60-V eFuse. :contentReference[oaicite:7]{index=7}
- TI LM5066I — 10-V to 80-V hot swap controller with improved I/V/P monitoring accuracy. :contentReference[oaicite:8]{index=8}
D) Metering / telemetry (what makes troubleshooting fast)
- Criteria: common-mode voltage range (54 V bus needs margin), accuracy vs bandwidth, alert thresholds, and how easily logs can be attributed to a specific port and timestamp.
- Example parts (specific):
- TI INA228 — 85-V, high-precision current/voltage/power monitor (useful for 54 V bus or zones). :contentReference[oaicite:9]{index=9}
E) Surge / ESD protection (RJ45 + data-pair safe, without killing signal integrity)
- Criteria: ultra-low capacitance on high-speed pairs, placement priority (closest to connector), and a clean return path to chassis/quiet reference to avoid detection false trips.
- Example parts (specific):
- Semtech RClamp0524P — ultra-low capacitance TVS array family for high-speed interfaces (useful pattern for data-pair protection when SI is critical). :contentReference[oaicite:10]{index=10}
- Littelfuse SM8S series — high-power TVS diode family often used on higher-energy rails (candidate pattern for 54 V bus transient clamping with correct voltage rating selection). :contentReference[oaicite:11]{index=11}
F) Point-of-load DC/DC (from 54 V to local rails)
- Criteria: max VIN (PoE bus can spike), efficiency at the expected load, EMI behavior, and protections (UV/OV/OC/thermal).
- Example parts (specific):
- TI LM5164 — 65-V synchronous buck converter (common building block for stepping down from higher buses). :contentReference[oaicite:12]{index=12}
G) Fan / thermal control (keeps “90 W” from collapsing under chassis reality)
- Criteria: number of fans, closed-loop RPM feedback, fault outputs, and integration with the derating ladder (temperature → power cap → shed).
- Example parts (specific):
- ADI MAX31790 — multi-channel fan controller (I²C). :contentReference[oaicite:13]{index=13}
Internal links (titles only; do not expand here)
- Telco Power & Sequencing (power-tree / sequencing context)
- Telco Site Env & Power Monitor (facility-grade sensing context)
- Timing Switch (PTP/SyncE) (time sync is separate; avoid mixing)
FAQs (PoE Switch PSE subsystem only)
These FAQs focus on the PoE PSE power plane: af/at/bt boundaries, detection/classification/MPS behavior, budget allocation, metering accuracy, protection/thermal derating, PoE-to-magnetics coupling, and production test. They intentionally avoid L2/L3 switching and non-PoE subsystems.