Industries9 min read

Industrial Automation Boards: Designing for 10-Year Field Life

Capacitor selection, conformal coating choice, and the burn-in protocols that separate boards that last 18 months from boards that run a decade.


Framing the 10-Year Field Life Problem

Industrial automation boards live a different life than consumer electronics. A PLC backplane, a VFD control board, a CNC servo controller — these run 24 hours a day for years, often in environments that would kill a consumer device inside a month. The expectation from end customers is straightforward: 10 years of continuous operation with mean-time-between-failure measured in the high tens of thousands of hours. The PCB manufacturer's job is to make sure nothing on the board's bill of materials limits that life envelope below what the system is being sold against.

What the environment looks like

  • Ambient temperature — 0 to +60 °C inside a control cabinet, often higher near drive electronics. Internal board temperature can run 20 °C above ambient.
  • Vibration — 1–10 G low-frequency on equipment adjacent to motors and pumps, with occasional shock events.
  • Humidity — non-condensing 5–95% in tropical installations; conformal coating expected.
  • Dust and contamination — conductive metal dust near machining centres, oil mist near hydraulic equipment, agricultural dust in food-processing plants.

"A consumer-grade VFD board will run beautifully for 18 months. Then a single electrolytic capacitor drifts out of spec, a regulator under-volts a microcontroller, and a paper mill stops for an hour. The cost of that hour is more than the lifetime BOM of the board. Designing for 10 years is cheaper than designing for 2." — Pioneer Horizon automation programme manager

What follows is the working framework we use to take a commercial automation design and harden it for a real industrial deployment, organised around the four component classes that fail first.

Electrolytic Capacitor Selection — The First Failure Mode

Aluminium electrolytic capacitors are the dominant failure mode on long-running industrial boards. They dry out. The electrolyte vapour pressure rises with temperature, the seal eventually allows escape, and ESR climbs until the part either fails open or fails to deliver ripple-current suppression. The Arrhenius rule of thumb is that life halves for every 10 °C of operating temperature rise — meaning a 105 °C-rated cap rated for 5000 hours at 105 °C delivers roughly 80,000 hours at 65 °C. That sounds like plenty until you account for derating, ripple-current heating, and the fact that part-to-part variation can pull the bottom of the distribution down sharply.

How we specify electrolytics for 10-year service

  • Temperature rating — 105 °C minimum, 125 °C preferred on rectifier and bus capacitors.
  • Endurance rating — 5000 hours at rated temperature as a floor; 10,000 hours where we can fit them.
  • Voltage derating — 80% of rated voltage as the working point, with 60% on parts that see transients.
  • Ripple-current headroom — calculated rather than assumed, with at least 2× margin on the rated ripple at the actual operating temperature.

When polymer or hybrid is worth the swap

Polymer electrolytics and hybrid polymer-aluminium parts cost 3–5× the price of standard aluminium electrolytic but have no liquid electrolyte to dry out. For point-of-load decoupling and tight-space applications, the cost difference vanishes against the field-failure cost. We typically specify polymer for any bus capacitor under a hot regulator or near a thermal source, and standard wet electrolytic for bulk applications where there is room to over-size and run cool.

Layout for thermal de-coupling

An aluminium electrolytic 5 mm from a hot inductor will see 30–40 °C above board ambient. Moving the part 15 mm away, or interposing a thermal slot in the copper pour, can drop that delta to 8–10 °C. The layout matters as much as the part choice; we run thermal IR scans on every prototype automation board to find the hot spots before the design freezes.

Conformal Coating Choice and Application

Coating is not optional on automation boards bound for a real shop floor. The question is which chemistry, how thick, and how it gets applied. We split the decision matrix four ways.

Acrylic (Type AR)

The default for moderate environments. Easy to apply, easy to rework, transparent for UV inspection, and inexpensive. The downside is poor solvent resistance — acrylic coating dissolves in many cleaning fluids used in industrial environments, which limits its life in food processing and pharmaceutical lines.

Urethane (Type UR)

Better chemical resistance than acrylic, harder surface, longer service life under abrasion. The cost is rework difficulty; once cured, urethane is hard to remove without damaging components. We specify urethane for boards in oil-mist or solvent-exposure environments, with the understanding that field repair is by board swap, not local rework.

Silicone (Type SR)

Wide temperature range, excellent flexibility, good vibration damping. The downside is poor abrasion resistance and a tendency to creep — silicone coating can migrate over time, exposing pads at edges. We use silicone where the board sees extreme thermal cycling, especially on outdoor automation enclosures with ±50 °C ambient swings.

Parylene

The premium choice — vacuum-deposited polymer, pinhole-free, conformal at the atomic scale. Cost is 5–10× selective spray coating. We specify parylene only where the customer's reliability budget justifies it, typically on safety-rated industrial controllers or pharmaceutical-grade automation.

Application discipline

  • Selective coating with programmed keepout zones for connectors, jumpers, and test points.
  • Two coats minimum, with intercoat cure verified.
  • UV-inspection per board, photographed and archived.
  • Edge bead controlled at minimum 1 mm to prevent peel-back at corners.

The most common coating failure we see in the field is not a coating-chemistry failure; it is a coverage failure where an operator missed a region or a keepout was placed wrong. The inspection step matters more than the chemistry choice.

Burn-In and Early-Life-Failure Screening

The classic reliability bathtub curve says infant mortality happens in the first 100–1000 hours of operation. For a board headed into a 10-year field deployment, the cost of catching those failures on our floor — at unit cost in the tens of dollars to test — is dramatically lower than catching them six weeks into the customer's installation, when a failure pulls a process line down. We screen automation boards through a burn-in protocol calibrated to the application.

The four screening levels we run

  1. Level 0 — Functional test only. Default for non-critical commercial boards. Catches gross assembly defects but not latent failures.
  2. Level 1 — 24-hour powered burn-in at 25 °C. Catches early electrical defects without thermal stress. Adds roughly $4 per board in oven time and electricity.
  3. Level 2 — 48-hour powered burn-in at 70 °C, with functional cycling. The standard screen for industrial automation. Pulls in infant-mortality failures on capacitors, semiconductors, and connector contact resistance. Adds roughly $12 per board.
  4. Level 3 — Temperature-cycled burn-in, -20 to +70 °C, 100 cycles with functional verification at each extreme. Reserved for safety-rated boards or applications with extreme thermal swings. Adds $40–80 per board.

What we look for during burn-in

  • Parametric drift — voltage references that shift more than 0.5% during the burn-in are flagged.
  • Communication errors — any retry on an industrial bus (CAN, RS-485) during the burn-in is logged.
  • Current consumption — a board drawing 5% more current at hour 24 than at hour 4 is suspect.
  • LED indicators and outputs — every state transition cycled and verified.

Yield expectations at each level

Across our recent automation builds, Level 2 burn-in is rejecting roughly 0.3–0.8% of units after they have passed functional test. That is the infant-mortality population. Skipping the burn-in does not make those units pass in the field; it sends them to a customer who finds them with a stopped line.

For the broader bring-up and validation framing, see our walk-through of the prototype-to-mass-production gate process — burn-in is gate 6 of 7 in that framework.

Closing the Loop With Field Data

Designing for 10 years is incomplete without a feedback loop from the field. Boards in service generate failure data — directly through returns, indirectly through customer service records, ambiently through warranty cost. The teams that build the most reliable industrial boards are the ones that build that loop deliberately rather than wait for the warranty department to escalate.

Three loops we maintain

  1. Return analysis. Every returned board from a customer is logged into our traceability system against the panel ID, then assigned a failure-mode classification (capacitor wear, solder joint crack, semiconductor breakdown, connector wear, mechanical damage, no-fault-found). Quarterly, we review the distribution and look for new failure modes appearing — that is the early-warning signal for a process drift.
  2. Customer service logs. Not every failure comes back as a returned board. Many are diagnosed in the field and only show up as a service ticket. We ask customers, when contractually possible, to share anonymised service data so we can correlate field hours with our process records.
  3. Burn-in trend tracking. The percentage of boards rejected at burn-in is a leading indicator. A run where burn-in rejects rise from 0.4% to 1.2% is telling us something has changed on the line — a paste batch, a reflow profile, a component lot — and the same change will likely show up in the field as elevated returns six to twelve months later.

What the data tells us to change

The most common findings from the loop are not exotic. Electrolytic capacitor wear-out coming in earlier than expected drives a part-grade upgrade. A spike in connector contact failures drives a switch from tin to gold plating on critical I/O. A pattern of solder joint cracks under a specific large component drives a thermal-relief redesign or an underfill addition. None of these changes are obvious from first principles; they emerge from the data.

If you have an automation product in the field today and want to walk through the failure data with engineers who design the boards rather than the systems, share your return data with our reliability team. We will return a Pareto and a short list of design-side changes likely to bend the curve, inside two working weeks.

Chat on WhatsApp