A system is reliable when it rarely fails. No system can be made completely failure-free.
Metrics
MTTF
Mean Time To Failure. The statistical average time from first use until the first failure of a non-repairable component or system.
where is the reliability function (probability of surviving to time ). For a constant failure rate :
Used for components that are discarded on failure: sensors, MCUs, batteries, LEDs.
MTTR
Mean Time To Repair. Considered for repairable systems such as servers and industrial machines.
MTBF
Mean Time Between Failures. The statistical average time between successive failures of a repairable system.
For systems where repair time is negligible compared to operating time, MTBF MTTF.
Used for systems that are repaired and returned to service: servers, industrial machines, repairable IoT gateways.
Number of Components vs. Metrics
When independent components are connected in series (the system fails if any one fails), their failure rates add:
Since , the system MTTF is:
is always shorter than the shortest individual MTTF. Adding more components strictly decreases system reliability.
For identical components each with MTTF :
Doubling the component count halves the system MTTF.
For components in parallel (redundant: system survives until all fail), reliability improves. With identical components each with failure rate , the system survives until the last one fails:
For two identical components in parallel: .
Component Count Rule
Every additional component added to an IoT design has a cost on multiple axes simultaneously. Minimising component count is therefore a primary design objective, not merely a cost-cutting measure.
Cost
- Bill of materials
Each component has a unit price. At volume, even a $0.05 passive adds thousands of dollars across a production run. - PCB area
More components require a larger board or finer pitch routing. Both increase fabrication cost and constrain enclosure options. - Assembly
Each component is a placement operation on the pick-and-place machine. More placements mean longer cycle time and higher assembly cost per unit. - Testing
Every component is a potential failure point during manufacturing test. More components increase test fixture complexity and time. - Supply chain
Each distinct part number is a procurement dependency. Shortages, end-of-life notices, or single-source suppliers create supply risk.
Points of Failure
Components in series multiply failure risk. If each component has reliability (probability of surviving a given period), the system reliability is:
Each , so strictly decreases with every added component. Equivalently, system MTTF for series components satisfies:
is always shorter than the shortest individual MTTF.
IoT devices are often deployed in locations with no on-site maintenance. A field failure requires a technician visit or a full device recall. The cost of a single field failure (labour, logistics, customer impact) vastly exceeds the cost saved by adding the component in the first place.
Power Consumption
Every component draws idle current. In battery-powered IoT devices, the sum of all idle currents determines battery life. Removing one component that draws continuously can extend battery life by weeks in a low-duty-cycle device.
Design Complexity
More components introduce more potential interaction effects: coupling, noise, voltage drop, thermal interference. Each interaction is a debugging surface. Fewer components means fewer failure modes to analyse and certify.