MES Software: Vendors, Features & Costs Compared 2026
MES software compared: vendors, functions per VDI 5600, costs (cloud vs. on-premise) and implementation. Honest market overview 2026.
Machine downtime is any period in which a machine scheduled to produce is not producing saleable output. It covers both unplanned stops (breakdowns, faults, material starvation, operator absence) and planned stops (changeovers, maintenance, inspections). Synonyms: machine stop, equipment downtime, Maschinenstillstand, Anlagenstillstand. In ISO 22400 terms it sits inside the Availability losses of OEE.
Downtime is measured against planned production time, not calendar time. From 30+ years wiring PLCs and retrofitting brownfield lines into MES systems, I've learned the hard part is never the stopwatch — it's deciding what counts as a stop, who assigns the reason, and whether microstops under 60 seconds are logged at all. Get those three rules wrong and your downtime number looks fine while the line loses 20% of its capacity.
Three terms that get mixed up in nearly every reason-code taxonomy I see. They belong in different OEE buckets and trigger different corrective actions — confusing them is the fastest way to optimise the wrong problem.
| Dimension | Machine Downtime (unplanned) | Microstop | Planned Downtime |
|---|---|---|---|
| Duration | Typically > 5 minutes | Usually < 5 minutes, often < 60 seconds | Any duration, scheduled in advance |
| OEE bucket | Availability loss | Performance loss (speed loss) | Excluded from OEE (in planned production time) |
| Typical cause | Breakdown, fault, material starvation | Jams, sensor trips, minor adjustments | Changeover, PM, shift handover |
| Reason-code required? | Yes, always | Auto-classified from PLC signals | Yes, but from master data (order type) |
| How to reduce | Root-cause analysis, maintenance strategy | Process stabilisation, SPC, training | SMED, predictive scheduling |
Rule of thumb on the shop floor: if no one wrote down why it stopped, it was a microstop. That is precisely why microstops stay invisible — and why they're usually the biggest hidden loss.
Across the retrofit and MES rollouts I've run since 2000, downtime causes cluster into five repeating categories — and the distribution is astonishingly stable across forging, beverage, assembly and plastics. Technical failures of mechanical, hydraulic or electrical components account for roughly 30–40% of unplanned downtime minutes in brownfield plants, usually bunched around aging drives, worn seals and PLC I/O modules. Material and logistics issues — empty upstream buffers, wrong material, late kit — contribute another 20–30%. Changeover and setup hide in plain sight: nominally planned, but the overrun is unplanned downtime. Operator-related stops (misoperation, breaks not covered, training gaps) sit at 10–15%. Quality events — in-line rejects triggering stops, adjustments, re-feeds — close the loop back to COPQ. Any plant reporting only "other" or "technical" for more than 15% of stop time has a reason-code problem, not a maintenance problem.
The formula is trivial: Downtime = Planned Production Time − Actual Running Time. The measurement is where it falls apart. Three rules I enforce on every rollout. First, the trigger comes from the machine, not from paper. A digital signal from the PLC — cycle pulse, safety circuit, spindle-run bit — defines running vs. stopped. Humans assign reason codes; they do not define the stop itself. Second, set a microstop threshold explicitly (60 or 120 seconds is standard) and log both sides separately. Third, reason codes belong in a two-level hierarchy: category (Technical, Material, Changeover, Operator, Quality) plus a specific code under it. Flat 40-item dropdowns get ignored. Once captured, downtime should feed straight into the Availability factor of OEE and into ISO 22400 KPIs like MTBF and MTTR without manual re-entry. If it's typed twice, it'll be inconsistent.
Standard industry ranges put unplanned downtime at €2,000–€15,000 per hour in discrete manufacturing and considerably higher in automotive tier-1 lines under JIT obligations. The number I use with customers is blunter: take the contribution margin per part, multiply by the line's nameplate rate, that's your cost per minute. A 30-parts-per-minute assembly line with €4 contribution margin loses €120 every minute it's down. A forging press on a high-value part can burn €500+ per minute once you add scrap from restart transients. The hidden multiplier is downstream: one upstream stop creates a starved downstream line, a late delivery, a premium-freight shipment, and an angry customer scorecard. When we quantify this honestly during rollouts, the business case for automatic downtime capture usually pays back in 8–14 weeks.
There is no single lever — there is a sequence. 1. Measure first. No initiative survives without a clean baseline. Connect the PLC, capture every stop automatically, let reason codes run for two to four weeks before changing anything. 2. Pareto the causes. In almost every plant, the top three reason codes account for 60–70% of unplanned minutes. 3. Fix the top two. Condition-based maintenance on the worst offender, process stabilisation on the second. Ignore the long tail until the top two are closed. 4. Industrialise the fixes. Move from reactive to preventive to predictive maintenance only where the data volume justifies it — most plants skip predictive and fail, because they never stabilised preventive. 5. Lock the gains. Visible dashboards at the line, shift-handover reviews tied to the same numbers, and reason-code discipline as part of the operator standard. That loop is what turns a downtime project into a downtime culture.
Hard-earned lesson from a Simatic S5→S7 retrofit on a beverage line: The customer reported 4% unplanned downtime and insisted their operators were meticulous. We tapped the PLC directly — no paper, no operator button — and captured every cycle pulse for two weeks. True unplanned downtime was 17%. The delta was almost entirely sub-90-second stops caused by a chronically misaligned cap feeder that operators had learned to nudge back into alignment every few minutes. Each stop felt "not worth logging." Aggregated, it was the largest loss on the line. We didn't need predictive AI or a maintenance programme — we needed one mechanical redesign of the feeder chute, costing less than €2,000. Output went up 11% in the next month. Principle: if downtime depends on human logging, you are measuring attention, not reality.
We see the same pattern across 15,000+ connected machines in 18 countries: automatic capture first, reason-code hierarchy second, corrective action third. Carcoustics' 500+ machines across seven countries stream cycle and stop signals via IXON IoT gateways and MQTT into Azure, cutting unplanned downtime by 4% and lifting availability by 8% in six months. Neoperl's fully automated assembly lines correlate PLC alarms directly with stops, delivering a 10% reduction in stops and 8% availability gain without any SPS programming change. Klocke scaled to all Weingarten packaging lines in three weeks using plain digital-I/O gateways — no LAN retrofit required on older equipment. The deployment pattern is deliberately boring: tap the signal at the source, classify it with one taxonomy, and let the numbers do the arguing. Zero customer churn in 2024 and ~150% SaaS growth say the pattern holds whether you're forging, bottling or moulding.
What's the difference between machine downtime and idle time?
Downtime is a scheduled production period in which the machine is not producing. Idle time is unplanned but outside scheduled production — e.g., a shift with no order. Idle time sits outside OEE; unplanned downtime sits inside it as an Availability loss. Mixing them inflates OEE and hides real losses.
Is changeover considered downtime?
Technically yes — the machine isn't producing. Operationally, scheduled changeovers are usually treated as planned downtime and either excluded from OEE or tracked separately. Any overrun beyond the standard changeover target becomes unplanned downtime and should be flagged for SMED work.
What's an acceptable level of unplanned downtime?
In world-class discrete manufacturing, unplanned downtime sits below 5% of planned production time. Mid-maturity plants land at 10–15%. Over 20% is a signal that measurement, maintenance strategy or material supply is structurally broken — not that operators need more training.
How does downtime relate to OEE?
Unplanned downtime directly reduces the Availability factor of OEE. A line running 6 hours unplanned-stopped out of a 16-hour scheduled shift has 62.5% Availability before Performance and Quality are even applied. Reducing downtime is usually the single biggest lever on OEE.
Can old machines without modern controls be monitored for downtime?
Yes, always. A digital I/O gateway wired to a single "machine running" signal — often already present as a lamp, relay or contactor — captures runtime and stops without any PLC programming change. I've connected equipment from 1990 this way; the machine doesn't need to be smart, the gateway does.
How long does it take to implement automatic downtime tracking?
On a single line with a modern PLC (S7 / TIA), 2–4 hours per machine for the gateway, plus configuration. A full plant of 20–40 machines typically takes 2–4 weeks end-to-end, including reason-code taxonomy workshops. Brownfield plants with mixed controls take longer on taxonomy than on technology.
Why does manual downtime logging fail?
Because operators are paid to run the line, not to document why it stopped. Under 60-second stops are almost never logged. Reason codes get approximated. By shift end, memory has overwritten detail. Every manual-logging plant I've audited under-reports downtime by 30–60%. It's not dishonesty — it's human bandwidth.
How does SYMESTIC track machine downtime?
SYMESTIC taps machine signals via OPC UA, digital I/O or MQTT gateways, auto-detects every stop above a configurable microstop threshold, and prompts operators to assign reason codes from a two-level taxonomy on the shop-floor client. Data flows live into Production Metrics dashboards and into Alarms when PLC-level events are correlated. Go-live on a line is typically days, not months.
Related: OEE · MES · MTBF · MTTR · Predictive Maintenance · Cost of Poor Quality · Machine Data Capture (MDE) · Alarms · Production Metrics · OEE Software.
MES software compared: vendors, functions per VDI 5600, costs (cloud vs. on-premise) and implementation. Honest market overview 2026.
OEE software captures availability, performance & quality automatically in real time. Vendor comparison, costs & case studies. 30-day free trial.
MES (Manufacturing Execution System): Functions per VDI 5600, architectures, costs and real-world results. With implementation data from 15,000+ machines.