←

Peak Shaving: Load Management & Real-Time Control

By Mark Kobbert · Last updated: April 2026

What peak shaving actually is — and why the engineering problem is not "turn machines off" but "predict the next 180 seconds well enough to decide which ones"

Peak shaving is the discipline of keeping a plant's measured power draw below a chosen threshold across the rolling 15-minute intervals that the utility uses to set the annual demand charge. That description makes it sound like a scheduling problem. It is not. Underneath the scheduling sits a tight real-time control loop with an unforgiving latency budget: measurement under one second, forecast under five seconds, actuation under thirty seconds — because the 15-minute window closes whether your system is ready or not, and a single missed window in January can set the demand-charge baseline for the following twelve months. This article is the engineering view of that control loop — the measurement stack, the forecasting horizon, the actuation hierarchy, the edge-vs-cloud split, and the five failure modes that separate working peak-shaving deployments from the deployments that produce nice dashboards while the annual peak marches upward anyway.

I have spent the better part of the last decade building the real-time data pipeline that SYMESTIC uses to stream machine and energy telemetry from more than 15,000 production assets across 18 countries into a central platform and back out as control decisions. Peak shaving sits at the intersection of three things I spend most of my engineering time on: industrial signal acquisition (how do you get a Modbus register value off a substation meter with sub-second freshness), real-time stream processing (how do you compute a rolling-window power average across a few thousand measurement points without drifting), and actuation integration (how do you write a setpoint or a shed command back to the plant without killing production). The rest of the article is the architecture view of how those three pieces have to fit together for the economics to work.

The 15-minute window tax — why the utility's billing interval is the thing your architecture has to solve for

Every industrial electricity tariff in Germany, and most equivalents in Europe and North America, settles the demand charge on a 15-minute average of active power. The meter integrates your instantaneous power draw over each quarter-hour, publishes the average, and the highest such average across the entire calendar year sets the demand-charge basis for the next twelve months. This interval length is not decorative — it is the single most important number in the peak-shaving architecture, because it defines what a "peak" actually is and what it is not.

What the 15-min window does	Consequence for the architecture
Averages across the interval. A 5-second spike to 4 MW followed by 14 minutes 55 seconds at 1 MW produces an average barely above 1 MW.	Short motor inrush currents and sub-second transients are not the enemy. The enemy is sustained elevated power lasting minutes.
Resets on interval boundary. The window starts at :00, :15, :30, :45 and closes at the next boundary.	The control loop must know where it is inside the window — a load spike at :00 has 15 minutes to be diluted; a spike at :14 has 60 seconds.
Takes the annual maximum. One bad interval in 35,040 (15-min slots/year) sets the bill.	The system must work every interval of every day, not on average. One missed Monday in January is the full-year penalty.
Locks in for 12 months. The peak becomes the billing basis for the next calendar year.	A mistake is expensive by definition. The annualised cost per kW in Germany currently sits between €50 and €150 — typically around €75 for mid-voltage industrial.

The pattern I call The 15-Minute Window Tax is the structural cost that this billing mechanic imposes on every industrial electricity consumer. It is not a tax in the legal sense; it is the consequence of the asymmetric mapping between one bad interval and twelve months of elevated charges. A plant with a 2 MW typical peak that slips to 3 MW once in January at a demand charge of €75/kW pays an additional €75,000 for the rest of the year — for a 15-minute event that may have been a single unfortunate overlap of three start-up sequences on a Monday morning. The architecture's job is to ensure that interval never happens.

The control loop — measurement, forecast, actuation, and the latency budget that ties them together

A working peak-shaving system is a closed-loop controller. It continuously measures current power draw, projects where the 15-minute average will land given the current trajectory, and takes action when the projection exceeds a configured threshold. The whole loop has to close inside a latency budget that is determined by how close the window is to its boundary — and the nearer the boundary, the tighter the budget.

Stage	Target latency	Typical implementation	Failure mode if violated
Measurement acquisition	< 1 s	Modbus TCP poll of utility-grade meter (Janitza, A. Eberle, Siemens Sentron), 1 Hz sample rate, published to MQTT broker.	Forecast runs on stale data; control loop reacts to the event after it has happened.
Rolling-window integration	< 2 s	Streaming aggregation at the edge gateway — typically an Azure Stream Analytics edge module or equivalent — computing running 15-min average on the current window.	Drift between meter's own interval average and system's computed average; decisions fight the utility bill.
Forecast of interval end-value	< 5 s	Extrapolation of remaining window given current draw, with scheduled-load overlay (which machines are planned to start in the next 5 min) and confidence interval.	System alarms at :14:30 when the window is already unrecoverable; no actuation time.
Actuation	< 30 s	Write setpoint/shed command back through the gateway to PLC or BMS; confirmation read-back on next cycle.	Actuation arrives after interval close; peak is already logged by meter.

Each of these stages has a concrete engineering choice behind it, and each choice has consequences that show up six months later as either a working system or an expensive ornament. The full loop target — under 40 seconds from meter reading to actuated command — is achievable on commodity infrastructure but requires that the forecast and decision logic run at the edge, not in the cloud. This is not a cloud-vs-cloud religious argument; it is the consequence of the round-trip latency between a German plant and a European Azure region being non-negligible and variable, and the loop having to close deterministically even when the wide-area network degrades.

The edge-vs-cloud split — what belongs on the gateway and what belongs in the cloud

Peak shaving is the cleanest example I know of a hybrid edge-cloud control problem, and it is the architecture decision that most vendor implementations get wrong. The temptation is to run everything in the cloud — it is easier to develop, easier to monitor, easier to update. It is also demonstrably the wrong answer for the sub-30-second actuation loop. The rule I have applied consistently across SYMESTIC deployments since 2019:

Layer	What runs there	Why
Edge gateway (on-prem, per plant)	Meter polling, window integration, 60-second forecast, threshold check, actuation dispatch, local load-shed hierarchy.	Hard real-time loop. Must work during WAN outage. Latency budget does not tolerate a round-trip to the cloud.
Cloud platform (SYMESTIC Cloud MES)	Historical analytics, pattern recognition, model training for forecast, cross-plant benchmarking, regulatory reporting, dashboard.	Soft-real-time and batch workloads. Benefits from cross-plant data and elastic compute. Latency-tolerant.
Bidirectional sync	Production schedule → edge (for forecast overlay); forecast outputs + events → cloud (for analytics); updated forecast model → edge (weekly).	Cloud enriches edge decisions with learned patterns; edge reports back for continuous improvement.

The anti-pattern I call The Forecast Horizon Gap is what happens when the forecast lives in the cloud: the system sees the peak coming, the message takes 4–12 seconds to reach the cloud, the forecast runs in 2 seconds, the decision takes 3 seconds, the actuation command takes another 4–10 seconds to get back to the PLC. Total: somewhere between 13 and 27 seconds — but only when the network is healthy. Add a transient packet loss or a cloud region failover and the budget collapses entirely. Every peak-shaving deployment I have audited that ran the forecast in the cloud had a shadow pattern of "almost-missed" intervals where the actuation landed in the next 15-minute window. The fix is always the same: push the forecast and the immediate decision to the edge, and keep the cloud for the things it is actually good at.

The simultaneous startup cascade — the single most common peak cause, and why staggering is not as simple as it looks

The pattern I have named The Simultaneous Startup Cascade is the failure mode that causes somewhere between 50 and 70 percent of annual peaks across the SYMESTIC customer base, based on the aggregate telemetry I have reviewed for the past three years. The mechanism is always the same: a shift break or a weekend ends, a dozen or more assets come out of idle or power-down within the same 15-minute window, each with its own warm-up or inrush profile, and the aggregate 15-minute average lands at 1.5× to 2× the normal operating baseline.

The naive response — "stagger the start times" — is correct in direction and insufficient in depth. A working stagger policy has to account for four things at once:

Warm-up duration per asset class. An injection moulding machine's heating cycle is 15–45 minutes of elevated draw. A press's oil-temperature conditioning is 8–20 minutes. A compressed-air compressor's inrush is 30–90 seconds followed by steady-state. Staggering with a uniform 2-minute gap ignores these durations and clusters the sustained warm-up loads inside the same window anyway.
Production priority. The stagger policy cannot treat all assets as equal. Machines on the critical path for the shift's first orders must start earliest; non-critical assets can be delayed 10–20 minutes without operational impact.
Current window position. Starting a warm-up cycle at :01 gives the system 14 minutes to dilute the load; starting at :14 gives it 60 seconds and usually forces a shed. The scheduler must know the current window phase and use it.
Live measurement feedback. The stagger plan is a forecast; the meter is the ground truth. The scheduler must adapt in real time when actual draw deviates from planned — because the first month of operation always reveals that the machine nameplates were wrong.

A MES-integrated startup scheduler that handles these four dimensions is the structural solution. A spreadsheet handed to the shift supervisor is not. The difference, measured across the customer base, is typically a 20–40 percent reduction in annual peak for the spreadsheet version and a 50–70 percent reduction for the MES-integrated version — which, in euro terms for a 2 MW plant at €75/kW, is roughly €30–€45k/year vs. €75–€105k/year in avoided demand charge.

The load-shed tier hierarchy — four priority tiers and the maximum-outage-time per tier

When forecast meets threshold and the stagger schedule has nothing left to stagger, the system has to shed load. Load shedding is the second line of defence, and it is the feature that most home-grown implementations either do not have or have only in a crude, single-tier form that ends up shedding things that cannot safely be shed. The structured version I design into every peak-shaving deployment uses four tiers with explicit maximum-outage criteria per tier:

Tier	Load class	Typical examples	Max outage	Shed order
Tier 1	Discretionary comfort loads	HVAC setpoint shift (not full shutoff), warehouse lighting zones, EV charging stations.	10–30 min	First — zero production impact.
Tier 2	Thermal-buffered processes	Chillers with cold-buffer tanks, hot-water tanks with storage, cooling-tower fans (not chillers).	5–15 min	Second — buffered capacity absorbs the outage.
Tier 3	Deferrable production	Non-bottleneck machines with work-in-progress buffer, batch processes with flexible start.	2–10 min	Third — controlled production deferral.
Tier 4	Critical production	Bottleneck machines, running orders, safety-related loads.	Never shed automatically	Only operator-initiated emergency; accept the peak instead.

The positive pattern I call The Load-Shed Tier Hierarchy is what lets a system absorb unexpected load events without stopping the plant. The critical discipline is that Tier 4 must never be part of automated shedding. The MES has the authority to reduce HVAC by 2 °C and pause the warehouse's LED zones; it does not have the authority to stop the press that is running the customer's priority order. This sounds obvious; it is regularly violated in practice, because the automation logic is written by people who think in energy units rather than in production units. Every load-shed decision must know which tier it is shedding from and stop at the Tier 3–4 boundary.

The phantom baseload — why idle consumption is the peak-shaving win that nobody budgets for

Most peak-shaving conversations focus on the spike above the baseline. The quieter, structurally larger opportunity is the baseline itself — the sustained power draw when the plant is "not producing" that nevertheless shows up on the meter. The pattern I call The Phantom Baseload is the 30–60 percent of nominal draw that plants routinely carry during unmanned hours, weekends, and plant shutdown periods, made up of:

Compressed-air leaks. The compressor cycles on/off overnight despite no production — because the pneumatic system leaks enough to keep the pressure falling. A 10 kW compressor running 50 % of the weekend is 2 MWh of drawn energy for no output.
HVAC serving empty buildings. Production hall conditioned to full setpoint over the weekend. Easy to see in the data; hard to see without the data.
Machines in "idle" that are actually warming. Injection moulding machines left on to "avoid warm-up Monday" — net energy effect is typically 3–5× worse than a clean restart.
Auxiliary equipment with no shutdown logic. Exhaust fans, conveyor motors, and lighting that were never connected to any scheduling system and simply run.

The phantom baseload matters for peak shaving specifically because it raises the entire operating curve — and a raised operating curve means every start-up event lands closer to the threshold than it needs to. Plants that address the phantom baseload systematically (typically through MES-integrated metering at sub-circuit level, pattern detection on nights-and-weekends consumption, and scheduled shutdown automation) see their entire peak-shaving problem become easier by 10–20 percent, without any change to the peak itself. This is not a peak-shaving technique in the classical sense; it is the thing that makes peak shaving work in the first place.

The measurement stack — what the meter actually has to deliver, and which protocols get you there

Every peak-shaving deployment starts and ends with the quality of the measurement. The meter is non-negotiable: it must be utility-grade (Class 0.5 or better), must publish 1-second active-power readings, and must be accessible over a standard industrial protocol. The three protocols I see in 95 percent of German and European deployments:

Protocol	Typical use	Strengths	Weaknesses
Modbus TCP	De-facto standard for industrial power meters (Janitza UMG series, A. Eberle PQ-Box, Siemens Sentron PAC).	Simple, universal, sub-second polling trivial.	No built-in security; no standard register layout (every vendor different).
OPC UA	Modern automation convergence; meters with OPC UA server built-in.	Typed data model; built-in security; standardised information model for power quality (via companion specifications).	Heavier protocol; fewer meters support it natively; higher gateway cost.
IEC 61850	Utility-substation-grade; mandatory for MV/HV substations in EU.	Full substation model; GOOSE for fast signalling; regulatory-grade.	Complex; high integration cost; overkill for most factory-floor deployments.
MQTT (layered on top)	Transport from edge gateway to MES/cloud. Meter itself rarely speaks MQTT natively.	Lightweight; pub/sub; firewall-friendly.	Transport only; requires upstream protocol translation at the gateway.

The practical stack for 90 percent of mid-market German plants I have commissioned: Modbus TCP polling at the meter, translation and rolling-window aggregation at the edge gateway, MQTT publication to the cloud broker. This is not fancy; it is what works under real network conditions, with real meters, at real deployment cost. The fancy stack — OPC UA end-to-end with IEC 61850 at the substation — makes sense for large plants with their own MV substation and a sophisticated automation team; it is expensive overhead for a 2 MW plant with three production halls.

The MES integration — why peak shaving without production context does not work

A peak-shaving system that only sees the meter is a controller with a blindfold. It knows the current power draw; it does not know whether that draw is about to double because the second-shift changeover is starting, or drop by 40 percent because the press line is about to change orders. The forecast is only as good as the forward-looking context it has, and the forward-looking context lives in the MES. Three specific integration points turn a meter-only system into a real peak-shaving control loop:

Production schedule lookahead. The next 30 minutes of scheduled starts and stops from the MES dispatch module feed directly into the edge forecast. When the scheduler knows at 05:45 that nine machines are queued to come up at 06:00, it can plan a stagger policy that starts the first two at 05:58 and the last three at 06:18, smearing the warm-up across two windows instead of concentrating it in one.
Asset-level power profile. The MES stores, per machine, the observed warm-up profile (power vs. time from cold start) and the steady-state range. The forecast uses these profiles to project interval end-value, not a generic estimate.
Bottleneck awareness. The MES knows which machine on which line is the current constraint. The load-shed tier logic uses this directly — a machine that was Tier 3 yesterday may be Tier 4 today if it has become the bottleneck for the running order. Static tier assignment is a common mistake; dynamic tier assignment based on current production state is what separates a production-aware controller from a pure energy controller.

This integration is also the reason why peak shaving belongs in the MES and not in a standalone energy management system (EMS). A standalone EMS sees the meter; it does not see the schedule, the asset profiles, or the bottleneck. It can produce reports and trigger alarms; it cannot run the closed-loop forecast-and-actuation cycle that actually moves the annual peak. I have been in enough customer situations where a standalone EMS was deployed first and then migrated into the MES 18–24 months later, at the cost of rebuilding the integration twice, that I treat the question "EMS or MES for peak shaving?" as answered in advance: MES, from the beginning. The EMS can be a component; it cannot be the owner of the control loop.

From a mid-size plastics injection moulding plant, January 2022: The customer was a third-generation family-owned injection moulding operation in North Rhine-Westphalia, about 180 employees, 14 moulding machines (mix of 80-ton and 400-ton presses, mostly Arburg and Engel), plus three screw compressors and a hall heating system on electric boilers for the winter months. They had been a SYMESTIC customer since 2020 for OEE and production monitoring; energy monitoring was not yet part of the installation. The plant manager called me on a Tuesday in mid-January, second week back from the Christmas shutdown, with a question I had not heard in exactly that form before: he had just received a letter from his utility informing him that his 2022 demand-charge basis had been set at 3,820 kW, based on a measurement from Monday January 10th between 06:00 and 06:15, and asking him whether he wanted to discuss grid-fee optimisation options with their account team. His normal operating peak was 2,050 kW. The mid-January letter meant he would be paying the full-year demand charge on 3,820 kW at roughly €75/kW, or approximately €286,500 for 2022 against a normal-year figure of €154,000 — an extra €132,500 for a single 15-minute interval that he had not been aware of at the time it happened. We pulled the meter archive together with his production data and reconstructed the window. At 06:00 that morning, first shift returned from the Christmas shutdown period. The hall heating system had been set to resume full setpoint at 05:30 — a 420 kW electric boiler load that had been ramping for 30 minutes and was still drawing full power at 06:00 because the hall was down to 8 °C and the setpoint was 18 °C. At 06:00 exactly, 12 of the 14 moulding machines started their warm-up cycles simultaneously — the customary procedure, since "we always start at 06:00" was the cultural assumption. Six of those twelve were 400-ton machines with 180–260 kW warm-up draw each for the first 15–25 minutes. At 06:03, the two larger screw compressors started their no-load-then-loaded cycle, another 160 kW combined. Three individual events — none of them a failure, all of them routine operational practice — overlapped inside a single 15-minute window, and the integrated average came out at 3,820 kW. The meter did its job; the accounting system did its job; the letter arrived two weeks later, correctly. The fix we implemented in the first quarter of 2022 was fundamentally a software problem masquerading as an energy-engineering problem. We connected the existing Janitza UMG 508 meters at the main supply point to the SYMESTIC edge gateway via Modbus TCP at 1-second polling, built a rolling-window aggregator that tracked the current 15-minute average against a configurable threshold (set initially at 2,400 kW), and wrote an MES-integrated startup scheduler that staggered the 14 moulding machines over an 8-minute window instead of starting them all at :00. Critically, we also wired the hall heating system into a Tier 1 load-shed group — the heating could cycle off for up to 15 minutes without the hall dropping below 15 °C given its thermal mass. The results across 2023 were, on reflection, much less dramatic than they should have been given the magnitude of the 2022 event. The annual peak for 2023 came in at 2,420 kW — slightly above the 2,050 kW normal operating baseline, but a full 1,400 kW below the 2022 disaster peak. The demand-charge bill for 2023 dropped to approximately €181,500, a saving of €105,000 against the 2022 cost. The customer's CFO made a point of telling me, during the following year's renewal conversation, that the peak-shaving module had paid for the entire SYMESTIC platform licence for five years in its first year of operation. The lesson I have repeated to every customer since is that peak shaving is not really about the peak — it is about the minute-by-minute control loop that makes sure the peak cannot happen. A system that alerts you the day after a peak has been recorded is an accounting system, not a control system. The difference between the two, in financial terms, is six figures per year at a mid-market plant, and once the lesson has been paid for once, it does not need to be paid for again.

The regulatory dimension — §14a EnWG, demand response, and the flexibility dividend

The German regulatory environment around industrial electricity is shifting materially in 2026, and the shift is directionally favourable to plants that have already built real-time control over their load. Three specific changes matter for peak shaving architecture decisions being made this year:

§14a EnWG (revised 2024–2026) obligates controllable consumer assets above 4.2 kW to be capable of responsive control by the distribution network operator, in exchange for reduced grid fees. Industrial peak-shaving systems that are already MES-integrated and capable of automated response are in a structurally easier position to opt in.
Dynamic grid fees (§19 StromNEV) already offer reduced charges for atypical consumption patterns — plants whose peak draw is offset from the regional grid peak receive up to 80 percent reduction on the demand-charge component. A peak-shaving system that knows the regional grid profile can actively schedule heavy loads to fall outside peak hours.
Demand-response markets (PRL, SRL, MRL) — the primary, secondary, and tertiary balancing reserve markets — compensate industrial consumers for the ability to shed load on short notice. Participation requires sub-minute controllability of at least 1 MW of switchable load. A plant with a well-built load-shed tier hierarchy already has 60–80 percent of this capability in place.

The structural pattern across these three mechanisms is what I call The Demand Response Dividend. The infrastructure a plant builds for its own peak shaving — real-time measurement, edge-based forecasting, tiered load-shed, automated actuation — is substantially the same infrastructure required to participate in the regulatory and market mechanisms that pay for load flexibility. Plants that build peak shaving correctly are, by the structure of the technology, eligible for the grid-fee discounts and the demand-response revenue streams that are being rolled out through 2026–2028. Plants that bolt on a reactive alerting system are not. This is a strategic point, not a technical one: the decision to invest in the edge-based control loop vs. the cloud-based reporting dashboard has implications that extend beyond the peak-shaving bill itself.

The eight disciplines of a working peak-shaving implementation

#	Discipline	Test
1	Utility-grade meter at 1 Hz, published to edge in < 1 s.	Dashboard latency between instantaneous meter display and edge value < 1 second at p99.
2	Rolling 15-minute window integrated at edge.	Edge-computed average matches meter's own interval average within ±0.5 % for every window across a 7-day audit.
3	Forecast with MES schedule overlay and confidence interval.	Forecast end-of-window value published every 10 seconds, with known accuracy band tracked in dashboard.
4	Actuation loop < 30 seconds, edge-resident.	Threshold-to-actuated-command latency audited weekly; no dependency on WAN connectivity for critical shed.
5	Four-tier load-shed hierarchy with production-aware Tier 4 protection.	No automated shed event ever reaches a machine flagged as current bottleneck; override requires operator intent.
6	MES-integrated startup scheduler.	Machine start events staggered according to warm-up profile and production priority; shift supervisor sees plan in advance.
7	Phantom-baseload monitoring.	Off-shift power draw tracked with weekly anomaly reporting; leaks, idle loads, and scheduling gaps flagged.
8	Regulatory-ready data model.	Measurement and event archive retained in a form that supports §14a, §19 StromNEV, and demand-response market participation.

FAQ

What is peak shaving in one sentence?
Peak shaving is the closed-loop control of a plant's power draw against the utility's rolling 15-minute demand window, combining sub-second meter polling, edge-based forecast, tiered load-shed, and MES-integrated startup scheduling to prevent the annual peak that sets the next twelve months of demand-charge billing.

Why does the 15-minute window matter so much?
Because the utility bills demand charges on the highest 15-minute average active power across the entire calendar year, and that single interval locks in the charge for the following twelve months. One bad Monday morning in January at €75/kW on 1,000 kW of excess peak costs €75,000 — for a 15-minute event. The asymmetry between one bad interval and twelve months of penalty is what makes peak shaving a real-time control problem rather than a scheduling problem.

What is the latency budget for the control loop?
Under 40 seconds total, from meter reading to actuated command, consistently. Specifically: measurement acquisition under 1 second, rolling-window integration under 2 seconds, forecast under 5 seconds, actuation under 30 seconds. Deployments that run the forecast in the cloud routinely miss this budget during network degradation and produce the "almost-missed interval" pattern — actuation lands in the next window instead of the current one. The fix is architectural: forecast and immediate decision at the edge, not in the cloud.

Does peak shaving work without MES integration?
Partially. A meter-only energy management system can do reactive alerting and basic threshold-based shedding. It cannot do forecast-with-schedule-overlay, because the production schedule lives in the MES. In practice, meter-only systems achieve 20–40 percent of the peak reduction that MES-integrated systems achieve, for roughly the same software cost. Every standalone EMS deployment I have seen at SYMESTIC customers has been migrated into the MES within 18–24 months, usually at the cost of rebuilding the integrations twice.

What is The 15-Minute Window Tax?
The structural cost imposed by the utility's billing mechanic — the asymmetric mapping between one bad 15-minute interval and twelve months of elevated demand charges. It is not a tax in the legal sense; it is the consequence of an annual-maximum billing rule combined with a short integration window. The architecture's job is to ensure the bad interval never happens, because once it is logged, the cost is locked in for the rest of the billing cycle.

What is The Simultaneous Startup Cascade?
The single most common peak cause in discrete manufacturing — 50–70 percent of annual peaks across the SYMESTIC customer base — caused by a dozen or more assets exiting idle or power-down state within the same 15-minute window after a break, weekend, or plant shutdown. The fix is a production-aware startup scheduler that staggers activations according to warm-up profile and production priority, not a fixed 2-minute gap between machines.

What is The Phantom Baseload?
The 30–60 percent of nominal power draw that plants carry during unmanned hours — compressed-air leaks running compressors overnight, HVAC serving empty halls, machines left "warm" that actually burn more energy than a clean restart would cost, auxiliary equipment with no shutdown logic. The phantom baseload raises the entire operating curve and makes peak shaving harder by 10–20 percent before any peak event occurs. Addressing it systematically is a prerequisite for effective peak-shaving architecture.

Which meter and protocol should we use?
For 90 percent of mid-market German and European plants: utility-grade Class 0.5 meter (Janitza UMG 508/509, A. Eberle PQ-Box, Siemens Sentron PAC3200/4200) at the main supply point, Modbus TCP polled at 1 Hz by the edge gateway, MQTT publication upstream to the cloud. OPC UA at the meter is preferable where supported natively but adds 20–40 percent to the integration cost. IEC 61850 is appropriate for plants with their own MV substation and full substation automation — overkill for anyone else.

How much of the peak-shaving benefit comes from load shedding vs. scheduling?
Roughly 70/30 in favour of scheduling, across the customer base I have direct visibility into. The MES-integrated startup scheduler and the phantom-baseload reduction do most of the work. Load shedding is the safety net that catches the unexpected events — an unusually cold morning, an unplanned restart after a trip, a weekend ramp-up that overlaps with a process anomaly. Both are required, but scheduling prevents peaks before they happen; shedding reacts to peaks as they develop. Architecture decisions should prioritise the scheduling capability first.

How does peak shaving interact with §14a EnWG and demand-response markets?
Favourably. The infrastructure a plant builds for its own peak shaving — sub-second metering, edge forecasting, tiered load shed, automated actuation — is substantially the same infrastructure required for §14a compliance (controllable consumer assets above 4.2 kW), §19 StromNEV atypical-consumption grid-fee reductions, and participation in primary, secondary, and tertiary balancing reserve markets. A plant with a working MES-integrated peak-shaving system is 60–80 percent of the way to qualifying for regulatory grid-fee reductions and demand-response revenue, without additional capital expenditure. Plants with a reactive alerting-only system are not.

What is the typical ROI for an MES-integrated peak-shaving deployment?
For a 2 MW plant with a demand charge around €75/kW/year, typical first-year avoided peak is 500–800 kW, worth €37,500–€60,000 in avoided demand charges. Implementation cost on top of an existing MES (meter hardware, gateway integration, scheduler configuration, commissioning) is typically €25,000–€45,000 one-time. Payback lands between 6 and 14 months for standalone peak shaving; when layered with §19 StromNEV grid-fee reductions and demand-response participation, the combined payback typically drops below 6 months.

About the author

Mark Kobbert

CTO at SYMESTIC GmbH. B.Sc. Business Informatics, SRH Heidelberg. Has led the technical architecture of the SYMESTIC Cloud MES platform since 2014 — from the microservice architecture on Microsoft Azure through the IoT gateway integration to real-time data processing of over 15,000 machines across 18 countries. Architect of the ground-up rebuild of the SYMESTIC platform as cloud-native, microservices, API-first, on Azure. Expertise: cloud-native MES architecture, Microsoft Azure, microservice architecture, OPC UA, MQTT, Modbus TCP, IoT gateway development, edge computing, ISA-95 integration architecture, industrial connectivity, brownfield machine integration, REST APIs, C#/.NET, SQL, Docker/Kubernetes, real-time data processing, IT/OT convergence, industrial energy telemetry. · LinkedIn

Start working with SYMESTIC today to boost your productivity, efficiency, and quality!