Entry 0082·June 8, 2026·Automation·Leverage

Model the Variance, Not the Average Line

A packaged-protein manufacturer running several sausage and lunchmeat lines wanted to know whether their facility could absorb growth, or whether they were about

Truth · modeled scenario

TL;DR

Mechanism: A line modeled on average speeds will be wrong exactly where it costs you, because incoming material variance propagates through every downstream step and averages erase the variance that decides the schedule.
System Effect: Plan against the mean and the model agrees with itself while the floor diverges, so capacity decisions get made on a number the plant never actually runs.
Economic Impact: One mis-modeled line in a recent build read ten points low on utilization; planned around that average, a plant either buys a shift it does not need or starves one it does, and both errors are six figures.
Diagnostic: Ask for a year of run data, not thirty days. If your model holds on the annual file but breaks on a single season, you have been orchestrating an average that does not exist.

The thirty-day file that lied

A packaged-protein manufacturer running several sausage and lunchmeat lines wanted to know whether their facility could absorb growth, or whether they were about to need new capacity. Standard question. The standard answer is a spreadsheet: average line speed per format, average changeover, average crew. Run the demand against the averages, find the gap, write the capex case.

We asked for their data and they sent thirty days. We sent it back and asked for a year.

That exchange is the whole article. Thirty days of data describes a plant on its best behavior. It smooths out the seasonal raw-material shift, the batch that ran cold, the week the upstream supplier changed grind. A year of data shows you the variance, and the variance is where the plant actually lives. Plants are not static. The value is not in modeling the line that runs on a good Tuesday. It is in modeling the line that has to survive a bad February.

With the full year, we built a digital twin and it landed within two percent of their real baseline, 98.2 percent accuracy across production, labor, downtime, and throughput. That number matters less than what produced it: a model fed the variance, not the mean.

Variance propagates; averages hide it

Here is the mechanism. Incoming material variance does not stay upstream. A viscosity change at the front of the line alters fill speeds and weights. A seasonal shift in raw material changes processing time without changing the recipe. Each of those moves a few minutes per run, and a few minutes per run is invisible in an average. But the line does not run on an average. It runs on the actual sequence of actual batches, and the variance compounds as it moves downstream into changeovers, into packaging utilization, into the schedule.

We saw exactly this when one line in the model would not converge. Every other line matched at 98 percent; this one sat at 88 to 89. The instinct is to torture the model until it agrees. Instead we called it out and went looking, and found six or seven weeks of corrupted downtime data, weird utilization spikes that did not make sense. Bad data on one line dragged it ten points off, and ten points is the difference between a line that looks fine and a line that looks like it needs help. If you average over that without seeing it, you plan around a phantom.

That is what predictive orchestration is actually for. Not a prettier dashboard. A model you can run scenarios against before you spend, because it captures the variation the spreadsheet erases. We ran the obvious ones: demand holds, demand grows, demand doubles. Where does the facility break, and is the break labor, throughput, or scheduling? The twin let us collapse four lines on a single shift down to two virtually, and see what that does to utilization, before anyone moved a machine or hired a crew.

Validate against the floor before you promise a number

The discipline that makes this trustworthy is knowing when the model is not ready to talk. On a separate engagement, a protein processor standing up a second plant in Georgia, the temptation was to lead with forecasted savings in the kickoff. We deferred it. The instruction to the client team was explicit: do not report forecasted savings until the on-site assessment yields site-specific data. Quoting a predicted number before you have validated it against the floor does one of two things, both bad. It inflates expectations, or it confuses executives who then anchor on a figure the model has not earned.

This is the corollary to the year-of-data rule. A model is only as honest as the variance you fed it, and the variance you have not yet measured on this specific floor is a hole. So you walk the line first. You pull the annual file. You reconcile the weird weeks. Then the model gets to speak, and when it does, the number holds.

The sequencing is the same on the forecasting side. On a third engagement, a better-for-you food brand, the forecasting workstream kept getting tangled with quote tracking in the same conversations, and the forecasting suffered for it. Separate them. Forecasting is the front of the orchestration loop, the demand signal the whole schedule keys off. If it shares a meeting with PO status and vendor quotes, it gets the leftover ten minutes. Give the predictive input its own table or the orchestration downstream inherits a soft number.

The lens flip

A line modeled on averages will always agree with itself. The arithmetic is clean, the gap is obvious, the capex case writes itself. The trouble is that no plant runs on the average. It runs on the bad February, the cold batch, the six weeks of corrupted downtime nobody flagged. Model the variance and the answer changes, sometimes from "buy capacity" to "you already have it, you are losing it to sequence." Ask for the year, not the month. The variance you smooth out in the model is the variance that decides your schedule on the floor.

Continue reading in Automation

← All entries

Model the Variance, Not the Average Line

The thirty-day file that lied

Variance propagates; averages hide it

Validate against the floor before you promise a number

The lens flip

Your Floor Reports 84 Percent And Runs Fifty

One Percent Downtime Is a Configuration, Not a Fact

Twelve-Month Lead Times Make Capacity a Forecast Decision

The thirty-day file that lied

Variance propagates; averages hide it

Validate against the floor before you promise a number

The lens flip

Your Floor Reports 84 Percent And Runs Fifty→

One Percent Downtime Is a Configuration, Not a Fact→

Twelve-Month Lead Times Make Capacity a Forecast Decision→

Your Floor Reports 84 Percent And Runs Fifty

One Percent Downtime Is a Configuration, Not a Fact

Twelve-Month Lead Times Make Capacity a Forecast Decision