Operating notes
Capacity planning: how to staff an operation to demand.
Staff too lean and the service level collapses. Staff too rich and you burn the budget on idle hours. Capacity planning is how you find the line, and hold it as demand moves.
10 min read · By Everton Paula
Most staffing decisions in a growing company are made by feel. The queue looks bad, so a manager asks for three more people. The budget looks tight, so finance freezes a backfill. Neither decision is wrong on purpose, and both are guesses, because nobody has connected the number of people to the amount of work in a way you can defend.
Capacity planning is that connection. It takes how much work is coming, how long each piece takes, and how much of a person's paid time is actually available, and turns them into a number of people. Done well, it ends the monthly argument about headcount, because the staffing level stops being an opinion and becomes a calculation everyone can see.
What capacity planning is
Capacity planning is matching staffing to forecast demand so the operation hits its service level without paying for idle headcount. It applies anywhere work arrives and people process it: a support queue, a warehouse, a claims team, a moderation desk, a courier dispatch floor. The vocabulary changes by function, the math does not.
A capacity plan needs four inputs, and the quality of the plan is capped by the weakest of them.
- Demand forecast. How much work is coming, by period. Contacts per week, orders per day, claims per shift. This is the input most teams get wrong, usually by planning to the average instead of the peak.
- Handle time. How long one unit of work takes, on average. The average hides a lot, but it is the starting point.
- Available time per person. The hours a person can actually spend on the core work, after everything else their paid day contains.
- Target service level. The standard you are staffing to hold. Answer 80 percent of contacts within 30 seconds, ship 99 percent of orders same day. Without a target, there is no right answer to how many people you need.
The core calculation
The base of every capacity model is the same. Total work equals demand times handle time. People needed equals total work divided by available time per person. Everything else is refinement on top of those two lines.
A worked example. A support team takes 10,000 contacts a week, and the average handle time is 8 minutes. That is 80,000 minutes of work, or about 1,333 hours a week. If an agent is paid for 40 hours a week, it is tempting to divide 1,333 by 40 and conclude you need 33 agents. That number is wrong, and the operation that staffs to it will miss its service level every week. The reason is shrinkage.
Shrinkage: the input that breaks most plans
Shrinkage is the gap between the hours a person is paid for and the hours they are available to do the core work. Breaks, team meetings, training, one-on-ones, system downtime, admin, and absence all live in that gap. In most operations shrinkage runs 25 to 35 percent, and it is the single most common reason a capacity plan understaffs.
Back to the example. If shrinkage is 30 percent, an agent paid for 40 hours is available for 28. Now the math is 1,333 hours of work divided by 28 productive hours, which is about 48 agents, not 33. The 15-agent gap between the naive number and the real one is the difference between holding the service level and drowning under it. A plan that ignores shrinkage is not a conservative plan, it is a wrong one.
Call center staffing has one more wrinkle
For work that queues in real time, such as calls and live chats, you cannot simply divide work by available hours, because contacts do not arrive in a tidy stream. They cluster, and while agents handle one contact, others wait. To hold a target answer time, an interaction operation has to staff above the raw workload number, leaving deliberate slack so the queue does not build during clusters.
This is what an Erlang model captures, and it is why call center staffing tools exist. The practical takeaway without the math: the higher your target service level, the more slack you have to staff, and the relationship is not linear. Going from answering 70 percent of calls in 30 seconds to 90 percent costs far more than the 20-point gap suggests. That curve is a budget conversation worth having before you promise a service level.
From calculation to staffing model
A single number is not a staffing model. A staffing model is the calculation run across periods, so you can see where demand peaks, where the plan is short, and where you are carrying slack. It is what turns capacity planning into headcount planning: the hiring you approve this quarter should come out of the gap the model shows between demand and current capacity over the next two or three quarters, not out of how the queue felt last week.
You can build a first version in a spreadsheet, and you should, because building it teaches you which inputs your operation is missing. If you want a running start, the free workforce forecast tool in the Marketplace Ops Toolkit models demand, handle time, and shrinkage into a staffing number in your browser, with no signup.
The three ways it goes wrong
Planning to the average. Demand is not flat. If you staff to the weekly average, you are overstaffed in the troughs and underwater in the peaks, and customers only experience the peaks. Plan to the shape of demand, not its mean.
Forgetting shrinkage. Covered above, and worth repeating because it is that common. Paid hours are not productive hours. Subtract the gap before you divide.
Treating it as a one-time exercise. A capacity plan is not a document, it is a model you re-run as demand and handle time move. The plan you built in January is wrong by March if nobody updated the forecast. It belongs in the operating cadence, reviewed monthly, not filed after the budget is set.
Why this is operating work
Capacity planning is where a lot of operating pain is quietly decided. A team that is always behind is often not a team with a performance problem, it is a team that was staffed to the wrong number and is being asked to make up the gap with effort. Building the model that gets the number right, and reviewing it in the operating cadence as demand moves, is one of the systems that lets a company grow without the operation breaking under it.
That is the work Plenor does. If your operation is staffed by feel and you want an outside operator to read it and rank what to fix first, the one-week, fixed-fee Operating Teardown is where to start. If you need someone to own the operation while you build it, that is fractional COO work.
Next note.
How to install a weekly operating cadence in 30 days
Where the capacity model gets reviewed and re-forecast as demand moves. The three-meeting week and the 30-day install.
How to run a root cause analysis
When a team is always behind, capacity is often the root cause. The 5 Whys and fishbone methods, with a worked example.
