Building SOPs that survive employee turnover
Every MSP owner has lived some version of this story. A senior tech leaves—maybe with two weeks’ notice, maybe with none—and walks out the door carrying half the business in their head. The way that one finicky client’s backup works, the undocumented workaround for the email migration that keeps breaking and the order of operations for onboarding that nobody else fully knows. Suddenly your team is reverse-engineering knowledge that used to live in one person’s brain, and your service quality dips at exactly the moment you can least afford it.
The instinctive fix is “we need better documentation.” So someone gets assigned to write SOPs, a burst of documents gets created, everyone feels good for a month and then the documents quietly rot. Six months later they’re out of date, nobody trusts them and the next departure hits just as hard as the last.
The problem was never the absence of documentation but rather that most MSP SOPs are built to be written, not built to survive. A turnover-proof SOP is a different kind of artifact, created with different habits. Here’s how to build them.
Why most SOPs die
Before fixing it, it’s worth naming why the usual approach fails.
Most SOPs are written by your best person, in a hurry, as a one-time project. That’s the original sin. The expert writes for an audience that already knows what they know, skips the “obvious” steps, and captures the procedure as it existed on the day they wrote it. The document is born already drifting from reality.
Then nothing maintains it. There’s no trigger that forces an update when the procedure changes, no owner responsible for its accuracy, and no moment in anyone’s workflow where they’re required to actually open it. It becomes a museum piece—technically present, practically dead.
Critically, the SOP lives somewhere people don’t work. If it’s in a shared drive folder nobody opens, a wiki nobody searches or a document three clicks away from where the actual work happens, it doesn’t matter how good it is. People will solve the problem from memory or by asking the nearest senior person—which is exactly the knowledge-in-heads dependency you were trying to escape.
Turnover-proofing means attacking all three failure modes: how SOPs are written, how they’re maintained, and where they live.
Write for the newest person, not the expert
The single biggest shift is changing who you’re writing for. A turnover-proof SOP is written for the least experienced person who could plausibly be asked to do the task; not for the expert who already knows it.
That means no skipped steps because they’re “obvious.” It means defining the acronyms, naming the exact tools and where to find them and spelling out the decision points where a newer tech would hesitate. If a step says “configure the firewall appropriately,” it’s useless. If it says “open the firewall rules in [tool], add a rule allowing X from Y, here’s the screenshot of what correct looks like” — that survives.
A practical test: hand the SOP to someone who has never done the task and watch them attempt it without help. Every place they get stuck, ask a question or do it wrong is a gap in the document, not a gap in the person. This is the fastest way to find the invisible knowledge the expert didn’t realize they were assuming. The goal is a document where competent execution doesn’t depend on having been here for three years.
Capture the “why,” not just the “what”
Step-by-step instructions tell someone how to follow a procedure. They don’t tell someone what to do when reality doesn’t match the procedure and, in MSP work, reality frequently doesn’t match.
The SOPs that survive include the reasoning behind the steps. Why is the backup verified in this specific way? What is this step actually protecting against? What’s the failure mode if you skip it? When a tech understands the intent, they can adapt intelligently when a client environment is slightly different, rather than either blindly following steps that don’t fit or abandoning the SOP entirely.
This is also what protects you from the “the SOP didn’t cover this exact situation, so I improvised badly” failure. A procedure that explains its own logic teaches judgment, not just compliance. That’s the difference between documentation that creates competent technicians and documentation that creates robots who break the moment they hit an edge case.
Make maintenance automatic, not heroic
A SOP is not a document. It’s a living description of how work is done, and work changes constantly. If keeping it current depends on someone heroically remembering to go update it, it will drift because heroics don’t scale and people are busy.
The fix is to attach updates to triggers that already exist in your workflow. When a procedure changes like a new tool, a vendor change, a process improvement, updating the SOP becomes part of that change, not a separate task to remember later. Some practical mechanisms:
Tie SOP review to the ticket that exposed the gap. When a tech hits a situation the SOP didn’t cover or got wrong, fixing the SOP is part of closing the ticket—not optional, not “when there’s time.”
Assign every SOP an owner. It shouldn’t be the person who wrote it, but the person currently responsible for that area. An unowned document is an unmaintained document. The owner’s job isn’t to write everything themselves; it’s to make sure the document stays true.
Build in a review cadence with teeth. A lightweight quarterly pass on the SOPs that matter most, where the owner confirms it still reflects reality. The cadence matters less than the fact that someone is accountable for it happening.
The principle underneath all of these: maintenance has to be cheaper and more automatic than the alternative, or it won’t happen.
Put the SOP where the work happens
The best-written, best-maintained SOP in the world fails if people don’t open it at the moment of need. Discoverability is not a nice-to-have—it’s the whole game.
This means SOPs need to live either inside or one click from the systems where work actually happens. Linked from the PSA ticket type they relate to, searchable from wherever your techs spend their day, surfaced by the task, not buried in a folder hierarchy that requires you to already know what you’re looking for. If finding the right SOP takes more effort than asking the senior tech across the room, people will ask the senior tech and you’re right back to knowledge living in heads.
The standard you’re aiming for: when a newer tech picks up an unfamiliar task, the relevant SOP is the easiest, fastest, most obvious next step. It should be easier than guessing, easier than interrupting someone. When that’s true, the documentation defends itself, because using it is genuinely the path of least resistance.
Standardize the environment so SOPs can exist at all
There’s a hard truth underneath all of this: you can’t write durable SOPs for chaos. If every client environment is a unique snowflake with different tools, configurations and ways of doing the same thing then every procedure has a dozen variations and no SOP can keep up. Documentation sprawl is downstream of environment sprawl.
This is why standardization and documentation are really the same project. The more your client environments, your tech stack and your “way we do things” are standardized, the fewer SOPs you need and the more durable each one becomes. A standardized onboarding process can be documented once and reused everywhere. A bespoke one has to be reconfigured every time. Every step you take toward a defined, repeatable operating model makes turnover-proof documentation more achievable and reduces how much any single person can hoard in the first place.
The real payoff
Turnover-proof SOPs aren’t really about surviving the day a tech quits, though they do that. They’re about systematically removing the person-dependency that caps every small MSP’s ability to grow, scale and eventually sell.
When knowledge lives in documented systems rather than individual heads, new hires ramp faster, service stays consistent regardless of who’s on the ticket, your senior people stop being interrupted constantly to answer the same questions, and—not incidentally—your business becomes worth more, because a buyer is purchasing a functioning operating system rather than a group of individuals who might leave.
The departure of a key employee will still sting, but it stops being a crisis that threatens client relationships and starts being a normal staffing event the business absorbs and moves past. That resilience is the whole point. You’re not documenting procedures. You’re building a business that doesn’t live or die by who happens to be employed there this quarter.
Related: “I used to think you were mean…”
Power Your MSP Success with the Kaseya Community
Get fast answers, share product ideas, access exclusive resources, stay current on Kaseya product innovations, and connect with MSP peers.
2026 Kaseya State of the MSP Report

Get 2026 MSP insights from 1,000 plus providers and learn how to grow revenue, adapt to market pressure, and stay competitive.
Get More MSP Insights
Join 50,000+ MSP professionals receiving expert insights, best practices, and industry trends.






