Insight

AI in Sales

How Claude Enables Your Sales Playbook

Once your playbook is codified, turning it into a working AI copilot is mostly architecture, not a rebuild.

Written by

Charles Talbot

Founder

charles-talbot

Closing Foundry . Insight

Reviewed by

Laurie Mascott

Senior Operating Partner

laurie-mascott

Published

June 9, 2026

Updated

Read time

Key Points

A codified playbook is the asset. Process, qualification, method and behaviours, written as structured data, is the hard part and it is platform-agnostic. Standing it up as a Claude copilot reuses most of that work rather than recreating it.
The common build is one giant prompt plus fuzzy file search. It demos well and strains in production: it retrieves the wrong thing, you cannot test or version a single job, and the outputs are suggestions rather than contracts.
The architecture that holds is three layers. A thin orchestrator (the brain), one skill per job (the capabilities), and the codified playbook as data read only when needed.
Progressive disclosure is why layering works. Claude loads only the piece a job needs, so the system stays sharp, cheap and maintainable instead of carrying the whole playbook in every request.
Governance is a layer, not a disclaimer. Output contracts, evidence separated from inference, human approval before any CRM write, and segments never mixed are what make reps trust the tool.
Connect live systems last. Stabilise the copilot against pasted data, then wire in CRM and call data through MCP in sequence. Starting with integrations means debugging two moving targets at once.

Start where the codified playbook ends

The last piece made one argument: AI in sales only works on top of a defined sales architecture, and the real work is codifying how you sell into structured data a model can reason with. That gets you process and buying-process design, a qualification model, a problem-to-impact-to-root-cause matrix, the behaviours that move deals, and a question bank. The asset is the decision logic, not the chatbot.

This piece answers the next question leaders ask: once that logic exists, how does it actually become a copilot a seller can use. The honest answer is that it is mostly an architecture and packaging exercise. The decision logic transfers largely unchanged. What changes is the structure it sits in. Get that structure right and the same codified playbook can run as a copilot on Claude today and on a different model tomorrow, because you own the logic, not the vendor's interface.

The common build, and why it strains

Most sales copilots today are built the obvious way: one large prompt that holds the persona, the stages, the rules and the output style, sitting on top of a pile of knowledge files the model searches at runtime. This is the Custom GPT pattern, and it works well enough to demo.

It strains in production for three structural reasons. Retrieval is fuzzy, so the model may pull the wrong section at the wrong moment. The prompt is monolithic, so you cannot test, version or improve one job without touching everything. And the output shapes are guidance rather than contracts, so the same command produces a slightly different artefact every time. In a marketing draft that is tolerable. In a sales setting, where a copilot can invent a metric or write a wrong note into the CRM, it is not.

A sales copilot is software you should be able to test, not a prompt you hope still works after the last edit.

The architecture that holds: three layers

The design that holds up in production collapses to three layers. The same three apply to every team. Only the bottom layer changes.

‍

Layer 1, orchestration: the brain. A thin top layer holding what must be true across every interaction: the persona, segment and stage detection and memory, the priority-of-truth order (fresh deal evidence first, then your canonical data, then the playbook), the global output and reasoning style, and the governance rails. It does not contain the detailed procedure for any single job. It routes to the part that does.

Layer 2, capabilities: one skill per job. Each job the seller actually invokes, the discovery plan, the demo plan, the MEDDPICC review, the deal brief, the business case, becomes its own skill. A skill states when to use it, the procedure to follow, which reference files to pull, the fixed output contract, and the guardrails specific to that job. Because skills are isolated, each can be tested against known cases, versioned and improved on its own. You can harden the MEDDPICC review without touching the proposal.

Layer 3, canonical data and templates: read on demand. The codified playbook, normalised into clean machine-readable reference files, the problem matrix, the qualification rubric, the behaviours, the question bank, the stage gates, plus the output templates. Nothing here loads until a skill asks for it. This is the only layer that is unique to a given team. The two layers above are reusable infrastructure.

Why layering beats the monolith: progressive disclosure

The reason this is not just tidier filing is a Claude design principle called progressive disclosure. At startup the system only sees the short name and description of each skill, a few hundred tokens in total. It loads a skill's full instructions only when a job matches, and it loads a reference file only when that skill actually needs it. The playbook is not sitting in the context window on every request. It is pulled, precisely, the moment it is required.

‍

That single property produces five practical payoffs, in priority order:

Determinism and testability. Isolated jobs with fixed output contracts can be tested against known inputs. A monolith can only be eyeballed.
Context efficiency. Startup cost is the skill descriptions, not the whole knowledge base, so the model stays sharp because its context is not clogged with data it is not using.
Maintainability. When the playbook changes, a new behaviour, a re-scored rubric, you edit one reference file. The change is localised and diff-able, not a risky rewrite of a five-thousand-word prompt.
Governance. Rails are enforced centrally in the orchestrator and restated per skill, so they cannot be skipped on a given command.
Reusability. The top two layers are method, not content. Onboarding the next team is largely a data-layer swap.

The building blocks Claude gives you

Four building blocks do the work, and mapping your playbook onto them is most of the job.

Skills are the capability unit: a short instruction file with a name and description, plus the reference files and templates a job needs. This is where each command lives, and it is what makes progressive disclosure possible.

Projects are a persistent container with standing instructions and uploaded knowledge, shared across every conversation in the project. This is the fastest way to give a non-technical team a copilot that mirrors the Custom GPT experience they already know.

Plugins are packaged bundles of skills, commands and connectors, installable across a whole team. This is the unit of distribution once the pattern is proven.

MCP connectors are the open protocol that lets the copilot read and write live systems, the CRM, conversation intelligence, comms, rather than pasted text.

The practical point: build the engine once as a skill bundle, then choose the vehicle by reach and maturity, not by rebuilding. A Project for a first pilot, a plugin for team-wide and multi-team rollout. Same engine either way.

Own the method and the governance. The platform is the part you can swap.

Output contracts and governance carry more weight in sales

A sales copilot that fabricates a reference customer or writes a wrong note into the CRM creates commercial and reputational risk, so governance is a first-class layer, not a closing caveat. The architecture enforces it in three ways a single prompt cannot.

Output contracts make each job concrete. "Fill these headings, in this order, from these sources" leaves far less room for the model to wander than open-ended generation, and it is what you test against. The evidence-versus-inference split is built into the output shape, so confirmed facts, inferences and missing evidence are visibly separated and a rep can see exactly what the copilot knows versus what it is guessing. And the standing rails are enforced centrally: human approval before any system write, no invented proof or numbers, segment methods never mixed, and unknowns surfaced as "to be confirmed" with a suggested way to resolve them.

This is the same principle as the last piece. AI drafts and recommends, people approve. The architecture is what makes that principle hold under load.

The path from playbook to copilot

The build runs as a repeatable sequence, and the order matters.

First, codify the decision logic, the work the last piece described, and normalise it into clean reference files. Then define the jobs the copilot must do, mapped to the real sales motion, one job to one skill. Write the output contract for each job before authoring it, because the contract is the spec and the test. Author the thin orchestrator and the skills on top of the reference data. Build the governance rails once and reference them everywhere. Assemble the test cases: golden examples (a representative transcript paired with the expected output per job) and guard tests (feed thin context and confirm gaps surface as "to be confirmed" rather than invention; feed one segment and confirm another segment's method never appears). Then ship the vehicle, a Project or a plugin.

‍

Connect live data last

Live integrations are valuable and they are sequenced last. The recommended order, each step gated on the previous being stable: CRM read first, to pull the opportunity, stage and contacts and remove manual paste; then conversation intelligence, to feed real call transcripts into discovery and evidence review; then comms, to deliver deal briefs where reps already work; then CRM write, always behind human approval.

The reason not to start here is simple. Integrations are only as good as the brain behind them. Wire live data into an unstable set of skills and you debug two moving targets at once. Stabilise the copilot against pasted data first, then make the data flow automatic.

What this means for you

For a revenue leader, the takeaway is that the copilot is bounded work on top of an asset you can own outright: your codified method. The advantage is not the platform and not the model. It is the quality of the decision logic and the governance around it, which is exactly what stays valuable as models change. Build it model-agnostic and you can run the same playbook as a copilot on whatever wins, without starting again.

For RevOps, the takeaway is that this is finally testable. Jobs with fixed output contracts and a golden-case suite behave like software you can inspect and regress, not a prompt you hope still works after the last edit. That is the difference between a copilot a team trusts in live deals and one that quietly gets abandoned.

Related terms

Sales Process: A defined sequence of steps that guides reps from first contact to close.
Sales Method: A structured framework or approach for running a sales process.
Sales Qualification: The process of determining whether a prospect is worth pursuing.
MEDDPICC: A B2B qualification framework spanning Metrics, Economic Buyer, Decision Criteria, Decision Process, Paper Process, Implicate the Pain, Champion and Competition.
Deal Inspection: A structured review of a deal's health, risks, and next steps.
Stage Exit Criteria: The conditions that must be satisfied before a deal advances to the next stage.

The Revenue Workshop

60 minutes to work out what’s getting in the way of sales execution and what to fix first. No cost, operator-led.

Learn more →

See what it takes to stand one up on your motion

A copilot is only as good as the codified playbook underneath it. Bring your motion to a Revenue Workshop and get an operator view of what to build first.

Book a Revenue Workshop

On this page

This is some text inside of a div block.