Playbooks & Templates

AI Coding Session Brief

The AI coding session brief is the short scope you write before an assistant edits a single line. It produces one decision, a narrow, well-bounded task an assistant can carry end to end, by naming the goal, the files in play, the constraints that must hold, the risks, and what "done" looks like. Fill it in when a change touches real code that other people depend on, not for a one-line typo fix you could describe in a sentence.

TemplateCoding

When to use this

You are about to ask an assistant to change behavior in a file that other code depends on, like the cursor-based pagination Maya is adding to GET /api/orders.
The change will touch more than one file, so the assistant has room to guess at the approach and the blast radius.
You are unfamiliar with the area of the codebase the change lands in and want the assistant to surface risks before it edits.
A previous session sprawled: the assistant added a caching layer or a new dependency you never asked for, and you want this one fenced in.
You are about to start a fresh agent session and want a clean, self-contained scope so the context window is not filled with unrelated history.
The work has a real failure mode (data loss, an outage, a broken contract with callers) and you want the constraint written down before any code runs.

What it helps clarify

The exact goal, stated as the behavior that should change, so a finished diff can be checked against one sentence.
The files the assistant may touch and the ones it must leave alone, so scope creep is visible the moment it appears.
The constraints that must hold (existing callers, the data shape, the public signature) before the assistant proposes an approach.
The risks and edge cases worth naming up front: the empty page, the last page, the invalid input, the concurrent write.
What "done" means in checkable terms: the tests that pass, the build that stays green, the manual case you will run.
How the work will be generated and reviewed, so the next person reading the pull request knows what was checked.

Why a brief beats a one-line prompt

Picture Maya typing "add pagination to the orders endpoint" and letting an agent run. The assistant picks offset pagination, because that pattern appears most often in its training and nothing in the prompt steered it elsewhere. Ten minutes later there is a clean diff across the handler, the query layer, two tests, and the API docs. It looks finished. Only in review does the problem surface: this orders table is append-heavy, so offset pagination skips and repeats rows under concurrent inserts. The right shape was a cursor, a different design entirely. Now Maya is unwinding a confident, well-formatted diff instead of correcting one sentence before any code ran.

The brief is the cheap version of that correction. A sentence in the Constraints field, "the table is append-heavy, use a cursor," steers the whole session before the first edit. The assistant carries the constraint through the implementation rather than fighting it afterward. The cost is a minute of writing; the saving is an hour of review-time archaeology.

There is a second reason to scope tightly, and it is measurable. Reviewers catch defects on small, focused changes and miss them on large ones. SmartBear's classic peer-review research found defect-detection ability falls off sharply past 400 lines of code in one sitting, and more recent analyses of large pull-request datasets put detection near 87% for changes under 100 lines and around 28% for changes over 1,000 lines. A brief that names three files and one behavior tends to produce a diff a human can actually read. A vague prompt tends to produce a sprawling one nobody reviews properly. The brief is upstream of review quality: scope the session small and the diff is reviewable; let it sprawl and the review becomes a rubber stamp.

Claude Code's own guidance lands in the same place. Its rule of thumb is blunt: if you could describe the diff in one sentence, skip the plan and just ask. Everything above that line, multi-file work, an unfamiliar area, a real failure mode, is exactly where a written brief earns its minute. The brief is how you decide which side of that line you are on before you start.

How to fill it in, with a strong entry next to a weak one

Each field does one job, and the difference between a strong entry and a weak one is whether it gives the assistant something concrete to be steered by. Work through them in order.

Goal. State the behavior that changes, as an outcome. Strong: "GET /api/orders returns cursor-paginated results; existing offset callers keep working during the transition." Weak: "improve the orders endpoint." The weak version is the request with a future tense; there is nothing in it a finished diff can be checked against.
Files in play. Name them. Strong: "src/api/orders.ts, src/db/orders.ts, tests/orders.test.ts." Weak: "the API layer." Naming files does two things at once: it points the assistant at the right code and it makes any edit outside that list visible the instant it appears in the diff.
Out of scope. This is the field most people skip and most regret. Strong: "no caching layer, no new dependency, no change to the response schema." Weak: leaving it blank. Agents tend to propose more than you asked, an extra abstraction, a refactor "while we are here," a library. Writing the fence costs one line here; arguing about it in review costs a meeting.
Constraints. The rules that must hold no matter the approach. Strong: "keep the offset path working; invalid cursor throws rather than returning a partial page; no breaking change to the public signature." Weak: "do it the right way." A constraint the assistant cannot violate is worth more than a preference it can interpret.
Risks and Edge cases. Strong risk: "append-heavy table, so offset pagination skips and repeats rows under concurrent inserts." Strong edge cases: "empty page, last partial page, invalid cursor, duplicate at the boundary." Weak: "handle errors." Naming the four inputs the happy path skips is what turns "it works" into "it works on the cases that actually break."
Done means. Make it checkable. Strong: "build green, existing tests pass, four new tests cover the edge cases above, empty and last-page cases verified by hand." Weak: "it works." Claude Code's strongest single recommendation is to give the assistant a check it can run, a test suite, a build, a script, so the loop closes on its own instead of waiting for you to notice each mistake. The Done means field is where you write that check down.

The How it will run and Context to attach fields connect the brief to the session. For run, decide whether to start in plan mode (multi-file or risky) or go direct (small and clear). In Claude Code that is Shift+Tab to enter plan mode and Ctrl+G to open the draft plan in your editor, where you delete the caching step the assistant added and add the line "do not add caching, out of scope." For context, list the files, the failing test, and the project rule to attach, the same rules you would keep in a CLAUDE.md the assistant reads every session.

Pitfalls, and the brief on a team

A brief can look complete while leaving the real work undone. Watch for these.

The restated task. If your Goal and Constraints are the request reworded, the brief protects nothing. The test is simple: can you find a sentence in it the assistant could get wrong? If not, it is too vague to steer anything.
The empty fence. A blank Out of scope is the most common way a session sprawls. Name the additions you do not want, even when they seem obvious, because the assistant does not share your sense of what is obvious.
"Done" with no check. "It works" is not a finish line an assistant can run against. Without a build, a test, or a manual case in Done means, you become the verification loop and every missed edge case waits for you to spot it in review.
The hallucinated dependency. A real, current risk worth a line in Constraints. A USENIX Security 2025 study of 576,000 code samples across 16 models found 19.7% of suggested packages did not exist, over 200,000 unique fabricated names, and the same fake names recur: 43% reappeared on all ten runs of an identical prompt. Attackers pre-register those names, a practice now called slopsquatting. "No new dependency without asking" in your brief turns an install you would have skimmed into a decision you make on purpose.
The brief that never gets revisited. The brief is also a review instrument. When the diff returns, read it against Out of scope and Done means: those two fields catch the change that scoped up and the one that stopped short.

On a team the brief earns a second life. Drop the filled-in version at the top of the pull request and the next reviewer knows the intended behavior, the fence, and what was checked before they read a single hunk, which is exactly the context a 140-line diff otherwise hides. A shared brief template also keeps a team honest about scope: when every change ships with a named goal and an out-of-scope line, the 900-line pull request that touches everything stops looking normal. Priya, leading the team, can read ten briefs faster than ten diffs and see at a glance which sessions were scoped and which were a hope.

Use the brief as the first link in a short chain. The AI Use Case Canvas decides whether the work is worth doing; this brief scopes the coding session; the AI Code Review Checklist and the Reviewing AI Code Safely guide check the diff it produces against the very fields you wrote here. On your next real change, before you type a prompt, fill in just three lines, Goal, Out of scope, and Done means, and you will already be ahead of the version of you that started typing.

The brief

Nine fields, filled in before the assistant edits anything. Keep each answer to a line or two. If a field is blank, you are not ready to start the session.

Goal : the one behavior that should change, stated as an outcome. Example: GET /api/orders returns cursor-paginated results, existing offset callers keep working during the transition.
Files in play : the files the assistant may read and edit. Name them. src/api/orders.ts, src/db/orders.ts, tests/orders.test.ts.
Out of scope : what the assistant must not touch or add. No caching layer, no new dependency, no change to the response schema.
Constraints : the rules that must hold. Keep the offset path working, throw on an invalid cursor rather than returning a partial page, no breaking change to the public signature.
Risks : what could go wrong and who it hurts. Append-heavy table, so offset pagination skips and repeats rows under concurrent inserts; a bad cursor must fail loudly.
Edge cases : the inputs the happy path skips. Empty page, last partial page, invalid cursor, a duplicate at the page boundary.
Done means : the checkable bar. Build green, existing tests pass, four new tests cover the edge cases above, the empty and last-page cases verified by hand.
How it will run : the mode and the verification check. Plan mode first, read the plan, then implement; run the test suite after each change.
Context to attach : the files, errors, or rules the assistant needs. Attach the two source files, the failing test, and the project rules on dependencies.

Example

A filled-in brief (Maya, cursor pagination on the orders endpoint)

Goal: GET /api/orders returns cursor-based pages; existing offset callers keep working during a one-release transition. Files in play: src/api/orders.ts (handler), src/db/orders.ts (query), tests/orders.test.ts (new cases). Out of scope: no caching layer, no new dependency, no change to the JSON response shape or column names. Constraints: keep the offset path working; cursor encodes the order id (monotonic here); invalid cursor throws, never returns a partial page. Risks: the orders table is append-heavy, so offset pagination skips and repeats rows under concurrent inserts; a silently-bad cursor would corrupt a client's paging state. Edge cases: empty result, last partial page, invalid cursor, a duplicate row at the page boundary. Done means: build green; existing offset tests still pass; four new tests cover the edge cases; the empty and last-page cases run by hand and behave. How it will run: plan mode first (Shift+Tab), read and edit the plan with Ctrl+G, then implement; run the test suite after each change. Context to attach: src/api/orders.ts, src/db/orders.ts, the failing pagination test, and the project rule "do not add dependencies without asking."

Usage notes

Write the brief before you open plan mode, then paste it in as the opening message. The assistant plans against your scope instead of inventing its own.
Keep the session to one brief. When a finished change suggests a follow-up, start a new brief and a fresh context rather than stretching this one.
A blank field is a signal, not a formality. If you cannot name the risks or what "done" means, you are not ready to hand the work over yet.
Revisit "Out of scope" and "Done means" at review time. They are the two fields that catch a diff that scoped up or stopped short.
Pair this with the guides that surround it: Planning Before Coding turns the brief into an editable plan, and Giving AI the Right Context covers what to attach in the last field.
When the diff comes back, switch to the AI Code Review Checklist and the Reviewing AI Code Safely guide; the brief is what you review the diff against.

Copy this skeleton

## AI Coding Session Brief **Goal:** **Files in play:** **Out of scope:** **Constraints:** **Risks:** **Edge cases:** **Done means:** **How it will run:** **Context to attach:**

Downloadable version

One page to fill in before an assistant edits any code: goal, files, scope, constraints, risks, edge cases, and what done means.

Preview

Scope the change

Goal: the one behavior that should change, written as an outcome you could verify.
Files in play: the specific files the assistant may read and edit, named in full.
Out of scope: what the assistant must not touch, add, or refactor while it is here.

Name the rules and the risks

Constraints: the rules that must hold (existing callers, data shape, public signatures).
Risks: what could go wrong, on which input, and who it hurts.
Edge cases: the inputs the happy path skips (empty, last page, invalid input, duplicate at the boundary).

Define done and how it runs

Done means: the checkable bar (build green, these tests pass, this case verified by hand).
How it will run: the mode (plan first or direct) and the verification check the assistant runs.
Context to attach: the files, errors, and project rules the assistant needs to do this well.

Download PDF