Playbook · 12 min read

The questions that turn messy ideas into production-ready user stories.

A 78-question playbook for product managers, business analysts, and delivery leads. Covers context, users, current-vs-future state, dependencies, Gherkin acceptance criteria, and the non-functional stuff that blows up in production. Built on INVEST, the 3Cs, and Example Mapping.

By The Backlog.cloud teamPublished 22 April 2026

TL;DR

Stories fail in the conversation, not the card.

The template is easy. As a [user], I want [capability], so that [outcome]. The hard part is the conversation that makes the template honest. Use this question bank during refinement, discovery workshops, or wherever a vague ask threatens to become a half-baked ticket. The goal is not to answer every question. It is to run out of red cards before engineering starts.

Jump to the 78 questions Read the FAQ

Foundations

The two frameworks every good question sits inside

Every question in this post maps to one of two patterns from agile practice. Knowing the pattern is the difference between running a refinement session and running a game of twenty questions.

Framework 01

INVEST

Bill Wake, 2003. The quality checklist.

Independent. The story can be shipped without waiting on another story.
Negotiable. The scope is a starting point for conversation, not a fixed spec.
Valuable. A real user or business gets a real outcome.
Estimable. The team can size it without a spike.
Small. It fits inside a sprint, usually a few days.
Testable. There is an unambiguous way to verify it works.

Framework 02

The 3Cs

Ron Jeffries. The process.

Card. A short written story that fits on an index card. The promise of a conversation.
Conversation. The refinement discussion where the real requirements surface. Where most of the question bank below gets used.
Confirmation. The acceptance criteria that proves the story is done, usually as Gherkin scenarios or a bulleted rule list.

INVEST is the checklist. The 3Cs is the workflow. Use both.

The question bank

78 questions, in 10 categories.

Work top to bottom for greenfield features. Cherry-pick for refinement of an existing story. If a category returns blank answers, that is a red card. Park it, resolve it, and only then bring the story into the sprint.

Context and scope

Before anyone writes a line of Gherkin, get crisp on what is actually being asked for. If you cannot answer these in one sentence each, you do not have a story yet, you have a wish.

Q01What specific change is required?
Q02Is this change related to the front end, the back end, or both?
Q03Is this a new capability, an enhancement, a bug fix, a spike, or a piece of tech debt?
Q04What is explicitly in scope, and what is explicitly out of scope?
Q05What is the smallest useful slice we could ship?
Q06Does this belong as one story, or should it be split into several?
Q07Is there an existing epic this rolls up into, or does it need a new one?

Questions B and C from your intake sheet live here.

Users and stakeholders

Stories are promises of a conversation, but first you need to know who the conversation is with. Capture the user, the benefiter, and anyone who can veto the work.

Q01Who will be impacted or benefit from this change?
Q02Who are the stakeholders, and what are their roles in this process?
Q03Which specific user persona or role is this story written for?
Q04How often do they do this task today? Daily, weekly, or once a quarter?
Q05What job are they trying to get done when they reach for this feature?
Q06Who has decision-making authority over the final behaviour?
Q07Who needs to approve the copy, the data handling, or the pricing logic?
Q08Who will support this in production, and have they been consulted?

Questions D and K from your intake sheet live here.

Why it matters

Every story needs a reason for being. If the why is missing, engineers will fill the vacuum with their own assumptions. That is how you end up with features no one asked for.

Q01Why is this change important?
Q02What is the expected benefit of implementing the change?
Q03What metric do we expect to move, and by how much?
Q04What is the cost of not doing this, measured in money, time, or risk?
Q05Which OKR, strategic theme, or customer commitment does this serve?
Q06Is there a deadline or external event forcing the timing?
Q07Is this a must-have, should-have, could-have, or would-have (MoSCoW)?
Q08Would we still build this if we only had one engineer for a week?

Questions E and F from your intake sheet live here.

Current state and future state

The "as is" forces you to describe the real workflow, not the one on the architecture diagram. The "to be" forces you to describe the finish line in terms a user would recognise.

Q01What is the current (as is) state of the system or process related to the change?
Q02What should the future (to be) state of the process look like after the change?
Q03What workaround are users relying on today?
Q04What do the current screens, inputs, and outputs look like?
Q05What will a user see, click, or receive that is different after this ships?
Q06What stays exactly the same, and have we said so out loud?
Q07Are there parts of the old behaviour we need to preserve for compliance or habit?

Questions G and H from your intake sheet live here.

Dependencies, constraints, and assumptions

Most stories fail in delivery because of something that was never written down. Force the conversation early.

Q01What dependencies or constraints are associated with this change?
Q02Which other teams, services, or APIs need to change at the same time?
Q03Are there feature flags, migrations, or data backfills this depends on?
Q04What regulatory, privacy, or accessibility requirements apply?
Q05What are we assuming is true? How would we know if that assumption is wrong?
Q06Is there a budget, licence, or vendor constraint we need to work inside?
Q07What technology or library choice is already decided and not up for debate?
Q08Is there a freeze, release train, or blackout window we need to plan around?

Question I from your intake sheet lives here, plus the assumption work most teams skip.

Acceptance criteria

This is the Confirmation in the 3Cs. Write it in Gherkin when the behaviour is testable, or as a bulleted list when it is a quality attribute. If you cannot describe how you would verify the story, you cannot finish it.

Q01What acceptance criteria have you defined for this change, using Gherkin (Given, When, Then)?
Q02What does "done" look like from the user's perspective?
Q03What are the three concrete examples that exercise the rule?
Q04What is the happy path?
Q05What edge case would embarrass us in production?
Q06What inputs are invalid, and what should happen when a user provides them?
Q07What data state do we start from in each scenario?
Q08What observable output confirms the rule fired correctly?
Q09Is there a scenario where two rules conflict, and which one wins?

Question J from your intake sheet, expanded into the work that actually makes stories testable.

Non-functional requirements

The stuff no one thinks to ask in refinement, then everyone argues about in the incident channel. Ask once, ask early.

Q01What is the expected load? Peak concurrent users, requests per second, rows per query?
Q02What is the acceptable latency for the user-facing action?
Q03What is the acceptable error rate, and how is it measured?
Q04What authentication, authorisation, and audit requirements apply?
Q05What personal or sensitive data is touched, and where does it live?
Q06Which WCAG level do we need to hit, and who signs off?
Q07How is this logged and monitored? Which dashboards or alerts get updated?
Q08What is the rollback plan if this ships broken?
Q09What is the expected behaviour when a downstream dependency is offline?

Risks, edges, and failure modes

A good story predicts the incident review. Ask the uncomfortable questions before engineering does.

Q01What is the worst thing a well-meaning user could do with this feature?
Q02What is the worst thing a malicious user could do with this feature?
Q03What happens if the network drops halfway through the action?
Q04What happens under concurrent edits, double-submits, or duplicate webhooks?
Q05What happens if the feature is disabled mid-session via a flag?
Q06What personal data could leak via logs, error messages, or shared screens?
Q07What breaks if the user is on a slow connection, an old browser, or a small screen?
Q08What is our plan if the metric moves in the wrong direction after release?

Testing and the Definition of Done

A story is not small if QA cannot finish it inside the sprint. Make testability part of refinement, not a negotiation at the end.

Q01How will QA verify this? Manual, automated, or both?
Q02What test data do we need, and who provisions it?
Q03Which environments need the change to consider it tested?
Q04Are there contract tests or downstream tests that need updating?
Q05Do we need a load test, a security scan, or an accessibility audit?
Q06What documentation, runbook, or help article needs to exist before release?
Q07Does the team's Definition of Done already cover this, or do we need a story-specific extension?

Rollout and learning

Shipping is step one. The story is not closed until someone has looked at whether it worked.

Q01Who gets this first? Internal, beta cohort, percentage rollout, or full release?
Q02What communication goes out, to whom, and when?
Q03What event or metric tells us the rollout is healthy?
Q04How long do we watch before declaring success?
Q05What would cause us to roll back, and who has authority to call it?
Q06What follow-up stories are already implied by this one?
Q07How does this change the onboarding, the help centre, or the sales demo?

Technique

Running the conversation with Example Mapping

Matt Wynne’s Example Mapping turns the question bank into a live, timed conversation. You need four colours of index card (or four shapes in a virtual board) and a 25-minute timer per story.

Yellow card

Story

The card itself. One per session. Written in the As a, I want, So that format.

Blue card

Rules

The acceptance criteria or business rules that must hold. Usually three to seven per story.

Green card

Examples

A concrete example for each rule. If you cannot produce one, the rule is too abstract.

Red card

Questions

Anything you cannot answer in the room. A pile of red cards means the story is not ready.

Run a session every other day, one story at a time, time-boxed to 25 minutes. If you finish early, the story is clear. If you run out of time with red cards on the table, the story is not sprint-ready. This is the fastest way to expose ambiguity before an engineer types `git checkout -b`.

Copy and paste

A clean intake template

If you are running intake through a spreadsheet, a form, or a wiki page, the columns below mirror this question bank and are enough to take a story from idea to sprint-ready. Adjust the fields for your domain.

Field	What to capture
A. Timestamp	When the request was raised.
B. Specific change required	The one-sentence ask.
C. Front end, back end, or both	Forces a scope conversation early.
D. Who is impacted or benefits	Name the user, not the department.
E. Why this is important	The pain or opportunity driving the work.
F. Expected benefit	The outcome or metric that moves.
G. Current (as is) state	The real workflow today, workarounds included.
H. Future (to be) state	What a user will see or do after the change.
I. Dependencies and constraints	What blocks this and what it blocks.
J. Acceptance criteria (Gherkin)	Given, When, Then for each rule.
K. Stakeholders and roles	Approvers, informed parties, support owners.
L. Non-functional requirements	Performance, security, accessibility, logging.
M. Risks and edge cases	What could go wrong, by whom, when.
N. Test plan	How QA verifies this, with what data, in which environment.
O. Rollout plan	Who gets it first, how success is measured.

Columns A through K match the intake sheet most teams already use. Columns L through O are the fields teams usually discover they need after the first production incident. Add them now.

When you have 15 minutes

The 10-question minimum set

Not every story needs the full bank. For smaller work, or for refinement when the sprint starts in an hour, this is the shortest responsible version.

01What specific change is required, in one sentence?
02Which user or role is this for, and how often do they do this?
03Why does it matter? What metric moves or what pain goes away?
04What is the current (as is) behaviour?
05What is the future (to be) behaviour a user would recognise?
06What are the three to five rules, in Given-When-Then form?
07What is the one edge case that would embarrass us in production?
08What does this depend on, and what assumption are we making?
09What are the performance, security, or accessibility constraints?
10How will QA verify this, and what is the rollback plan?

FAQ

Frequently asked questions

What are the 3 Cs of a user story?

Card, Conversation, and Confirmation. Coined by Ron Jeffries, the Card is the short written story, the Conversation is the shared understanding built during refinement, and the Confirmation is the acceptance criteria that proves the story is done. Most failed stories failed the Conversation step, not the Card.

What does INVEST stand for in user stories?

Bill Wake's 2003 acronym: Independent, Negotiable, Valuable, Estimable, Small, and Testable. A good story stands alone, invites discussion, delivers value, can be sized, fits inside a sprint, and has a clear way to verify it is done. Use INVEST as the pass-fail checklist before pulling a story into a sprint.

What is the standard user story template?

As a [specific user or role], I want [a clear capability], so that [an explicit outcome or value]. The template is a forcing function, not a format. If you cannot name a real user, a specific capability, and a measurable outcome, the story is not ready.

What is Example Mapping and why use it?

A refinement technique from Matt Wynne. You time-box 25 minutes per story and capture four things on coloured cards: the story (yellow), rules (blue), concrete examples of each rule (green), and open questions (red). A pile of red cards means the story is not ready for the sprint. It is the fastest way to expose ambiguity before engineering starts.

How many questions should I actually ask per story?

Enough to get rid of the red cards. For a small, well-understood story you might answer six questions across context, acceptance, and testing. For anything new, regulated, or cross-team, expect 20 or more. The question bank in this post is a menu, not a homework assignment.

Do acceptance criteria have to be in Gherkin?

No. Gherkin (Given, When, Then) is excellent for behavioural rules with concrete examples, especially when BDD tooling like Cucumber is in play. For non-behavioural criteria, such as performance targets or compliance rules, a bulleted list is clearer. Use the format that makes the check unambiguous.

How do I write a good user story as a business analyst or product manager?

Work in three passes. First, the Card: user, capability, outcome in one sentence. Second, the Conversation: walk through the question bank with engineering, design, and the domain expert until the red cards are gone. Third, the Confirmation: convert the rules and examples into acceptance criteria the team can test. Do not skip pass two.

What is the difference between acceptance criteria and a Definition of Done?

Acceptance criteria are story-specific and describe the behaviour the story must exhibit. The Definition of Done is team-wide and describes the quality bar every story must clear, such as unit tests passing, documentation updated, accessibility checked. You need both.

Or skip the questions

Let Backlog.cloud ask them for you.

Backlog.cloud joins your meeting, listens to the conversation, and produces user stories, acceptance criteria, decisions, and action items in your backlog before the call ends. Every question in this playbook, every time.

Start for free See pricing