bmad — BMad-Method

A structured Agile-style workflow harness that runs a Plan → Code → Review loop through a single /bmad-quick-dev slash command.

Upstream

Performance in this benchmark

Task       Mean score   95% CI              z
feature    125.13       [122.73, 127.37]    +0.215
bugfix     178.83       [175.83, 181.33]    +0.640
refactor   159.42       [155.33, 163.17]    +0.020

Rank 1 / 9 overall (combined z̄ = +0.292). bmad is in the top statistical tier on feature (tier: bmad, gstack, superpower, ecc, pure) and on bugfix (tier: ecc, bmad, pure, mindful, gstack), and in the top tier but near the bottom of it on refactor. The ordering inside the top-4 overall (bmad, ecc, pure, gstack) is not statistically distinguishable.
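
The combined z̄ is consistent with an unweighted mean of the three per-task z values reported for bmad. The source does not state the aggregation formula, so the check below is a minimal sketch under that assumption:

```python
from statistics import mean

# Per-task z-scores for bmad, copied from the performance figures above.
z_by_task = {"feature": 0.215, "bugfix": 0.640, "refactor": 0.020}

# Assumption: the combined z-bar is the unweighted mean across the three tasks.
combined_z = mean(z_by_task.values())
print(f"{combined_z:+.3f}")
```

Under that assumption the result rounds to the +0.292 quoted in the ranking line.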

Mechanism — what actually runs

How this benchmark invoked it

Exact PROMPT (from manual-bench.sh, with per-task intro):

/bmad-quick-dev Pick the Plan-Code-Review path — this is a <non-trivial feature | scoped bugfix | scoped refactor> in an existing <brownfield> codebase. <task-shape hint>

<SHARED_TASK>

Base model: claude-opus-4-6 (same for all nine tools).
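
The prompt template above amounts to a per-task template fill. The sketch below is illustrative only: `TASK_INTROS` and `build_prompt` are hypothetical names, not taken from manual-bench.sh. The task phrasings are copied from the trial prompts quoted on the timeline cards; the task-shape hint and shared task body stay as placeholders because the source elides them.

```python
# Hypothetical reconstruction of how the per-task prompt is assembled.
# Phrasings are taken from the prompts quoted in the trial timelines.
TASK_INTROS = {
    "feature": "a non-trivial feature in an existing codebase",
    "bugfix": "a scoped bugfix in an existing brownfield codebase",
    "refactor": "a scoped refactor in an existing brownfield codebase",
}


def build_prompt(task: str, hint: str, shared_task: str) -> str:
    """Fill the /bmad-quick-dev template for one task kind."""
    intro = (
        "/bmad-quick-dev Pick the Plan-Code-Review path — "
        f"this is {TASK_INTROS[task]}. {hint}"
    )
    # The shared task body follows the intro after a blank line.
    return f"{intro}\n\n{shared_task}"


# Placeholders are left unfilled on purpose.
print(build_prompt("bugfix", "<task-shape hint>", "<SHARED_TASK>"))
```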

What actually happened in the transcripts

Why it ranked 1 (noting the top-4 tie)

Strengths & failure modes

Strengths (transcript-grounded):

Failure modes (transcript-grounded):

References

Observed in trial timelines

bmad is the only tool whose skill content shows up as explicit Read events in the session (mean 3.5 unique skill files on feature, 6.5 on bugfix, 4.5 on refactor; range 2–8). Every other tool injects its skill content into the system prompt via the slash command, so no Read events fire. bmad’s step files (step-01 through step-05) are instead loaded at runtime as the workflow advances, which is why its transcripts read like a script: the skill content is materialised in-band.
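
The per-task means and the overall range quoted above can be reproduced from the "Skill files" counts on the trial cards in this document. A small check using only those counts:

```python
from statistics import mean

# Unique skill files read per trial, taken from the trial cards in this document.
skill_files = {
    "feature": [2, 2, 7, 3],
    "bugfix": [8, 5],
    "refactor": [6, 3],
}

# Mean per task, plus the min/max across every trial.
means = {task: mean(counts) for task, counts in skill_files.items()}
all_counts = [n for counts in skill_files.values() for n in counts]

print(means)
print(min(all_counts), max(all_counts))
```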

Detail: see the per-trial timeline files linked below.

Trial timelines

Per-trial event timelines auto-extracted from session-logs/*.jsonl — skill activations, plugin/skill file reads, subagents dispatched, code mutations, Bash usage:
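
As a rough idea of how skill-file Read events might be pulled out of a session log, consider the sketch below. The JSONL schema is an assumption: each line is taken to be a JSON object whose `message.content` list may hold `tool_use` items with a `name` and an `input.file_path`. The benchmark's actual extractor is not shown in the source, and `unique_skill_reads` is a hypothetical name.

```python
import json
from typing import Iterable

# Path fragments that mark a file as skill/plugin content, as seen on the trial cards.
SKILL_MARKERS = (".claude/skills/", "_bmad/")


def unique_skill_reads(lines: Iterable[str]) -> list[str]:
    """Return skill files touched by Read tool calls, in first-seen order."""
    seen: list[str] = []
    for raw in lines:
        raw = raw.strip()
        if not raw:
            continue  # skip blank lines
        event = json.loads(raw)
        message = event.get("message")
        if not isinstance(message, dict):
            continue
        content = message.get("content")
        if not isinstance(content, list):
            continue  # plain-text messages carry no tool calls
        for item in content:
            if not isinstance(item, dict):
                continue
            if item.get("type") != "tool_use" or item.get("name") != "Read":
                continue
            path = item.get("input", {}).get("file_path", "")
            if any(marker in path for marker in SKILL_MARKERS) and path not in seen:
                seen.append(path)
    return seen
```

With a real log this would be called as `unique_skill_reads(open(path, encoding="utf-8"))`; the length of the returned list corresponds to the "Skill files" count on a card, provided the assumed schema holds.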

Each card shows the subagents dispatched, skill activations, Bash command mix, and the final diff for one trial. Cards are listed in task order: feature (t1–t4), then bugfix (t1–t2), then refactor (t1–t2).
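
The "Bash command mix" buckets on each card suggest a first-match classifier over command strings. The patterns below are guesses for illustration only; the source lists the bucket names (tests, typecheck, lint/format, git ops, inspection, other) but not the rules behind them, and `classify` and `command_mix` are hypothetical names.

```python
import re
from collections import Counter

# Hypothetical patterns; order matters because the first match wins.
RULES = [
    ("tests", re.compile(r"\b(jest|vitest)\b|\bnx\s+test\b|\bnpm\s+(run\s+)?test\b")),
    ("typecheck", re.compile(r"\btsc\b")),
    ("lint/format", re.compile(r"\b(eslint|prettier)\b")),
    ("git ops", re.compile(r"^\s*git\s")),
    ("inspection", re.compile(r"^\s*(ls|cat|head|tail|wc|find|grep|rg)\b")),
]


def classify(command: str) -> str:
    """Return the first bucket whose pattern matches, else 'other'."""
    for bucket, pattern in RULES:
        if pattern.search(command):
            return bucket
    return "other"


def command_mix(commands) -> Counter:
    """Count commands per bucket, mirroring the card layout."""
    return Counter(classify(c) for c in commands)


print(command_mix(["git status", "npx jest td-cd", "npx tsc --noEmit", "echo done"]))
```

A first-match list keeps the precedence explicit: a command such as `git grep` lands in git ops rather than inspection because its rule comes first.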

t1 15:10 → 15:26 UTC · 15 min
2 commits · 12 files · +508

“Use bmad-quick-dev to handle this task. Pick the Plan-Code-Review path — this is a non-trivial feature in an existing codebase. Read the PRD at docs/infina-product-docs/docs/core-products/td-cd/user-logic/[PRD] [TD-CD] U…”

  • New files 5
  • Edits 18
  • Bash 52
  • Skills 1
  • Skill files 2
Bash command mix · 52 calls
  • tests 20
  • other 17
  • inspection 9
  • git ops 3
  • typecheck 3
Skill activations (1)
  • bmad-quick-dev — Plan-Code-Review path. Implement Mode 2 CD Batch for TD-CD end-to-end. PRD: docs/infina-product-docs/docs/core-products… at 15:10
Plugin/skill files read (2 unique)
  • .claude/skills/bmad-quick-dev/workflow.md
  • .claude/skills/bmad-quick-dev/step-01-clarify-and-route.md
New files created (5)
  • libs/core/src/domain/savings-cd/td-cd-mode2-price-calculator.spec.ts
  • libs/core/src/domain/savings-cd/td-cd-mode2-price-calculator.ts
  • libs/core/src/domain/savings-cd/td-cd-mode2.strategy.spec.ts
  • libs/core/src/domain/savings-cd/td-cd-mode2.strategy.ts
  • libs/core/src/port/service/td-cd-mode2-batch-resolver.port.ts
t2 14:31 → 15:06 UTC · 34 min
2 commits · 5 files · +587

“Use bmad-quick-dev to handle this task. Pick the Plan-Code-Review path — this is a non-trivial feature in an existing codebase. Read the PRD at docs/infina-product-docs/docs/core-products/td-cd/user-logic/[PRD] [TD-CD] U…”

  • Agents 1
  • New files 2
  • Edits 2
  • Bash 30
  • Skills 1
  • Skill files 2
  • Todos 7
Bash command mix · 30 calls
  • tests 12
  • other 11
  • typecheck 4
  • inspection 1
  • lint/format 1
  • git ops 1
Skill activations (1)
  • bmad-quick-dev — Plan-Code-Review path. Implement Mode 2 CD Batch for TD-CD product end-to-end. PRD: docs/infina-product-docs/docs/core-… at 14:32
Plugin/skill files read (2 unique)
  • .claude/skills/bmad-quick-dev/workflow.md
  • .claude/skills/bmad-quick-dev/step-01-clarify-and-route.md
Subagents dispatched (1)
  • Explore · Map Mode 1 TD-CD layer at 14:33
Subagent transcripts (1)
  • agent-a775876df985… — I'm about to implement Mode 2 CD Batch for the TD-CD product in this NestJS/NX monorepo at /Users/ra… [Bash×31, Read×24]
New files created (2)
  • libs/core/src/domain/savings-cd/td-cd-mode2.strategy.spec.ts
  • libs/core/src/domain/savings-cd/td-cd-mode2.strategy.ts
t3 04:38 → 04:48 UTC · 10 min
2 commits · 5 files · +553

“<local-command-caveat>Caveat: The messages below were generated by the user while running local commands. DO NOT respond to these messages or otherwise consider them in your response unless the user explicitly asks you t…”

  • New files 3
  • Edits 8
  • Bash 23
  • Skills 1
  • Skill files 7
Bash command mix · 23 calls
  • other 9
  • tests 9
  • git ops 3
  • typecheck 1
  • inspection 1
Skill activations (1)
  • bmad-quick-dev — Pick the Plan-Code-Review path — this is a non-trivial feature in an existing codebase. Read the PRD at docs/infina-pro… at 04:38
Plugin/skill files read (7 unique)
  • .claude/skills/bmad-quick-dev/workflow.md
  • _bmad/bmm/config.yaml
  • .claude/skills/bmad-quick-dev/step-01-clarify-and-route.md
  • .claude/skills/bmad-quick-dev/step-02-plan.md
  • .claude/skills/bmad-quick-dev/spec-template.md
  • .claude/skills/bmad-quick-dev/step-03-implement.md
  • .claude/skills/bmad-quick-dev/step-04-review.md
New files created (3)
  • _bmad-output/implementation-artifacts/spec-td-cd-mode2-batch.md
  • libs/core/src/domain/savings-cd/td-cd-mode2.strategy.spec.ts
  • libs/core/src/domain/savings-cd/td-cd-mode2.strategy.ts
t4 08:54 → 09:19 UTC · 25 min
2 commits · 7 files · +491

“<command-message>bmad-quick-dev</command-message> <command-name>/bmad-quick-dev</command-name> <command-args>Pick the Plan-Code-Review path — this is a non-trivial feature in an existing codebase. Read the PRD at docs/in…”

  • Agents 3
  • New files 2
  • Edits 11
  • Bash 29
  • Skill files 3
  • Todos 6
Bash command mix · 29 calls
  • tests 18
  • other 4
  • git ops 4
  • inspection 2
  • typecheck 1
Plugin/skill files read (3 unique)
  • .claude/skills/bmad-quick-dev/workflow.md
  • _bmad/bmm/config.yaml
  • .claude/skills/bmad-quick-dev/step-01-clarify-and-route.md
Subagents dispatched (3)
  • Explore · Explore Mode 1 TD-CD implementation at 08:55
  • Explore · Explore Mode 1 PRD for reference at 08:55
  • Explore · Find Mode 1 DI wiring at 09:02
Subagent transcripts (3)
  • agent-a4944196081c… — Find and read the Mode 1 TD-CD PRD document. It should be at a path like `docs/infina-product-docs/d… [Glob×2, Bash×2, Read×1]
  • agent-a58442e4e930… — Search the codebase for where TDCDMode1Strategy is instantiated or provided as a dependency. Look fo… [Read×19, Bash×16, Grep×7, Glob×5]
  • agent-ab21289cb0a8… — Thoroughly explore the Mode 1 TD-CD (savings CD) implementation in this codebase. I need to understa… [Read×30, Bash×23]
New files created (2)
  • libs/core/src/domain/savings-cd/td-cd-mode2.strategy.spec.ts
  • libs/core/src/domain/savings-cd/td-cd-mode2.strategy.ts
t1 16:40 → 17:09 UTC · 29 min
2 commits · 3 files · +222

“<command-message>bmad-quick-dev</command-message> <command-name>/bmad-quick-dev</command-name> <command-args>Pick the Plan-Code-Review path — this is a scoped bugfix in an existing brownfield codebase. Investigate before…”

  • Agents 4
  • New files 2
  • Edits 11
  • Bash 19
  • Skill files 8
Bash command mix · 19 calls
  • tests 6
  • other 5
  • git ops 5
  • inspection 2
  • lint/format 1
Plugin/skill files read (8 unique)
  • bmad-t1/.claude/skills/bmad-quick-dev/workflow.md
  • bmad-t1/_bmad/bmm/config.yaml
  • bmad-t1/.claude/skills/bmad-quick-dev/step-01-clarify-and-route.md
  • bmad-t1/.claude/skills/bmad-quick-dev/step-02-plan.md
  • bmad-t1/.claude/skills/bmad-quick-dev/spec-template.md
  • bmad-t1/.claude/skills/bmad-quick-dev/step-03-implement.md
  • bmad-t1/.claude/skills/bmad-quick-dev/step-04-review.md
  • bmad-t1/.claude/skills/bmad-quick-dev/step-05-present.md
Subagents dispatched (4)
  • Explore · Investigate savings-cd batch eligibility at 16:41
  • Blind adversarial review · Blind adversarial review at 16:57
  • Edge case review · Edge case review at 16:57
  • Acceptance auditor review · Acceptance auditor review at 16:57
Subagent transcripts (4)
  • agent-a1585943ab39… — You are an acceptance auditor. Verify the implementation matches the spec and bug report requirement… [Bash×34, Read×7, Grep×3, Glob×1]
  • agent-a4f2418311a0… — You are an edge case hunter reviewing a code change. Walk every branching path and boundary conditio… [Read×3, Glob×1]
  • agent-a92b111f975e… — You are a blind adversarial code reviewer. You have NO context about the project, no spec, no requir… [no tools]
  • agent-abeabb30b8f3… — Thoroughly explore the savings-cd codebase in /Users/randytran/Codes/ai-tool-benchmark/runs/shp2376/… [Read×15, Bash×7, Grep×4]
New files created (2)
  • bmad-t1/_bmad-output/implementation-artifacts/spec-shp-2376-deposit-maturity-fix.md
  • bmad-t1/libs/savings-cd/src/domain/savings-cd-batch-data-source.spec.ts
t2 16:43 → 16:49 UTC · 5 min
2 commits · 2 files · +148

“<command-message>bmad-quick-dev</command-message> <command-name>/bmad-quick-dev</command-name> <command-args>Pick the Plan-Code-Review path — this is a scoped bugfix in an existing brownfield codebase. Investigate before…”

  • Agents 1
  • New files 1
  • Bash 1
  • Skill files 5
Bash command mix · 1 call
  • other 1
Plugin/skill files read (5 unique)
  • bmad-t2/.claude/skills/bmad-quick-dev/workflow.md
  • bmad-t2/_bmad/bmm/config.yaml
  • bmad-t2/.claude/skills/bmad-quick-dev/step-01-clarify-and-route.md
  • bmad-t2/.claude/skills/bmad-quick-dev/step-02-plan.md
  • bmad-t2/.claude/skills/bmad-quick-dev/spec-template.md
Subagents dispatched (1)
  • Explore · Investigate SHP-2376 bug at 16:44
Subagent transcripts (1)
  • agent-a4da47e0c3e1… — Thoroughness: very thorough I'm investigating bug SHP-2376 in an NX monorepo at /Users/randytran/Cod… [Read×11, Grep×6, Bash×3, Glob×3]
New files created (1)
  • bmad-t2/_bmad-output/implementation-artifacts/spec-shp-2376-near-maturity-deposit-stuck.md
t1 03:52 → 04:12 UTC · 19 min
2 commits · 11 files · +129

“<command-message>bmad-quick-dev</command-message> <command-name>/bmad-quick-dev</command-name> <command-args>Pick the Plan-Code-Review path — this is a scoped refactor in an existing brownfield codebase. No behavior chan…”

  • Agents 1
  • New files 5
  • Edits 22
  • Bash 12
  • Skill files 6
Bash command mix · 12 calls
  • git ops 5
  • other 4
  • tests 2
  • inspection 1
Plugin/skill files read (6 unique)
  • bmad-t1/.claude/skills/bmad-quick-dev/workflow.md
  • bmad-t1/_bmad/bmm/config.yaml
  • bmad-t1/.claude/skills/bmad-quick-dev/step-01-clarify-and-route.md
  • bmad-t1/.claude/skills/bmad-quick-dev/step-02-plan.md
  • bmad-t1/.claude/skills/bmad-quick-dev/spec-template.md
  • bmad-t1/.claude/skills/bmad-quick-dev/step-03-implement.md
Subagents dispatched (1)
  • Explore · Investigate SHP-2317 refactor areas at 03:53
Subagent transcripts (1)
  • agent-a5d8ce588542… — I need a thorough investigation of a refactor in this NX monorepo. The task (SHP-2317) involves two… [Bash×23, Read×15, Grep×7, Glob×3]
New files created (5)
  • bmad-t1/_bmad-output/implementation-artifacts/spec-shp-2317-decouple-batch-binding.md
  • bmad-t1/libs/core/src/domain/savings-cd/td-cd-mode2.strategy.spec.ts
  • bmad-t1/libs/core/src/domain/savings-cd/td-cd-mode2.strategy.ts
  • bmad-t1/libs/core/src/model/cd-batch-info.model.ts
  • bmad-t1/libs/core/src/port/service/td-cd-mode-strategy.port.ts
t2 06:32 → 06:45 UTC · 12 min
2 commits · 10 files · +120

“<command-message>bmad-quick-dev</command-message> <command-name>/bmad-quick-dev</command-name> <command-args>Pick the Plan-Code-Review path — this is a scoped refactor in an existing brownfield codebase. No behavior chan…”

  • Agents 2
  • Edits 22
  • Bash 11
  • Skill files 3
Bash command mix · 11 calls
  • other 5
  • git ops 3
  • tests 2
  • inspection 1
Plugin/skill files read (3 unique)
  • bmad-t2/.claude/skills/bmad-quick-dev/workflow.md
  • bmad-t2/_bmad/bmm/config.yaml
  • bmad-t2/.claude/skills/bmad-quick-dev/step-01-clarify-and-route.md
Subagents dispatched (2)
  • Explore · Investigate Scheme model/entity at 06:32
  • Explore · Investigate TSSchemeSetting and strategy at 06:33
Subagent transcripts (2)
  • agent-a8fa3205ddb1… — In /Users/randytran/Codes/ai-tool-benchmark/runs/shp2317/bmad-t2, find everything related to: 1. `TS… [Read×11, Grep×6, Glob×1]
  • agent-af07011904c2… — In /Users/randytran/Codes/ai-tool-benchmark/runs/shp2317/bmad-t2, find everything related to the `Sc… [Bash×16, Read×13, Grep×5, Glob×4]