mirror of
https://github.com/obra/superpowers.git
synced 2026-05-07 09:39:05 +08:00
f2cbfbefebbfef77321e4c9abc9e949826bea9d7
19 Commits
| Author | SHA1 | Message | Date | |
|---|---|---|---|---|
|
|
f2cbfbefeb |
Release v5.1.0 (#1468)
* docs: add Codex App compatibility design spec (PRI-823) Design for making using-git-worktrees, finishing-a-development-branch, and subagent-driven-development skills work in the Codex App's sandboxed worktree environment. Read-only environment detection via git-dir vs git-common-dir comparison, ~48 lines across 4 files, zero breaking changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: address spec review feedback for PRI-823 Fix three Important issues from spec review: - Clarify Step 1.5 placement relative to existing Steps 2/3 - Re-derive environment state at cleanup time instead of relying on earlier skill output - Acknowledge pre-existing Step 5 cleanup inconsistency Also: precise step references, exact codex-tools.md content, clearer Integration section update instructions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: address team review feedback for PRI-823 spec - Add commit SHA + data loss warning to handoff payload (HIGH) - Add explicit commit step before handoff (HIGH) - Remove misleading "mark as externally managed" from Path B - Add executing-plans 1-line edit (was missing) - Add branch name derivation rules - Add conditional UI language for non-App environments - Add sandbox fallback for permission errors - Add STOP directive after Step 0 reporting Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: clarify executing-plans in What Does NOT Change section Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: add cleanup guard test (#5) and sandbox fallback test (#10) to spec Both tests address real risk scenarios: - #5: cleanup guard bug would delete Codex App's own worktree (data loss) - #10: Local thread sandbox fallback needs manual Codex App validation Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: add implementation plan for Codex App compatibility (PRI-823) 8 tasks covering: environment detection in using-git-worktrees, Step 1.5 + cleanup guard in finishing-a-development-branch, Integration line updates, codex-tools.md docs, automated tests, and final verification. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs(codex-tools): add named agent dispatch mapping for Codex (#647) * fix(writing-skills): correct false 'only two fields' frontmatter claim (#882) * Replace subagent review loops with lightweight inline self-review The subagent review loop (dispatching a fresh agent to review plans/specs) doubled execution time (~25 min overhead) without measurably improving plan quality. Regression testing across 5 versions (v3.6.0 through v5.0.4) with 5 trials each showed identical plan sizes, task counts, and quality scores regardless of whether the review loop ran. Changes: - writing-plans: Replace subagent Plan Review Loop with inline Self-Review checklist (spec coverage, placeholder scan, type consistency) - writing-plans: Add explicit "No Placeholders" section listing plan failures (TBD, vague descriptions, undefined references, "similar to Task N") - brainstorming: Replace subagent Spec Review Loop with inline Spec Self-Review (placeholder scan, internal consistency, scope check, ambiguity check) - Both skills now use "look at it with fresh eyes" framing Testing: 5 trials with the new skill show self-review catches 3-5 real bugs per run (spawn positions, API mismatches, seed bugs, grid indexing) in ~30s instead of ~25 min. Remaining defects are comparable to the subagent approach. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Revert "Replace subagent review loops with lightweight inline self-review" This reverts commit |
||
|
|
9ccce3bf07 |
Add context isolation principle to all delegation skills
Subagents should never inherit the parent session's context or history. The dispatcher constructs exactly what each subagent needs, keeping both sides focused: the subagent on its task, the controller on coordination. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> |
||
|
|
daa3fb2322 |
Add architecture guidance and capability-aware escalation to skills
Add design-for-isolation and working-in-existing-codebases guidance to brainstorming. Add file size awareness and escalation prompts to SDD implementer and code quality reviewer. Writing-plans gets architecture section sizing guidance. Spec and plan reviewers get architecture and file size checks. |
||
|
|
f57638a747 |
refactor: restructure specs and plans directories
- Specs (brainstorming output) now go to docs/superpowers/specs/ - Plans (writing-plans output) now go to docs/superpowers/plans/ - User preferences for locations override these defaults - Update all skill references and test files Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
||
|
|
c7816ee2a6 |
docs: change main branch red flag to require explicit user consent
Instead of prohibiting main branch work entirely, allow it with explicit user consent. This is more flexible while still ensuring users are aware of the implications. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
||
|
|
b63d485955 |
docs(subagent-driven-development): add main branch red flag to Never list
Add explicit warning against starting implementation on main/master branch without first using a worktree for isolation. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
||
|
|
fa3f46d4e9 |
docs(subagent-driven-development): add using-git-worktrees as required skill
Adds using-git-worktrees as the first required workflow skill in the Integration section. This makes explicit that an isolated workspace should be set up before starting subagent-driven development. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
||
|
|
2a19be0b78 |
Document skill description trap: descriptions override flowcharts
Testing revealed that skill descriptions summarizing workflow cause
Claude to follow the description instead of reading the skill body.
- A description saying "code review between tasks" caused ONE review
- The flowchart clearly showed TWO reviews (spec compliance + quality)
- Minimal description ("Use when...") correctly deferred to flowchart
Updated writing-skills with:
- "Description = When to Use, NOT What the Skill Does" section
- Cautionary tale about this actual failure
- Examples of good (triggers only) vs bad (workflow summary) descriptions
Updated subagent-driven-development:
- Removed workflow summary from description
- Now just: "Use when executing implementation plans..."
Updated test runner:
- Added --dangerously-skip-permissions for automated testing
|
||
|
|
ed06dcbe27 | Fix skill description to match flowchart: two-stage review (spec then quality) | ||
|
|
a9b94ae5d7 |
Rewrite subagent-driven-development with executable flowcharts
- Add graphviz/dot flowcharts as primary executable instructions - Extract prompt templates to separate collateral files: - ./implementer-prompt.md - ./spec-reviewer-prompt.md - ./code-quality-reviewer-prompt.md - Use qualified skill names (superpowers:skill-name) - Make flowchart labels explicit and action-oriented - Add "When to Use" decision flowchart - Keep detailed prose as supporting content |
||
|
|
466332f698 |
Emphasize spec compliance review must complete before code quality
Made sequencing explicit: - Spec compliance review loop must fully complete (✅) before code quality - Added "Do NOT proceed to code quality review until spec compliance is ✅" - Code Quality Review section starts with "Only run after spec compliance review is complete" - Red Flags: Added "Start code quality review before spec compliance is ✅ (wrong order)" This ensures we don't waste time reviewing code quality of the wrong implementation. Verify they built the right thing first, then verify they built it well. |
||
|
|
87afde2390 |
Make spec compliance reviewer skeptical and verification-focused
The spec compliance reviewer now: - Does NOT trust implementer's report - Is warned implementer finished suspiciously quickly - MUST verify everything by reading actual code - Compares implementation to requirements line by line - Reports issues with file:line references Key additions: - "Do Not Trust the Report" section - Explicit DO NOT / DO lists - "Verify by reading code, not by trusting report" - Changed "What Was Implemented" to "What Implementer Claims They Built" This prevents rubber-stamping and ensures independent verification of spec compliance against actual codebase. |
||
|
|
2a6a40fe10 |
Improve subagent-driven-development workflow
Key improvements based on feedback: 1. Read plan once, not per task - Extract all tasks in Step 1 - Reference extracted tasks in Step 2 - Eliminates redundant file reading 2. Enable questions during work - Not just before, but also while working - "It's always OK to ask questions" - Don't guess or make assumptions 3. Add self-review before reporting - Completeness: implemented everything? - Quality: best work, clear names? - Discipline: avoided overbuilding? - Testing: comprehensive, real behavior? - Catches issues before handoff 4. Add spec compliance review - Separate reviewer checks: built the right thing? - Flags missing requirements - Flags extra/unneeded work - Flags misunderstandings - Runs BEFORE code quality review 5. Make reviews loops, not one-shot - Reviewer finds issues - Implementer fixes - Reviewer reviews again - Repeat until approved - Applies to both spec and code quality Two-stage review process: - Stage 1: Spec compliance (right thing?) - Stage 2: Code quality (built well?) This enables subagents to do their best work with clear requirements, opportunities to clarify, self-critique, and thorough review loops. |
||
|
|
97ce1f8fe0 |
Update subagent-driven-development: controller provides full task text
Changed workflow so controller provides complete task context directly rather than making subagent read plan file. Key changes: - Controller reads plan and extracts full task text - Controller provides scene-setting context (dependencies, architecture) - Subagent receives complete information in prompt (no file reading) - Subagent can ask clarifying questions before beginning work - Controller handles questions/concerns before subagent proceeds Benefits: - No file reading overhead for subagent - Controller curates exactly what context is needed - Questions surfaced before work begins (not after) - Subagent has complete information to do best work This enables subagents to start with clarity rather than ambiguity. |
||
|
|
e3d881b7b6 |
Release v3.2.1: Add code-reviewer agent to plugin
Fixes #55 Added superpowers:code-reviewer agent to plugin's agents/ directory. Previously, skills referenced a code-reviewer agent that users needed to configure personally, causing failures for new users. Changes: - New: agents/code-reviewer.md - Agent definition with systematic review checklist - Updated: skills/requesting-code-review/SKILL.md - Use superpowers:code-reviewer - Updated: skills/subagent-driven-development/SKILL.md - Use superpowers:code-reviewer - Version bump: 3.2.0 -> 3.2.1 - Added: .claude-plugin/marketplace.json for local development testing |
||
|
|
79436abffa |
Update all superpowers skill references to use namespace prefix
Skills are now namespaced as superpowers:<name> when referenced. Updated all REQUIRED SUB-SKILL, RECOMMENDED SUB-SKILL, and REQUIRED BACKGROUND references to use the superpowers: prefix. Also added -design suffix to brainstorming skill's design document filename to distinguish from implementation plan documents. Files updated: - brainstorming: Added -design suffix, updated skill references - executing-plans: Updated finishing-a-development-branch reference - subagent-driven-development: Updated finishing-a-development-branch reference - systematic-debugging: Updated root-cause-tracing and test-driven-development references - testing-skills-with-subagents: Updated test-driven-development reference - writing-plans: Updated executing-plans and subagent-driven-development references - writing-skills: Updated test-driven-development, systematic-debugging, and testing-skills-with-subagents references |
||
|
|
141953a4be |
Improve skill cross-references for clarity and compliance
Update all skill references to use explicit requirement markers: - REQUIRED BACKGROUND: For prerequisite understanding - REQUIRED SUB-SKILL: For mandatory workflow dependencies - Complementary skills: For optional but helpful related skills Changes: - Remove old path format (skills/collaboration/X → X) - Add explicit "REQUIRED" markers to make dependencies clear - Update Integration sections with categorized skill relationships - Fix non-existent skill references - Update cross-reference documentation in writing-skills This makes it immediately clear which skills MUST be used vs optional references, helping Claude understand and comply with skill dependencies. |
||
|
|
48410c7f19 |
Standardize skill frontmatter names to lowercase and kebab-case
- Update all 20 skill frontmatter names to match their directory names in lowercase - Fix defense-in-depth name (was Defense-in-Depth-Validation) - Fix receiving-code-review name (was Code-Review-Reception) - Update all skill announcements and cross-references to use lowercase names - Update commands redirects to reference lowercase skill names Ensures consistent naming: skill directory names, frontmatter names, and documentation references all use lowercase kebab-case format (e.g., brainstorming, test-driven-development) |
||
|
|
9c9547cc04 | Now that skills are a first-class thing in Claude Code, restore them to the primary plugin |