fix: prevent plan mode layering with planning skills

Bot consistently enters EnterPlanMode before invoking writing-plans, creating redundant approval gates and read-only restrictions. Add red flag entry to using-superpowers and explicit warning to writing-plans skill. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Remove session scope language from SUBAGENT-STOP
2026-04-22 01:19:04 +08:00 · 2026-02-11 21:29:10 -08:00 · 2026-02-11 16:10:17 -08:00 · 2026-02-11 16:10:17 -08:00 · 2026-02-11 16:10:17 -08:00 · 2026-02-11 16:10:17 -08:00
10 changed files with 84 additions and 31 deletions
--- a/.gitattributes
+++ b/.gitattributes
@@ -1,5 +1,6 @@
 # Ensure shell scripts always have LF line endings
 *.sh text eol=lf
+hooks/session-start text eol=lf

 # Ensure the polyglot wrapper keeps LF (it's parsed by both cmd and bash)
 *.cmd text eol=lf
--- a/docs/README.codex.md
+++ b/docs/README.codex.md
@@ -32,6 +32,12 @@ Fetch and follow instructions from https://raw.githubusercontent.com/obra/superp

 3. Restart Codex.

+4. **For subagent skills** (optional): Skills like `dispatching-parallel-agents` and `subagent-driven-development` require Codex's collab feature. Add to your Codex config:
+   ```toml
+   [features]
+   collab = true
+   ```
+
 ### Windows

 Use a junction instead of a symlink (works without Developer Mode):
--- a/hooks/hooks.json
+++ b/hooks/hooks.json
@@ -6,7 +6,7 @@
        "hooks": [
          {
            "type": "command",
-            "command": "${CLAUDE_PLUGIN_ROOT}/hooks/session-start.sh",
+            "command": "${CLAUDE_PLUGIN_ROOT}/hooks/run-hook.cmd session-start",
            "async": true
          }
        ]
--- a/hooks/run-hook.cmd
+++ b/hooks/run-hook.cmd
@@ -1,43 +1,46 @@
 : << 'CMDBLOCK'
@echo off
-REM ============================================================================
-REM DEPRECATED: This polyglot wrapper is no longer used as of Claude Code 2.1.x
-REM ============================================================================
+REM Cross-platform polyglot wrapper for hook scripts.
+REM On Windows: cmd.exe runs the batch portion, which finds and calls bash.
+REM On Unix: the shell interprets this as a script (: is a no-op in bash).
 REM
-REM Claude Code 2.1.x changed the Windows execution model for hooks:
+REM Hook scripts use extensionless filenames (e.g. "session-start" not
+REM "session-start.sh") so Claude Code's Windows auto-detection -- which
+REM prepends "bash" to any command containing .sh -- doesn't interfere.
 REM
-REM   Before (2.0.x): Hooks ran with shell:true, using the system default shell.
-REM                   This wrapper provided cross-platform compatibility by
-REM                   being both a valid .cmd file (Windows) and bash script.
-REM
-REM   After (2.1.x):  Claude Code now auto-detects .sh files in hook commands
-REM                   and prepends "bash " on Windows. This broke the wrapper
-REM                   because the command:
-REM                     "run-hook.cmd" session-start.sh
-REM                   became:
-REM                     bash "run-hook.cmd" session-start.sh
-REM                   ...and bash cannot execute a .cmd file.
-REM
-REM The fix: hooks.json now calls session-start.sh directly. Claude Code 2.1.x
-REM handles the bash invocation automatically on Windows.
-REM
-REM This file is kept for reference and potential backward compatibility.
-REM ============================================================================
-REM
-REM Original purpose: Polyglot wrapper to run .sh scripts cross-platform
 REM Usage: run-hook.cmd <script-name> [args...]
-REM The script should be in the same directory as this wrapper

 if "%~1"=="" (
    echo run-hook.cmd: missing script name >&2
    exit /b 1
 )
-"C:\Program Files\Git\bin\bash.exe" -l "%~dp0%~1" %2 %3 %4 %5 %6 %7 %8 %9
-exit /b
+
+set "HOOK_DIR=%~dp0"
+
+REM Try Git for Windows bash in standard locations
+if exist "C:\Program Files\Git\bin\bash.exe" (
+    "C:\Program Files\Git\bin\bash.exe" "%HOOK_DIR%%~1" %2 %3 %4 %5 %6 %7 %8 %9
+    exit /b %ERRORLEVEL%
+)
+if exist "C:\Program Files (x86)\Git\bin\bash.exe" (
+    "C:\Program Files (x86)\Git\bin\bash.exe" "%HOOK_DIR%%~1" %2 %3 %4 %5 %6 %7 %8 %9
+    exit /b %ERRORLEVEL%
+)
+
+REM Try bash on PATH (e.g. user-installed Git Bash, MSYS2, Cygwin)
+where bash >nul 2>nul
+if %ERRORLEVEL% equ 0 (
+    bash "%HOOK_DIR%%~1" %2 %3 %4 %5 %6 %7 %8 %9
+    exit /b %ERRORLEVEL%
+)
+
+REM No bash found - exit silently rather than error
+REM (plugin still works, just without SessionStart context injection)
+exit /b 0
 CMDBLOCK

-# Unix shell runs from here
-SCRIPT_DIR="$(cd "$(dirname "$0")" && pwd)"
+# Unix: run the named script directly
+SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]:-$0}")" && pwd)"
 SCRIPT_NAME="$1"
 shift
-"${SCRIPT_DIR}/${SCRIPT_NAME}" "$@"
+exec bash "${SCRIPT_DIR}/${SCRIPT_NAME}" "$@"
--- a/hooks/session-start.sh
+++ b/hooks/session-start.sh
--- a/skills/brainstorming/SKILL.md
+++ b/skills/brainstorming/SKILL.md
@@ -13,7 +13,9 @@ Start by understanding the current project context, then ask questions one at a

 **Understanding the idea:**
 - Check out the current project state first (files, docs, recent commits)
- Ask questions one at a time to refine the idea
+- Before asking detailed questions, assess scope: if the request describes multiple independent subsystems (e.g., "build a platform with chat, file storage, billing, and analytics"), flag this immediately. Don't spend questions refining details of a project that needs to be decomposed first.
+- If the project is too large for a single spec, help the user decompose into sub-projects: what are the independent pieces, how do they relate, what order should they be built? Then brainstorm the first sub-project through the normal design flow. Each sub-project gets its own spec → plan → implementation cycle.
+- For appropriately-scoped projects, ask questions one at a time to refine the idea
 - Prefer multiple choice questions when possible, but open-ended is fine too
 - Only one question per message - if a topic needs more exploration, break it into multiple questions
 - Focus on understanding: purpose, constraints, success criteria
--- a/skills/brainstorming/spec-document-reviewer-prompt.md
+++ b/skills/brainstorming/spec-document-reviewer-prompt.md
@@ -23,6 +23,7 @@ Task tool (general-purpose):
    | Consistency | Internal contradictions, conflicting requirements |
    | Clarity | Ambiguous requirements |
    | YAGNI | Unrequested features, over-engineering |
+    | Scope | Focused enough for a single plan — not covering multiple independent subsystems |
    | Architecture | Units with clear boundaries, well-defined interfaces, independently understandable and testable |

    ## CRITICAL
--- a/skills/using-superpowers/SKILL.md
+++ b/skills/using-superpowers/SKILL.md
@@ -3,6 +3,10 @@ name: using-superpowers
 description: Use when starting any conversation - establishes how to find and use skills, requiring Skill tool invocation before ANY response including clarifying questions
 ---

+<SUBAGENT-STOP>
+If you were dispatched as a subagent to execute a specific task, skip this skill.
+</SUBAGENT-STOP>
+
 <EXTREMELY-IMPORTANT>
 If you think there is even a 1% chance a skill might apply to what you are doing, you ABSOLUTELY MUST invoke the skill.

@@ -27,6 +31,10 @@ If CLAUDE.md says "don't use TDD" and a skill says "always use TDD," follow CLAU

 **In other environments:** Check your platform's documentation for how skills are loaded.

+## Platform Adaptation
+
+Skills use Claude Code tool names. Non-CC platforms: see `references/codex-tools.md` for tool equivalents.
+
 # Using Skills

 ## The Rule
@@ -73,6 +81,7 @@ These thoughts mean STOP—you're rationalizing:
 | "I'll just do this one thing first" | Check BEFORE doing anything. |
 | "This feels productive" | Undisciplined action wastes time. Skills prevent this. |
 | "I know what that means" | Knowing the concept ≠ using the skill. Invoke it. |
+| "Let me enter plan mode first" | If a skill handles planning (writing-plans, brainstorming), use the skill directly. EnterPlanMode is redundant — never layer it with a planning skill. |

 ## Skill Priority

--- a/skills/using-superpowers/references/codex-tools.md
+++ b/skills/using-superpowers/references/codex-tools.md
@@ -0,0 +1,25 @@
+# Codex Tool Mapping
+
+Skills use Claude Code tool names. When you encounter these in a skill, use your platform equivalent:
+
+| Skill references | Codex equivalent |
+|-----------------|------------------|
+| `Task` tool (dispatch subagent) | `spawn_agent` |
+| Multiple `Task` calls (parallel) | Multiple `spawn_agent` calls |
+| Task returns result | `wait` |
+| Task completes automatically | `close_agent` to free slot |
+| `TodoWrite` (task tracking) | `update_plan` |
+| `Skill` tool (invoke a skill) | Skills load natively — just follow the instructions |
+| `Read`, `Write`, `Edit` (files) | Use your native file tools |
+| `Bash` (run commands) | Use your native shell tools |
+
+## Subagent dispatch requires collab
+
+Add to your Codex config (`~/.codex/config.toml`):
+
+```toml
+[features]
+collab = true
+```
+
+This enables `spawn_agent`, `wait`, and `close_agent` for skills like `dispatching-parallel-agents` and `subagent-driven-development`.
--- a/skills/writing-plans/SKILL.md
+++ b/skills/writing-plans/SKILL.md
@@ -5,6 +5,8 @@ description: Use when you have a spec or requirements for a multi-step task, bef

 # Writing Plans

+**IMPORTANT:** Invoke this skill directly — do NOT use EnterPlanMode or platform plan mode. This skill has its own workflow and approval checkpoint (execution handoff). Layering plan mode on top is redundant and restrictive.
+
 ## Overview

 Write comprehensive implementation plans assuming the engineer has zero context for our codebase and questionable taste. Document everything they need to know: which files to touch for each task, code, testing, docs they might need to check, how to test it. Give them the whole plan as bite-sized tasks. DRY. YAGNI. TDD. Frequent commits.
@@ -18,6 +20,10 @@ Assume they are a skilled developer, but know almost nothing about our toolset o
 **Save plans to:** `docs/superpowers/plans/YYYY-MM-DD-<feature-name>.md`
 - (User preferences for plan location override this default)

+## Scope Check
+
+If the spec covers multiple independent subsystems, it should have been broken into sub-project specs during brainstorming. If it wasn't, suggest breaking this into separate plans — one per subsystem. Each plan should produce working, testable software on its own.
+
 ## File Structure

 Before defining tasks, map out which files will be created or modified and what each one is responsible for. This is where decomposition decisions get locked in.
Author	SHA1	Message	Date
Drew Ritter	7fecd85842	fix: prevent plan mode layering with planning skills Bot consistently enters EnterPlanMode before invoking writing-plans, creating redundant approval gates and read-only restrictions. Add red flag entry to using-superpowers and explicit warning to writing-plans skill. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 21:29:10 -08:00
Drew Ritter	d1e39450c0	Remove session scope language from SUBAGENT-STOP Drop 'top-level sessions only' which may cause the model to treat using-superpowers as a first-turn-only skill, breaking skill chaining on follow-up turns. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 16:10:17 -08:00
Drew Ritter	fa827dd12a	Soften SUBAGENT-STOP to only skip this skill, not suppress all behavior Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 16:10:17 -08:00
Drew Ritter	998be8aacd	Remove skill override directives from dispatch templates Let the SUBAGENT-STOP gate in using-superpowers handle skill leakage instead of per-template directives. This avoids blocking non-superpowers skills that users may want subagents to use. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 16:10:17 -08:00
Drew Ritter	39ae6ec2fb	Move Codex tool mapping to progressive disclosure reference file Addresses Jesse's review feedback on PR #450: - Move inline routing table from using-superpowers to references/codex-tools.md, leveraging Codex's native progressive disclosure for companion files - Narrow SUBAGENT-STOP from "Do not invoke skills" to "Do not invoke superpowers skills" so subagents can still use non-superpowers skills Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 16:10:17 -08:00
Drew Ritter	a901c14c68	Sharpen subagent skill override directive in dispatch templates Replace verbose scope explanations with a direct override statement that explicitly claims priority over prior guidance (i.e. the using-superpowers 1% rule injected by Codex's skill discovery). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 16:10:17 -08:00
Drew Ritter	4c78b2dad1	Document collab feature requirement for Codex subagent skills Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 16:10:17 -08:00
Drew Ritter	e014832137	Prevent Codex subagent skill leakage via gate check and dispatch routing table Codex subagents inherit filesystem access and can discover superpowers skills via native discovery. Without guidance, they activate the 1% rule and invoke full skill workflows instead of executing their assigned task. - Add SUBAGENT-STOP gate check above the 1% rule in using-superpowers - Add Codex dispatch routing table (spawn_agent/wait/close_agent) - Add scope directives to all 4 subagent dispatch templates Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 16:10:17 -08:00
Jesse Vincent	4d54210a3c	fix: restore polyglot wrapper to fix Windows hook window spawning Claude Code spawns hook commands with shell:true + windowsHide:true, but on Windows the execution chain cmd.exe -> bash.exe causes Git Bash (MSYS2) to allocate its own console window, bypassing the hide flag. This creates visible terminal windows that steal focus on every SessionStart event (startup, resume, clear, compact). The fix: - Rename session-start.sh to session-start (no extension) so Claude Code's .sh auto-detection regex doesn't fire and prepend "bash" - Restore run-hook.cmd polyglot wrapper to control bash invocation on Windows (tries known Git Bash paths, then PATH, then exits silently if no bash found) - On Unix, the polyglot's shell portion runs the script directly This avoids Claude Code's broken .sh auto-prepend, gives us control over how bash is invoked on Windows, and gracefully handles missing bash instead of erroring. Addresses: #440, #414, #354, #417, #293 Upstream: anthropics/claude-code#14828	2026-02-10 18:34:58 -08:00
Jesse Vincent	8abf9d0b02	Merge pull request #443 from obra/wip/project-decomposition Add project-level scope assessment to brainstorming pipeline	2026-02-10 10:42:20 -08:00
Jesse Vincent	a1a1ae5519	Merge pull request #441 from obra/wip/architecture-guidance Add architecture guidance and capability-aware escalation	2026-02-10 10:40:57 -08:00
Jesse Vincent	84cd6e7c20	fix: move scope assessment into understanding phase Testing showed the model skipped scope assessment when it was a separate step after "Understanding the idea." Inlining it as the first thing in understanding ensures it fires before detailed questions.	2026-02-08 16:50:33 -08:00
Jesse Vincent	eac29a909f	feat: add project-level scope assessment to brainstorming pipeline Brainstorming now assesses whether a project is too large for a single spec and helps decompose into sub-projects. Spec reviewer checks scope. Writing-plans has a backstop if brainstorming missed it.	2026-02-08 16:50:33 -08:00