superpowers

mirror of https://github.com/obra/superpowers.git synced 2026-04-30 14:09:04 +08:00

Author	SHA1	Message	Date
Jesse Vincent	e795530c23	fix(tests): make SDD integration test actually run its assertions The SDD integration test silently bailed before printing any verification results. Three independent bugs caused this: 1. `WORKING_DIR_ESCAPED` was computed from `$SCRIPT_DIR/../..` without resolving `..` segments. The resulting "directory" name contained literal `..` so `find` was looking in a path that doesn't exist. 2. With `set -euo pipefail`, the `find ... | sort -r | head -1` pipeline could exit non-zero (SIGPIPE on the producer when head closes early), killing the script silently before assertions ran. 3. The `claude -p` invocation never passed `--plugin-dir`, so it loaded the installed plugin instead of the working tree. Local edits to skills under test were not actually being tested. Other adjustments: - Run claude from inside the unique TEST_PROJECT directory instead of from the plugin root, so its session JSONL lives in its own `~/.claude/projects/` folder and doesn't race other concurrent claude sessions for "most recent file". - Use the same character-normalization claude does (every non-alphanumeric becomes `-`) when computing the session dir name; macOS-resolved `/private/var/...` paths and tmp dirs with `.`/`_` in their names need this to round-trip correctly. - Accept either `"name":"Agent"` or `"name":"Task"` in the subagent count: the harness renamed the tool but the test wasn't updated. Verified on this branch: all six verification tests now pass against a real end-to-end SDD run (skill invoked, 7 subagents dispatched, 6 TodoWrite calls, working code produced, tests pass, no extra features).	2026-04-28 12:20:31 -07:00
Jesse Vincent	f57638a747	refactor: restructure specs and plans directories - Specs (brainstorming output) now go to docs/superpowers/specs/ - Plans (writing-plans output) now go to docs/superpowers/plans/ - User preferences for locations override these defaults - Update all skill references and test files Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-03-06 13:01:31 -08:00
Jesse Vincent	faa65e7163	Add token usage analysis to subagent-driven-development test - Rewrote analyze-token-usage.py to parse main session file correctly - Extracts usage from toolUseResult fields for each subagent - Shows breakdown by agent with descriptions - Integrated into test-subagent-driven-development-integration.sh - Displays token usage automatically after each test run	2025-12-17 16:44:50 -08:00
Jesse Vincent	24ca8cd9d5	fix: verify skill usage via session transcript not text output The skill instructions are internal and don't appear in user-facing output. Updated verification to parse the session JSONL transcript and check for actual tool usage: - Skill tool invocation - Task tool (subagents) - TodoWrite (tracking) - Implementation results	2025-12-17 16:44:50 -08:00
Jesse Vincent	8fbeca830a	test: use bypassPermissions mode for unrestricted testing dontAsk mode was auto-denying Write tool. Use bypassPermissions instead to allow full tool access in this controlled test environment.	2025-12-17 16:44:50 -08:00
Jesse Vincent	67de772d7f	test: auto-approve permissions with --permission-mode dontAsk Headless tests need automatic permission approval to write files. Using dontAsk mode to auto-approve permissions for test directory.	2025-12-17 16:44:50 -08:00
Jesse Vincent	cf72863792	test: add --add-dir flag for temp directory access Claude needs explicit permission to access the temp test directory. Added --add-dir flag to grant access to the test project.	2025-12-17 16:44:50 -08:00
Jesse Vincent	baa23b16bb	test: show Claude output in real-time during integration test Use tee instead of redirection so test output is visible during execution while still being saved to file for analysis.	2025-12-17 16:44:50 -08:00
Jesse Vincent	0aba33be1c	fix: run integration test from superpowers dir to access local dev skills The superpowers-dev marketplace makes skills available only when running from the plugin directory. Updated test to run claude from superpowers directory while working on the test project.	2025-12-17 16:44:50 -08:00
Jesse Vincent	06310d6f5f	Fix tests to use --allowed-tools flag Claude Code headless mode requires --allowed-tools flag to actually execute tool calls. Without it, Claude only responds as if it's doing things but doesn't actually use tools. Changes: - Updated run_claude helper to accept allowed_tools parameter - Updated integration test to use --allowed-tools=all - This enables actual tool execution (Write, Task, Bash, etc.) Now the integration test should actually execute the workflow instead of just talking about it.	2025-12-17 16:44:50 -08:00
Jesse Vincent	dc11a093c3	Fix syntax error in integration test Simplified command substitution to avoid shell parsing issues. Instead of nested heredoc in command substitution, write prompt to file first then read it.	2025-12-17 16:44:50 -08:00
Jesse Vincent	fa946ae465	Add integration test for subagent-driven-development Created full end-to-end integration test that executes a real plan and verifies the new workflow improvements actually work. New test: test-subagent-driven-development-integration.sh - Creates real Node.js test project - Generates implementation plan (2 tasks) - Executes using subagent-driven-development skill - Verifies 8 key behaviors: 1. Plan read once at beginning (not per task) 2. Full task text provided to subagents (not file reading) 3. Subagents perform self-review 4. Spec compliance review before code quality 5. Spec reviewer reads code independently 6. Working implementation produced 7. Tests pass 8. No extra features added (spec compliance) Integration tests are opt-in (--integration flag) due to 10-30 min runtime. Updated run-skill-tests.sh: - Added --integration flag - Separates fast tests from integration tests - Shows note when integration tests skipped Updated README with integration test documentation. Run with: `./run-skill-tests.sh` (fast tests only) or `./run-skill-tests.sh --integration` (include integration tests)	2025-12-17 16:44:50 -08:00
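
The subagent check mentioned in commit e795530c23 (accept either `"name":"Agent"` or `"name":"Task"` when counting dispatches) can be illustrated with a short Python sketch. This is not the test's actual code, which is a shell script. The function name `count_subagent_dispatches` and the sample transcript lines are hypothetical, and a real session JSONL may serialize these fields with different spacing.

```python
from typing import Iterable

# Either tool name counts as a subagent dispatch, since the commit message
# says the harness renamed the tool. The exact marker strings are taken from
# the commit message; everything else here is an illustrative assumption.
SUBAGENT_MARKERS = ('"name":"Agent"', '"name":"Task"')


def count_subagent_dispatches(lines: Iterable[str]) -> int:
    """Count transcript lines containing at least one subagent marker."""
    return sum(
        1 for line in lines
        if any(marker in line for marker in SUBAGENT_MARKERS)
    )


if __name__ == "__main__":
    # Hypothetical transcript lines, not real session output.
    sample = [
        '{"type":"tool_use","name":"Task","input":{}}',
        '{"type":"tool_use","name":"Agent","input":{}}',
        '{"type":"tool_use","name":"TodoWrite","input":{}}',
    ]
    print(count_subagent_dispatches(sample))  # prints 2
```

Matching on raw substrings mirrors a grep-style count; parsing each line as JSON would be stricter, but it requires knowing the transcript schema, which the commit log does not describe.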

12 Commits
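
The session directory naming fix in commit e795530c23 (resolve `..` segments first, then replace every non-alphanumeric character with `-`) can be sketched in Python as follows. This is a minimal sketch, not the test's shell implementation. The function names `escape_path` and `session_dir_name` are hypothetical, and treating "alphanumeric" as ASCII `[A-Za-z0-9]` is an assumption.

```python
import os
import re


def escape_path(resolved_path: str) -> str:
    """Replace every character outside [A-Za-z0-9] with '-'.

    Assumption: "non-alphanumeric" means non-ASCII-alphanumeric.
    """
    return re.sub(r"[^A-Za-z0-9]", "-", resolved_path)


def session_dir_name(path: str) -> str:
    """Resolve '..' segments and symlinks, then escape the result.

    Resolving first matters: escaping an unresolved path would leave
    traces of the '..' segments in the resulting name.
    """
    return escape_path(os.path.realpath(path))


if __name__ == "__main__":
    # A path with '.' and '_' in a component, as in the commit message.
    print(escape_path("/private/var/tmp.x_y/proj"))
    # prints -private-var-tmp-x-y-proj
```

`os.path.realpath` is used here because it also follows symlinks, which covers the macOS case where a temp path resolves under `/private/var/...`.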