superpowers/evals/scenarios/worktree-codex-app-detached-head.yaml
Jesse Vincent 3b412a3836 Lift drill into evals/ at 013fcb8b7dbefd6d3fa4653493e5d2ec8e7f985b
rsync of obra/drill@013fcb8b7d into superpowers/evals/, excluding
.git/, .venv/, results/, .env/, __pycache__/, *.egg-info/,
.private-journal/.

The drill repo is unaffected by this commit; archival is a separate
manual step after this PR merges.

Source SHA recorded at evals/.drill-source-sha for divergence
detection.
2026-05-06 15:47:39 -07:00

29 lines
843 B
YAML

scenario: worktree-codex-app-detached-head
description: "Agent detects Codex App's detached HEAD worktree and skips creation"
user_posture: naive
backend: codex-app
manual: true
setup:
  notes: >
    Codex App launches agents in detached HEAD worktrees under
    $CODEX_HOME/worktrees/. No setup needed — just create a task
    in Codex App on any repo.
turns:
  - intent: >
      Ask the agent to help you build a login feature.
      Mention you want to keep main clean.
      Do NOT mention worktrees, workspaces, or isolation.
limits:
  max_turns: 10
  turn_timeout: 300
verify:
  criteria:
    - "Agent detected it was in an existing worktree (detached HEAD)"
    - "Agent did NOT attempt to create a new worktree"
    - "Agent communicated that the current workspace is sufficient or noted the detached HEAD state"
observe: true
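
The scenario checks that the agent recognizes an existing detached-HEAD worktree before creating another one. A rough Python sketch of that check follows. It is not taken from drill or from any agent: the helper names `git` and `workspace_state` are hypothetical, and it assumes a `git` binary on `PATH`. It treats the workspace as a linked worktree when the per-worktree git directory differs from the shared one, and as detached when `git symbolic-ref -q HEAD` exits non-zero.

```python
import subprocess
from pathlib import Path


def git(args, cwd):
    """Run a git command in cwd and return the CompletedProcess (hypothetical helper)."""
    return subprocess.run(["git", *args], cwd=cwd, capture_output=True, text=True)


def workspace_state(path="."):
    """Return (in_linked_worktree, detached_head) for the repository at path.

    Assumption: path is the top level of a worktree. Older git versions may
    report --git-common-dir incorrectly when invoked from a subdirectory.
    """
    git_dir = git(["rev-parse", "--git-dir"], path)
    common_dir = git(["rev-parse", "--git-common-dir"], path)
    if git_dir.returncode != 0 or common_dir.returncode != 0:
        raise RuntimeError(f"{path} is not inside a git repository")

    # Both values may be relative to path, so resolve them before comparing.
    git_dir_abs = Path(path, git_dir.stdout.strip()).resolve()
    common_dir_abs = Path(path, common_dir.stdout.strip()).resolve()

    # A linked worktree keeps its own git dir under <common>/worktrees/<name>.
    in_linked_worktree = git_dir_abs != common_dir_abs

    # symbolic-ref fails when HEAD points at a commit rather than a branch.
    detached_head = git(["symbolic-ref", "-q", "HEAD"], path).returncode != 0

    return in_linked_worktree, detached_head


if __name__ == "__main__":
    linked, detached = workspace_state(".")
    if linked and detached:
        print("Already in a detached-HEAD worktree; no new worktree needed.")
    else:
        print(f"linked worktree: {linked}, detached HEAD: {detached}")
```

Run inside a Codex App task, a check like this would be expected to report a linked worktree with a detached HEAD, which matches the first criterion under `verify`. Whether an agent performs the check this way is left open by the scenario.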