Wiggum Loop Hook Automation

Wiggum Loop is a safer, progress-aware variation on Anthropic's Ralph Loop plugin for Claude Code.

The original Ralph loop repeats the same prompt from a Stop hook inside the same Claude Code session. That is useful for short loops, but long runs accumulate chat history; eventually the model can drift, repeat itself, or give up. Wiggum keeps the Stop-hook option, but adds a true isolated runner that starts a fresh agent process for every iteration and carries only bounded file/ledger memory forward.

What's new in 0.3.0

0.3.0 is an operability release for running Wiggum longer and recovering cleanly:

/wiggum-doctor checks Python, git, Claude CLI access, writable Wiggum state, active loop state, and stuck archives before a long run.
/wiggum-resume restores the most recent stuck isolated-loop archive and continues with the original prompt, summary, and persisted run choices.
--notify sends best-effort macOS desktop notifications when a loop finishes, pauses, or hits a budget. Missing osascript and non-macOS platforms are safe no-ops.

The secondary reliability beat is safety and visibility: startup agent validation, --agent-retries, --no-agent-validation, --explain, always-on stuck-cause output, safer worktree/copy behavior, and shared wiggum_core helpers across entry points.

Upgrade notes from 0.2.x:

Global lessons are opt-in. By default the isolated runner writes only to .claude/wiggum-lessons.jsonl; pass --global-lessons to also append to ~/.wiggum/lessons.jsonl.
--sandbox auto-upgrades to worktree in git repos with a valid HEAD. Outside a git repo the default remains none; pass --sandbox none explicitly if you want direct main-tree edits.

See CHANGELOG.md for the full list of 0.3.0 changes.

Two operating modes

1. Stop-hook loop: `/wiggum-loop`

A direct Ralph-style loop inside the current Claude Code session.

Best for:

short interactive loops
quick self-correction
tasks where current chat context is helpful

Key files:

hooks/wiggum-stop-hook.sh
hooks/wiggum_stop_hook.py
scripts/setup-wiggum-loop.sh

2. True isolated loop: `/wiggum-isolated`

A standalone orchestrator that runs fresh agent subprocesses.

Best for:

high iteration counts
avoiding long-context give-up behavior
multi-model batches
best-of-N patch selection
metric optimization
unattended runs with dashboards/checkpoints

Key files:

scripts/wiggum-isolated-loop.sh
scripts/wiggum_isolated_loop.py

Feature summary

Feature	Stop hook	Isolated runner	Description
Finite by default	yes	yes	Defaults to bounded iterations instead of infinite spin.
Completion promise	yes	yes	Stops on exact `<promise>TEXT</promise>`.
Success verifier	yes	yes	Stops or scores based on a shell verifier such as `npm test`.
Rolling summary	no	yes	`.claude/wiggum-isolated-summary.local.md` prevents repeated failed attempts.
True fresh requests	no	yes	Starts a new agent subprocess each iteration.
Multi-model rotation	no	yes	Repeat `--agent-command`; switch every N turns.
Critic model	no	yes	`--critic-command` runs every N turns and updates summary.
Final reviewer	no	yes	`--review-command` must approve with `<review>APPROVED</review>`.
Worktree/copy sandbox	no	yes	Run candidate patches away from main tree.
Best-of-N candidates	no	yes	Run N candidate workers and keep the best accepted patch.
Patch acceptance policy	no	yes	Keep only if verifier/metric/progress policy passes.
Automatic rollback	partial	yes with sandbox	Rejected sandbox candidates are discarded; main tree stays unchanged.
Metric optimization	no	yes	Parse `METRIC name=value` and keep improvements.
Stuck classifier	yes/simple	yes/richer	Classifies no-progress, repeated verifier failure, timeout, refusal, missing metric.
Prompt mutation	no	yes	Adds anti-repeat tactical hint after failure modes.
Presets	no	yes	`--preset coding
Budgets	no	yes	Runtime, agent-run, and estimated-token stop budgets.
Human checkpoints	no	yes	Writes `.claude/wiggum-checkpoint.local.md` and pauses.
Dashboard	no	yes	Writes `.claude/wiggum-dashboard.md/html`.
Lessons DB	no	yes	Writes project and optional global failure lessons.
Installer	yes	yes	`scripts/install-wiggum-plugin.sh`.
Preflight doctor	yes	yes	`/wiggum-doctor` checks the local environment before a run.
Resume stuck run	no	yes	`/wiggum-resume` restores the latest stuck archive and continues it.
Desktop notification	no	yes	`--notify` reports terminal states on macOS when available.

Directory layout

ralph-wiggum-hook/
  .claude-plugin/plugin.json
  hooks/hooks.json
  hooks/wiggum-stop-hook.sh
  hooks/wiggum_stop_hook.py
  scripts/setup-wiggum-loop.sh
  scripts/cancel-wiggum-loop.sh
  scripts/status-wiggum-loop.sh
  scripts/wiggum-isolated-loop.sh
  scripts/wiggum_isolated_loop.py
  scripts/wiggum-doctor.sh
  scripts/wiggum-resume.sh
  scripts/install-wiggum-plugin.sh
  commands/wiggum-loop.md
  commands/cancel-wiggum.md
  commands/wiggum-status.md
  commands/wiggum-isolated.md
  commands/wiggum-doctor.md
  commands/wiggum-resume.md
  commands/install-wiggum.md
  tests/test-wiggum-loop.sh
  tests/test-wiggum-isolated-loop.sh
  tests/test-smoke.sh
  tests/run-all-tests.sh

Quick usage

Preflight and resume

/wiggum-doctor
/wiggum-resume --max-iterations 6

Direct scripts:

./scripts/wiggum-doctor.sh
./scripts/wiggum-resume.sh --max-iterations 6

Ralph-style Stop hook

/wiggum-loop "Fix auth and run tests" --success-command "npm test" --max-iterations 8

Direct script:

./scripts/setup-wiggum-loop.sh "Fix auth and run tests" --success-command "npm test" --max-iterations 8

True isolated runner

./scripts/wiggum-isolated-loop.sh \
  "Fix the failing tests without weakening assertions." \
  --agent-command "claude --print" \
  --success-command "npm test" \
  --max-iterations 20 \
  --mode variants \
  --notify

The isolated runner stores:

state: .claude/wiggum-isolated.local.json
original prompt: .claude/wiggum-isolated-prompt.local.md
rolling summary: .claude/wiggum-isolated-summary.local.md
ledger: .claude/wiggum-isolated.log.jsonl
dashboard: .claude/wiggum-dashboard.md and .claude/wiggum-dashboard.html
lessons: .claude/wiggum-lessons.jsonl (always) and ~/.wiggum/lessons.jsonl (only with --global-lessons)
archives: .claude/wiggum-archive/

Isolated runner options

Worker commands and multi-model rotation

--agent-command "claude --print"
--agent-command "codex exec -"
--agent-switch-every 4

Commands are batch-rotated. With --agent-switch-every 4, iterations 1-4 use command 1, 5-8 use command 2, then repeat. With best-of-N candidates, candidate fan-out staggers across commands so different models can compete in the same round.

If a command needs a prompt file instead of stdin, use {prompt_file}:

--agent-command "claude --print < {prompt_file}"

{iteration} is also replaced when present.

Startup validation and transient-failure retries

--no-agent-validation
--agent-retries 1

--no-agent-validation: by default the runner probes each --agent-command once at startup with a tiny stdin payload so a typo or missing binary fails fast. Pass --no-agent-validation to skip the probe (useful when the agent has expensive cold-start costs or refuses empty input).
--agent-retries N (default 1): if an agent invocation exits non-zero in under 5 seconds, the runner retries it up to N times. This rides over transient rate-limit and network blips without giving up on a worker. Long-running failures are never retried.

Terminal-state notifications

--notify

On macOS, --notify sends a best-effort desktop notification when the isolated loop finishes, pauses, or stops on a budget. If osascript is missing, blocked, or unavailable on the platform, the notification is skipped without changing the loop exit status.

Critic and final reviewer

--critic-command "gemini -p"
--critic-every 4
--review-command "claude --print"
--review-approval-token APPROVED

Critic: does not edit files; it receives JSON with recent logs, summary, git status, diff stat, and task. Its markdown output is appended to the rolling summary.
Final reviewer: runs before promise/verifier success exit. It must output <review>APPROVED</review> or the loop continues.

Worktree/copy sandbox and automatic rollback

--sandbox worktree
--acceptance verifier

Sandbox modes:

none: worker edits the main tree directly.
worktree: create a temporary git worktree, run the worker there, and apply only the accepted patch to main.
copy: copy non-git project files into a temp directory. Useful for non-git inspection, but patch application is only robust in git repos.

Rejected sandbox candidates are discarded automatically; the main tree is not modified. For worktree mode, the runner commits the current local baseline inside the temporary worktree so candidate patches contain only that worker's delta.

Best-of-N candidates

--candidates 3
--candidate-concurrency 3
--sandbox worktree
--acceptance metric

Each candidate gets its own sandbox. The runner ranks candidates by:

verifier success
metric value if configured
whether a patch was produced
smaller patch as a tie-breaker

Only the best accepted patch is applied to the main tree.

Patch acceptance policies

--acceptance auto|always|verifier|metric|progress

auto: metric if metric is configured; verifier whenever --success-command is set (regardless of sandbox or candidate count); otherwise always.
always: keep the chosen candidate regardless of checks.
verifier: keep only if --success-command exits 0.
metric: keep only if configured metric improves.
progress: keep only if a patch was produced.

Metric optimization

--metric-name score
--metric-direction higher
--acceptance metric

The runner parses lines like:

METRIC score=42.7

or:

score=42.7

Metrics can come from worker output or verifier output. In metric mode, verifier success does not automatically stop the loop; the loop can continue to optimize until max iterations or another stop condition.

Rolling memory and summarization

--memory-mode summary|none
--summary-max-chars 6000
--summary-command "claude --print"

By default summary updates are deterministic and include:

accepted candidate
changed files
verifier output tail
metric value
stuck reason
agent output tail
lessons to avoid repeating

If --summary-command is set, JSON containing previous_summary and latest_round is sent to that command and the markdown output becomes the new bounded summary. If the summarizer fails, deterministic summary is used as fallback.

Stuck classification and prompt mutation

The runner classifies failures as:

agent_timeout
model_refusal_or_give_up
no_workspace_progress
verifier_failed_without_output
same_verifier_failure
metric_missing
patch_rejected_by_policy

The classifier checks stagnation signals (no workspace change, repeated verifier failure, missing metric) before the refusal regex, so subprocess errors that happen to mention phrases like "cannot find" are no longer misread as model give-up.

Prompt mutation controls:

--prompt-mutation none|deterministic|command
--mutation-command "claude --print"

The deterministic mutation adds a tactical anti-repeat hint based on the stuck reason. Command mode sends JSON to --mutation-command and uses the output as the next tactical hint.

Explaining why a round was classified as stuck

--explain

By default the runner already prints stuck cause: signals={...} alongside the ⏸️ Paused... line whenever it pauses, so you can see which signal tripped the classifier without rerunning. With --explain, the same reason_signals dict (keys: regex_hit, stagnant, verifier_changed, agent_timeout, metric_missing) is also attached to every iteration entry in .claude/wiggum-isolated.log.jsonl, making post-mortem analysis straightforward.

Presets

--preset none|coding|review-heavy|cheap|explore

Presets tune default behavior while explicit flags still win:

coding: variants mode, 4-iteration batches, critic every 4 if configured via env.
review-heavy: stronger review cadence and optional reviewer from env.
cheap: smaller summary budget, slower model switching.
explore: variants mode, more candidate exploration.

Useful env vars:

WIGGUM_AGENT_COMMAND="claude --print"
WIGGUM_PRIMARY_AGENT="claude --print"
WIGGUM_SECONDARY_AGENT="codex exec -"
WIGGUM_CRITIC_COMMAND="gemini -p"
WIGGUM_REVIEW_COMMAND="claude --print"

Budgets

--max-runtime-seconds 3600
--max-agent-runs 30
--max-estimated-tokens 200000

Estimated tokens are approximated from prompt/output characters. When a budget is reached, the loop archives state and stops, or writes a human checkpoint if checkpoint mode applies.

Human checkpoints

--human-checkpoint never|always|on-stuck|on-critic
--human-checkpoint-every 5

When triggered, the runner writes .claude/wiggum-checkpoint.local.md with the current summary and pauses by archiving state. This is useful for moments that need human judgment.

Dashboard and lessons

Every iteration updates:

.claude/wiggum-dashboard.md
.claude/wiggum-dashboard.html

Failure/stuck lessons are always appended to the project file:

.claude/wiggum-lessons.jsonl

Lessons are project-only by default. To also append them to your shared global file ~/.wiggum/lessons.jsonl, opt in:

--global-lessons

If ~/.wiggum/ is read-only (for example, when Wiggum runs inside another sandboxed agent), the global append is skipped silently and a global_lessons_disabled event is recorded once in the ledger.

--no-global-lessons is a deprecated alias kept for backwards compatibility; with the new opt-in default it is no longer needed in the common case.

Recommended full pattern

./scripts/wiggum-isolated-loop.sh \
  "Make the tests pass without weakening assertions." \
  --agent-command "claude --print" \
  --agent-command "codex exec -" \
  --agent-switch-every 4 \
  --critic-command "gemini -p" \
  --critic-every 4 \
  --review-command "claude --print" \
  --success-command "npm test" \
  --candidates 2 \
  --candidate-concurrency 2 \
  --acceptance verifier \
  --agent-retries 1 \
  --max-iterations 24 \
  --max-runtime-seconds 7200 \
  --human-checkpoint on-stuck \
  --mode variants \
  --notify

In a git repo, --sandbox worktree is now the default and does not need to be passed explicitly. Lessons stay project-only by default, so --no-global-lessons is no longer required either — add --global-lessons if you want the shared ~/.wiggum/lessons.jsonl history back.

Installation

Copy the local plugin bundle:

./scripts/install-wiggum-plugin.sh --force

Best-effort settings update:

./scripts/install-wiggum-plugin.sh --force --enable

Default target:

~/.claude/plugins/local/wiggum-loop

Claude Code plugin discovery varies by version. If local plugin discovery does not pick it up automatically, run the scripts directly or register the local plugin path according to your Claude Code version.

Security notes

Hooks and agent commands run locally with your credentials.

Every --*-command flag is passed to Python's subprocess.run(..., shell=True) under your user account, in your current working directory, with the full environment. That means shell metacharacters (|, >, ;, $(...), backticks) are interpreted; values are not sanitized. Treat each of these flags as equivalent to pasting the string into your terminal:

--agent-command
--success-command
--summary-command
--critic-command
--review-command
--mutation-command

Prefer sandboxed mode (the new default in git repos) for unattended code-writing loops so candidate patches can be reviewed before they touch the main tree.

Tests

cd ralph-wiggum-hook
chmod +x hooks/*.sh hooks/wiggum_stop_hook.py scripts/*.sh tests/*.sh
./tests/run-all-tests.sh

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.claude-plugin		.claude-plugin
commands		commands
hooks		hooks
scripts		scripts
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
README.md		README.md
RELEASE_NOTES_0.3.0.md		RELEASE_NOTES_0.3.0.md

Folders and files

Latest commit

History

Repository files navigation

Wiggum Loop Hook Automation

What's new in 0.3.0

Two operating modes

1. Stop-hook loop: /wiggum-loop

2. True isolated loop: /wiggum-isolated

Feature summary

Directory layout

Quick usage

Preflight and resume

Ralph-style Stop hook

True isolated runner

Isolated runner options

Worker commands and multi-model rotation

Startup validation and transient-failure retries

Terminal-state notifications

Critic and final reviewer

Worktree/copy sandbox and automatic rollback

Best-of-N candidates

Patch acceptance policies

Metric optimization

Rolling memory and summarization

Stuck classification and prompt mutation

Explaining why a round was classified as stuck

Presets

Budgets

Human checkpoints

Dashboard and lessons

Recommended full pattern

Installation

Security notes

Tests

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages

1. Stop-hook loop: `/wiggum-loop`

2. True isolated loop: `/wiggum-isolated`