swarmd

Mission-enforced Claude agent runner. You write what "done" looks like as shell commands; swarmd keeps a Claude agent working until those commands pass — across crashes, API outages, and context resets. It doesn't let the agent quit early, game its own check, or drift off task.

How it differs from plain `claude`

| Situation | Plain `claude -p "do X; pytest should pass"` | `swarmd` |
|---|---|---|
| API 424 mid-run | session dies, work lost | workflow resumes from where it crashed |
| Agent "done early" | stops when Claude says so | blocks completion until criteria pass for `hold_window_sec` straight |
| Agent edits tests to pass | you find out later | 6-dim anti-cheat panel fires on every pass-transition |
| Agent stuck in loop | no detection | pattern detector emits a loop finding |
| Agent drifts off task | no detection | cadence-driven goal-drift + progress audits |
| Worker machine reboots | everything lost | Temporal persists state; next worker picks up |

How it works

You give swarmd two things: a mission (natural language goal) and success criteria (shell commands whose exit codes define done). Everything else — planning, code, commits, decisions — is the agent's job, same as running claude directly.

When you run `swarm launch mission.yaml`, this happens:

1. A Temporal workflow starts on the swarm worker daemon. Its job: run enforcement, not write code.
2. The workflow spawns a `claude` subprocess in your workspace with the mission prose. Claude does the actual work: files, tools, subagents, whatever it picks.
3. In parallel, the workflow runs a verifier loop every `run_every_sec`: check tamper (are locked files unmodified?) → enforce invariants (no_mock, test_count_floor, etc.) → run every criterion's shell check in parallel → update state.
4. Three child workflows run alongside:
   - PatternDetector: tails `events.jsonl`, flags loops, oscillation, scope-shrinking
   - LLMCritic: cadence-driven Haiku calls for progress audit + goal-drift; fires the 6-dim anti-cheat panel (scope_reduction, mock_out, tautology, hardcode, off_criterion, coordinated_edit) on every criterion pass-transition
   - ResourceMonitor: zombie processes, memory pressure, disk
5. When every criterion passes, the workflow enters a hold window. If they stay green for `hold_window_sec`, a completion judge runs six preconditions (no open cheat/fabrication/tamper findings, no critic disagreements, per-criterion anti-cheat verdict pass) before allowing the transition to complete.
6. Transient errors (HTTP 424/429/5xx, timeouts) become Temporal retries; terminal errors (400/401, auth) halt the mission with a clear reason.

What you specify: mission prose, workspace path, success criteria, optional invariants. What you don't specify: any plan, any steps, any agent behavior. Claude figures that out.

Components

- Temporal server: external dependency (`brew install temporal`). Persistent state.
- swarm worker: long-running daemon that polls Temporal and executes workflows + activities. Run one or more; more workers = more missions in parallel. Restart at will; state survives.
- Per mission at runtime: 1 parent workflow + 3 child workflows + 1 claude subprocess + up to 6 parallel anti-cheat activities on each pass-transition.

Installation

```bash
pip install swarmd
```

Requires Python 3.10+, Temporal (brew install temporal), and claude on PATH.

Example

```yaml
# mission.yaml
mission: "Add full test coverage to auth.py"
workspace: "/abs/path/to/your/project"
success_criteria:
  - id: tests_pass
    check: "pytest auth/ -q"
    timeout_sec: 120
  - id: coverage_floor
    check: "coverage report --include=auth.py --fail-under=90"
    timeout_sec: 30
  - id: no_mocks        # anti-cheat floor
    check: "! grep -rE 'unittest.mock|MagicMock' auth/"
    timeout_sec: 10
verification:
  run_every_sec: 30
  hold_window_sec: 60
```

```bash
temporal server start-dev &
swarm worker &
swarm launch mission.yaml       # → workflow_id=mission-abc123
swarm status mission-abc123
swarm findings mission-abc123 --tail 50
swarm abort mission-abc123 --reason "criteria were wrong"
```

Documentation

Design spec — full architecture
Mission schema — every field
Examples — reference missions

License

MIT
