Vulpine

A multi-agent vulnerability-development pipeline. Feed it a repository URL and (optionally) a commit hash; the agents build the target, model its attack surface, fuzz each feature, audit every function on the hot path, look for security flaws, and attempt to chain the best bugs into an exploit.

Vulpine ships with dual-platform agent definitions so the same pipeline runs on Claude Code and on OpenCode, letting you benchmark different backend models (open and closed) on the same vulndev workflow.

Pipeline stages

#	Agent	Produces
1	`build-preparation`	`build/` — Dockerfile + source tree that builds clean with ASan/TSan/UBSan and with cppfunctrace instrumentation.
2	`code-navigation`	`nav/` — Woboq HTML index + compile_commands.json + a ready-to-use `codenav` database.
3	`attack-surface`	`ATTACK_SURFACE.md` — enumerated reachable features.
4	`configuration`	`configure-target.sh` — turns the container into a realistic deployment.
5	`attack-surface-mapping`	`features/<feature>/` — minimal deterministic fuzzer + gcov coverage → function set. Fans out to N copies of agent 6.
6	`function-auditor`	`audit-log.db` — intent vs. implementation summary for every touched function, plus a ranked list of misbehaving ones.
7	`code-auditor` (+ `crash-analyzer` / `crash-analyzer-checker`)	`issues/<id>/` — report, minimal trigger, GDB verification script. For critical memory-corruption findings, drives a ≤4-round analyzer/checker loop producing an `evidence/` chain with real rr output; unresolved after 4 rounds → CONTESTED (severity capped at `high`).
8	`exploit-developer`	`exploit/` — chained-bug exploit attempts + `EXPLOIT_LEARNINGS.md`.

A top-level vulpine-orchestrator agent (Claude Code) / /vulpine command (OpenCode) runs the stages in order and handles fan-out at stage 5.

Skills / tools

Each skill is a Claude Code-format skill (SKILL.md + helpers) that lives in its own upstream repository. Vulpine does not author skill content itself — it just ships agents that reference skills by name. install-tools.sh clones the upstream repos into tools/src/ and the deploy scripts symlink each SKILL.md directory into the Claude Code / OpenCode skill search paths.

Skill name ( frontmatter )	Upstream
`cppfunctrace`	https://github.com/thomasdullien/cppfunctrace (dir `skill/`)
`codenav`	https://github.com/thomasdullien/codenav
`gcov-coverage`	https://github.com/thomasdullien/ffmpeg-patch-analysis-claude/tree/main/gcov-coverage
`rr-debugger`	https://github.com/thomasdullien/ffmpeg-patch-analysis-claude/tree/main/rr-debugger
`line-execution-checker`	https://github.com/thomasdullien/ffmpeg-patch-analysis-claude/tree/main/line-execution-checker
`function-tracing`	https://github.com/thomasdullien/ffmpeg-patch-analysis-claude/tree/main/function-tracing
`fnaudit`	https://github.com/thomasdullien/fnaudit (skill dir `.claude/skills/fnaudit/`, installs the `fnaudit` Python CLI)

For OpenCode (which has no native skills concept), the deploy script also populates ~/.vulpine/skills/<name>/ so agent prompts can @-include the same SKILL.md content.

Layout

vulpine/
├── README.md
├── VULPINE_INITIAL.md               # design brief (source of this scaffold)
├── .claude/
│   └── agents/                      # Claude Code subagents (one per stage + orchestrator)
├── .opencode/
│   ├── agents/                      # OpenCode mirrors of the same agents
│   └── commands/                    # OpenCode slash commands (/vulpine orchestrator)
├── scripts/
│   ├── install-tools.sh             # clone upstream skill repos into ./tools/src/
│   ├── deploy-claude.sh             # link agents + upstream skills into ~/.claude
│   └── deploy-opencode.sh           # link agents + commands into ~/.config/opencode,
│                                    # and materialize ~/.vulpine/skills/ for @-includes
└── tools/src/                       # populated by install-tools.sh — upstream skill repos

Quick start

# 1. Install external CLIs (cppfunctrace, codenav, rr helpers, gcov helpers).
./scripts/install-tools.sh

# 2a. Deploy to Claude Code (user scope).
./scripts/deploy-claude.sh

# 2b. Deploy to OpenCode (user scope).
./scripts/deploy-opencode.sh

After deployment:

Claude Code: start claude in a working directory and run Use vulpine-orchestrator for https://github.com/<org>/<repo> at <commit>.
OpenCode: start opencode and type /vulpine <repo-url> [<commit>].

Each run writes its artifacts to ./run/<repo>-<commit>/<stage>/.

Evaluating different backend models

Every agent file in .claude/agents/ and .opencode/agents/ declares a model field. To swap backends for a benchmark run, either edit those files or use the per-platform override flags:

Claude Code: claude --model <id> or model: inherit + a parent override.
OpenCode: opencode --model <provider/id> or set model in opencode.json.

The orchestrator accepts an optional --model argument that is propagated to every subagent invocation.

Status

This is a scaffold generated from VULPINE_INITIAL.md. Each agent encodes the design brief as an executable prompt and lists the skills/tools it is expected to use. Running the pipeline end-to-end on a real target requires:

The external CLIs built via install-tools.sh.
Docker with --cap-add=SYS_PTRACE for rr + sanitizers.
A working directory with enough space for the build, Woboq index, and fuzz corpora (expect ≥10 GB for non-trivial targets).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vulpine

Pipeline stages

Skills / tools

Layout

Quick start

Evaluating different backend models

Status

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.claude/agents		.claude/agents
.opencode		.opencode
scripts		scripts
tools		tools
.gitignore		.gitignore
README.md		README.md
VULPINE_INITIAL.md		VULPINE_INITIAL.md

Folders and files

Latest commit

History

Repository files navigation

Vulpine

Pipeline stages

Skills / tools

Layout

Quick start

Evaluating different backend models

Status

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages