Skip to content

emberian/cv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

85 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ”ฎ clustervision

Vibecoding is a clusterfuck. When it gets hazy, you need clustervision.

Your AI coding sessions are gold. Stop letting them rot in scattered folders.

Find, search, convert, prune, and even resurrect every AI coding-agent session you've ever run โ€” across 20 harnesses, through one unified format.

claude ยท codex ยท grok ยท opencode ยท gemini ยท hermes ยท openclaw ยท cursor ยท โ€ฆ

search by meaning ยท read behind every compaction ยท port across harnesses ยท resurrect a maxed-out session ยท pack a new task from your whole history

No database, no import, no telemetry: clustervision reads your real local session storage and turns the scattered pile of .jsonl into one searchable, lossless corpus. Full-text and semantic search across everything you've ever run; the verbatim detail behind every compaction boundary; one-command porting between harnesses; and a custom compaction that pulls a maxed-out session back from the context wall.


clustervision timeline โ€” a GitHub-style activity heatmap of every agent session across every harness, with a constellation feed below

the desktop app reading 2,245 real local sessions across 8 harnesses โ€” heatmap, harness legend, and a live feed


The problem (you've felt this)

You spent three hours teaching an agent your codebase. That context โ€” the dead ends, the decisions, the hard-won understanding โ€” is some of the most valuable output you produce. And then:

  • ๐Ÿ™ˆ You can't find it. It's one of hundreds of .jsonl files in a folder named after a path. (Yes, you keep the important session IDs in a notes app. Everyone does.)
  • โ›“๏ธ It's trapped. Most harnesses only resume a session from the exact directory it ran in. Move the project, lose the thread.
  • ๐Ÿ๏ธ It's stranded. Started in Claude Code but want to continue in Codex? Tough. Every tool speaks its own dialect and none of them talk.

Restarting from scratch is the most expensive thing you do all day. clustervision makes sure you never have to.

What you can actually do with it

# ๐Ÿ”Ž find that session from last week by what it was ABOUT, not where it lived
cv search "the flux inference refactor"        # full-text, instant
cv search --semantic "formalizing proofs"      # by meaning โ€” no keyword overlap needed

# ๐Ÿš€ take a Claude session and continue it in Codex. for real.
cv convert da9174f4 --to codex
#   โœฆ wrote ~/.codex/sessions/2026/โ€ฆ/rollout-โ€ฆ.jsonl
#   โ†ณ codex resume 019e75e0-โ€ฆ

# ๐Ÿงณ break a session out of its directory jail (brings CLAUDE.md / MEMORY.md along)
cv port da9174f4 --to-dir ~/new/home

# ๐Ÿชฆ a session hit the context wall and won't reopen? prune the bulk and revive it
cv prune da9174f4 --thinking --revive
#   โœฆ recorded context 976k โ†’ 149k ยท resume with: claude --resume <new-id>

# ๐Ÿ‘๏ธ watch every agent on your machine work, live, in one feed
cv scry

โ€ฆand a couple that didn't exist before:

  • ๐Ÿง  An MCP server so a running agent can read other agents' sessions โ€” "what happened in this project before?", "what's my sibling agent doing right now?", "have I solved this before?" โ€” mid-task, without leaving its harness.
  • ๐ŸŒ A zero-install web viewer: drag a zip of any harness folder into your browser and explore it. Nothing uploaded, all WASM.

๐Ÿช 20 harnesses, one IR

Harness Parse Convert to Harness Parse Convert to
โœ… Claude Code โœ… โœ… โœ… Cursor โœ… โ€”
โœ… Codex CLI โœ… โœ… โœ… Kimi CLI โœ… โœ…
โœ… Grok CLI โœ… โœ… โœ… Qwen Code โœ… โœ…
โœ… OpenCode โœ… โœ… โœ… LM Studio โœ… โœ…
โœ… Gemini / Antigravity โœ… โœ… โœ… Cline โœ… โœ…
โœ… Hermes (Nous) โœ… โœ… โœ… Roo Code โœ… โœ…
โœ… OpenClaw โœ… โœ… โœ… Continue โœ… โœ…
๐Ÿ”’ Claude / ChatGPT apps detectedยน โ€” โœ… Goose (Block) โœ… โ€”
โœ… Zed (agent panel) โœ… โ€”

ยน The Claude app keeps transcripts server-side; the ChatGPT app keeps them locally but encrypted at rest. We detect the install and document exactly why neither is readable โ€” see docs/FORMATS.md.

Conversion is N-way among the 13 emit-capable harnesses: any โ†’ any, mediated by one unified IR. Every format reverse-engineered in docs/FORMATS.md. Bringing your own? โ†’ ADDING_HARNESS.md ๐Ÿ’›

โœจ More than a viewer

  • ๐Ÿงต Splice & loom โ€” compose a new session from spans of others (cv splice A:0-12 B:6-), or fork-and-graft a branch and generate its continuation with an LLM (cv loom โ€ฆ --generate). Works via OpenRouter / Anthropic / LM Studio (free, local, offline). Loom agent transcripts like a Janus loom, across any harness.

  • ๐Ÿง  Distill โ€” cv distill <id> turns a session into a durable MEMORY.md digest (decisions, gotchas, where things live). Your archive compounds instead of rotting.

  • ๐Ÿ”ฎ Recall โ€” semantic "have I solved this before?" โ€” as a cv recall command and an MCP tool that hands a running agent the relevant past span.

  • ๐Ÿงฌ Provenance โ€” search answers what was said; the event catalog answers what was done. Every tool call across every session is classified and queryable: cv touched <file> lists every session that ever edited a file, and cv blame <file> ties a file's git history back to the agent conversation that wrote it โ€” "why does this code exist?", answered by the actual reasoning that produced it (with a cv show --range jump to the moment of the edit).

  • ๐ŸŒณ Anatomy of a run โ€” a deep agent session isn't a flat transcript, it's a forest. cv workflow <id> renders a Workflow-tool run as its real shape โ€” the phase tree, the agents under each phase, their journaled outcomes/tokens/tool-calls, and the driving script. cv tools <id> is cross-agent tool analytics over the whole orchestrator+sub-agent forest (per-agent histograms, which agent used what, a wall-clock timeline). cv compaction <id> finds every context-compaction seam โ€” trigger, pre-compaction size, and the summary that seeded the next window โ€” and cv show --pre-compaction reads back the span the continued agent lost. (cv dataset --subagents pulls the whole forest into a training set.)

  • โœ‚๏ธ Prune โ€” cv prune <id> is custom compaction: instead of abandoning a giant session to the summarizer (which rewrites your history into a lossy paragraph), it snips the bulky old tool payloads โ€” large file reads, command logs, screenshots โ€” into a sidecar and leaves a tiny [PRUNED id=โ€ฆ] marker, producing a new, resumable session (claude --resume <new-id>). Your prompts and the exact flow stay verbatim; the most recent turns stay sharp; originals are one cv prune โ€ฆ --retrieve away. Add --thinking to also lift the oldest reasoning, and --revive to resurrect a session already stuck at the context wall โ€” Claude Code's resume gate trusts a stale usage number recorded in the file, so a maxed session refuses to reopen even once its real content fits; --revive rewrites that number to the honest post-prune size and the gate lets you back in. (Algorithm adapted from the validated flatten-mcp.)

  • ๐Ÿ”’ Redact โ€” cv redact <id> scrubs secrets/PII so a transcript is safe to share.

  • ๐ŸŽ Share โ€” cv share <id> โ†’ one self-contained, redacted-by-default HTML artifact: dark crystal-ball theme, collapsible thinking/tool folds, opens offline in any browser, uploads nothing (CSP-pinned so it can't).

  • ๐Ÿ“ฆ Pack โ€” cv pack "<task>" compiles a context bundle from your whole corpus: relevant past spans + what files those sessions actually touched (event catalog), as a CLAUDE.md digest, a system prompt, or a synthetic resumable session in any harness. Never explain your codebase to an agent twice.

  • ๐Ÿ–ฅ๏ธ Desktop app + ๐ŸŒ web viewer โ€” the Tauri app reads all your local sessions natively (zero setup) and lays the corpus out beautifully:

    • a Projects lens โ€” every repo, every agent that touched it, over time;
    • a GitHub-style activity heatmap timeline (a constellation of your working days);
    • side-by-side Compare, a Stats dashboard, and a visual loom composer (OpenRouter or free local LM Studio generation);
    • sub-agent trees โ€” a Claude Task session's children, nested and lazy-loaded inline, each labeled with its task prompt;
    • a Structure explorer (<cv-forest>) โ€” Overview / Forest / Workflows / Tools / Compaction tabs that turn a run's anatomy into something you can drill through.

    The browser build is zero-install โ€” drop a harness zip, nothing uploaded (all WASM).

  • ๐Ÿ”Œ Harness integrations โ€” plug clustervision into the agents' own hooks/MCP/plugins (integrations/): SessionEnd โ†’ archive + distill, SessionStart โ†’ recall.

๐Ÿ“– Manual

Full docs โ€” every CLI command, the MCP tools, the daemon's HTTP API, the app, cross-harness conversion, and the harness table โ€” live in the user manual (mdBook, also under manual/).

๐Ÿ› ๏ธ Install

Prebuilt binaries for macOS / Linux / Windows (arm64 ยท x64 ยท x86) ship on every release โ€” grab the latest from Releases (cv ยท cv-mcp ยท cvd ยท cv-tui ยท cv-search), or one-line it:

curl --proto '=https' --tlsv1.2 -LsSf https://github.com/emberian/cv/releases/latest/download/cv-installer.sh | sh

Or from source:

cargo build --release          # โ†’ target/release/{cv, cv-mcp, cvd, cv-tui, cv-search}

๐Ÿง  Let agents read each other's minds (MCP)

claude mcp add clustervision -- /path/to/target/release/cv-mcp

Tools incl: list_sessions ยท search_sessions ยท read_session ยท project_sessions ยท recall (semantic "where was this solved before") ยท prune_session/prune_retrieve (custom compaction, in-place). โ€ฆand a few more your agents can discover for themselves. ๐Ÿ˜‰

๐Ÿ“ก Archive your whole fleet (cvd)

cvd sync     # snapshot every session into ~/.clustervision
cvd watch    # follow live + archive as sessions change โ†’ a fleet activity feed

๐Ÿงฌ The OpenSession standard

After staring into seven different transcript formats, we wrote down the one they should have agreed on: OpenSession โ€” a small, honest, harness-neutral interchange format (the key heresy: cwd is metadata, not identity). clustervision's IR is its reference implementation. If you ship a harness, emit OpenSession and everyone's sessions become portable by construction. ๐Ÿค

๐Ÿ—๏ธ Under the hood

One IR (Session โ†’ Message โ†’ Block{Text|Thinking|ToolUse|ToolResult|File|Image}), one Adapter per harness (discover + parse + emit), plus loom / redact / prune / watch / ingest modules โ€” and small crates on top: cv (CLI) ยท cv-mcp (MCP) ยท cvd (daemon + serve) ยท cv-search (tantivy + model2vec) ยท cv-llm (distill/generate) ยท cv-web (WASM) ยท app/ (Tauri desktop).

parse(any harness) โ†’ ๐Ÿ”ฎ unified IR โ†’ search ยท convert ยท port ยท prune ยท loom ยท distill ยท archive ยท view

๐Ÿงช Status

Built in a wild few sessions, much of it by a swarm of agents working disjoint files. โœจ Honest about the edges:

  • 20 harnesses parse; the 7 core ones also emit (N-way conversion). The rest โ€” Cursor, Kimi, Qwen, LM Studio, Cline, Roo, Continue, Goose, Zed, and the ChatGPT/Claude.ai account data exports (chatgpt-export/claude-export โ€” register their location with cv config --add-export <path>) โ€” are parse-only for now.
  • Robustness: 2000+ real sessions parse with 0 panics; parsers are fuzz-tested against hostile input.
  • The full-text index trades disk for speed; Gemini's protobuf .pb is opaque; a few sidecar tool-call streams aren't merged yet.
  • Historical format variants are an explicit goal โ€” see ADDING_HARNESS.md, and please send your own harness logs (we can only test what we can see).

PRs and weird old transcripts deeply welcome. ๐Ÿ’œ

made with ๐Ÿ”ฎ and an unreasonable amount of enthusiasm ยท MIT/Apache-2.0

About

agents can read each others minds (session logs)

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors