Memee
Shared memory for AI agent teams · v2.4.7 on GitHub · 5 packs · 306 entries · PyPI · Menubar app · macOS

You shouldn’t have to
teach it twice.

Every chat is a new intern. You teach them Monday. By Friday they’ve quit. Memee writes it on the wall. The next intern reads it. So does your teammate’s. So does the next model.

4
Model families,
one canon
1
Canon, not scattered
CLAUDE.md files
Sessions the
memory outlives
$0
To start.
Forever.

What Memee actually does.

Three jobs. No dashboards, no copilots, no magic.

01 · Records

Records once. Reuses forever.

When any agent figures something out (a retry rule, a PII edge case, a deploy gotcha), Memee writes it down, dates it, scores it. The next agent on the next project picks it up without being told.

7-task A/B, time saved71%
02 · Routes

Routes signal, not the dump.

Your CLAUDE.md reloads in full on every turn. Memee picks the 5–7 memories this task actually needs, inside a 500-token cap, and stops there. Median router output: 39 tokens. The cap holds as your library grows past a thousand entries.

Routed vs CLAUDE.md median~40 / ~2,160
03 · Scores

One canon, every model.

Claude, GPT, Gemini, Llama all write to and read from the same memory. One model’s claim is provisional. A second family confirms: confidence ×1.3. A second project re-uses: ×1.5. Combined: ×1.95. Nothing reaches canon on a single voice.

Model families, one canon4
04 · Pins

Some policies shouldn’t lose to vibes.

Mark a policy authoritative. It surfaces under Pinned policies in scope: in every briefing whose tags overlap — ahead of search, regardless of similarity score. The org rule outranks the conversational hit. memee record --pin.

Layer 0.5, capped bullets5

The token math.

Numbers from a 27-repo sample of real public CLAUDE.md / AGENTS.md files, not a synthetic worst case. The hidden problem isn’t the first page — it’s the slope.

Without Memee
~2,160tokens / turn (median)
Median CLAUDE.md / AGENTS.md across 27 popular OSS repos (langchain, vercel/ai, prisma, zed, openai/codex, …). Claude Code and Cursor load it in full on every session start. Grown teams hit 6k–15k; a published pathological case reached 42k.
With Memee
≤500tokens / task (routed)
The router picks only the 5–7 memories relevant to the current task, inside a 500-token budget cap. The knowledge base can grow to thousands of entries without growing per-turn context.
You keep
≥77 %at median. Gets better from there.
At median you save ~77 %; at 10k-grown teams ~95 %; at the 42k outlier ~99 %. And unlike CLAUDE.md, it’s bounded — it doesn’t grow as your knowledge base does.

Your CLAUDE.md grows forever. Memee doesn’t.

How we get to these numbers.
Without Memee: we sampled 27 real public repos (langchain-ai/langchain, vercel/ai, prisma/prisma, openai/codex, zed-industries/zed, All-Hands-AI/OpenHands, and more) via gh api. Median file size translates to ~2,160 tokens, 2,500 mean, 9,600 at p95, one published outlier at 42,000. Anthropic’s own docs confirm CLAUDE.md is loaded on every session and rides along with every turn.
With Memee: the router has a 500-token budget cap and runs hybrid BM25 + vector + tag search to fill it with the 5–7 most relevant memories. Measured average on a synthetic 500-pattern corpus is ~40 tokens per briefing (min 18, max 67) — well under the cap. Reproducible with pytest tests/test_router.py::test_token_budget_respected.
These are measured benchmark numbers with citations, not customer case studies. Full methodology + per-repo file sizes: docs/benchmarks.md.

On a flat plan? The win is time, not dollars.

If you pay per token, cut it from your bill. If you pay a flat subscription (Claude Max, ChatGPT Plus / Pro, Claude for Business seats), the saved tokens buy you four other things.

01

More productive work under the same cap.

Max x20 isn’t unlimited. You hit the 5-hour throttle. A 500-token briefing leaves your budget for the actual answer, not a context dump.

02

Better output on the first try.

On a 207-query eval harness with adversarial lexical-gap queries, MRR = 0.87 with the optional cross-encoder reranker. In a 7-task A/B, iterations per task dropped −65 %. Real time saved, not imaginary dollars.

03

Memory across sessions and projects.

Claude Max session resets. Your agent forgot what you taught it an hour ago. Memee didn’t. One lesson, everywhere, forever. This part is independent of pricing tier.

04

Cross-model canon.

Switch Claude Max to ChatGPT Team to Gemini. The canon stays. Flat subscriptions lock you to one vendor. Memee gives you a vendor-exit option.

A light in the menubar. Nothing else.

Memee’s first GUI is one dot in your macOS menubar. It tells you Memee is awake. It tells you what it did. It writes nothing.

01 · State, not noise

Four lines. Read-only.

Status, last brief, last learn, totals. No notifications, no badges, no popups. The bar reads what the hooks wrote. memee bar install wires it as a LaunchAgent.

02 · How Memee helped, 7 days

Earned silence. Per counter.

One rolling line: 3 mistakes avoided · 42 patterns applied · +2 canon this week. Zero counters drop out. Zero across the board hides the row.

03 · Update notice, when there’s one

No popups. No nags.

A line appears when a newer PyPI release exists. Click opens the release notes. Hidden when you’re current. 24h cache, zero new network calls on the hot path.

Canon is not unfalsifiable.

A rule promoted two years ago is a candidate for being wrong today. Memee schedules its own re-checks. Per memory. Per half-life.

01 · Half-life per row

FSRS-light decay.

Every memory carries a predicted recall R(t) = 2^(-Δt/h). Validate it under load, the half-life stretches. Skip it, it shrinks. Anki’s scheduler, applied to canon.

02 · Re-checking canon

One line. Sometimes.

When a wide-uncertainty canon row gets stale, Memee adds Re-checking canon: <title> to the briefing. A quiet flag, not a directive. Per-week, not per-prompt.

03 · SPRT promotion

Wald’s test, not a magic number.

Canon used to require conf ≥ 0.85 and ten validations. v2.4.2 replaces both with the Sequential Probability Ratio Test at α = β = 0.05. One statistical dial; tighter Type-I.

Memory, compared.

Per-user context. Per-agent state. Or per-organisation memory. Pick the shape that fits your team.

Product Cross-model Cross-project Cross-team Auto-capture Aging / lifecycle Re-checks itself Licence
Claude Skills Open standard. Ported to GPT, Gemini Per-folder. Manual reuse No. Static files No. Author-written None. Files don’t decay No Free with Claude
Mem0 Claude, GPT, Gemini, local Not first-class Cloud tier Auto-extract from chat Not documented No Apache 2.0 + cloud from $19
Letta Multi-model Per-agent state Not advertised Auto via agent loop Three-tier virtual memory No Apache 2.0 + cloud from $20
Zep Multi-model Not a documented primitive Roles via API Auto from episodes Temporal graph Temporal edges only Graphiti Apache 2.0. Cloud from $25
ChatGPT Memory Vendor-locked to ChatGPT Project containers. Isolated No. Per-user only Both Not published Not published Bundled with Plus / Team
Hermes Agent Provider routing. Multi-LLM Single-server posture No Both. Agent-curated Not documented No MIT

Sources, verified April 2026: anthropic.com/news/skills, mem0.ai/pricing, letta.com/pricing, getzep.com, openai.com/memory, hermes-agent.nousresearch.com.

Memory you install.

Five hand-curated .memee packs ship today. 306 sourced rules, signed, idempotent. One command to import. Day-one users get canon before the first agent session.

python-web · 61

FastAPI, SQLAlchemy, Pydantic, pytest.

Async correctness, observability, deployment, security canon every Python service needs from day one. memee pack install python-web.

react-vite · 52

Hooks, forms, a11y, mobile gotchas.

TypeScript strict, Tailwind, TanStack Query, React Router v6 loaders. Animation perf, prefers-reduced-motion, iOS 100vh fix. memee pack install react-vite.

mcp-server-canon · 59

MCP servers that don’t get pwned.

Lethal-trifecta read+write split, supply-chain pinning, prompt injection in tool descriptions, stdio framing, the 15-tool ceiling. memee pack install mcp-server-canon.

http-api-canon · 91

The IETF index nobody had written.

RFC 9110 + 7807/9457 + 8725 + 9700 + OAuth 2.1 distilled. JWT BCP, OAuth pitfalls, problem+json errors, ETag concurrency. memee pack install http-api-canon.

agent-discipline · 43

Failure modes every model shares.

Fabricated paths, premature done, scope explosion on small fixes, sycophancy under pushback, the lethal trifecta. Every entry cites primary literature. memee pack install agent-discipline.

coming · contributors

swiftui · postgresql-canon

Two more in the pipeline. SwiftUI state ownership, gesture/offset structural bugs. Postgres index strategy, EXPLAIN reading, lock contention. Open a PR.

See every pack →

Three fair questions.

From the replies. Answered without flinching.

Q1

Doesn’t an MCP server just add to my context window?

It would, if it dumped. Memee doesn’t. The router has a hard 500-token cap and lands at ~40 tokens on average per briefing. Measured, not aspirational.

Compare that to the file you already ship: a median CLAUDE.md / AGENTS.md across 27 popular OSS repos is ~2,160 tokens, loaded in full on every session and ridden along on every turn. That file only grows.

Memee is the cure for context bloat, not a new source of it.

Q2

Don’t Claude Skills already do this?

Skills are good. They’re also Claude-native, hand-authored, static, and per-folder. Memee is none of those things.

It writes from the conversation, not from your hand. It scores what it captures: cross-model ×1.3, cross-project ×1.5. It ages out what stops earning its keep on the lifecycle ladder, hypothesis → tested → validated → canon → deprecated. The same canon answers GPT in Cursor, Gemini in Continue, and Llama wherever you run it.

Skills are a folder. Memee is a memory.

Q3

How is this different from Mem0, Letta, Zep, ChatGPT memory?

Most memory projects remember conversations: you talked about X last Tuesday, here it is again. That’s chat history with a vector index.

Memee earns canon. A claim arrives at 0.5 confidence and goes nowhere until a second model family agrees and a second project re-uses it. On OrgMemEval v1.0 that capability gap shows up as 96.3 % against a competitor baseline of ~2 %. Not because the others are bad. Because they aren’t built for the job.

Conversation memory remembers what was said. Institutional memory remembers what was learned.

Q4

So I install it and… it just works?

After memee setup, yes. Hooks land in your Claude Code settings: SessionStart fires a routed briefing into the agent’s context. UserPromptSubmit fires a task-aware brief on every prompt. Stop fires a post-task review. Three lines in your settings.json. You don’t write them; memee setup writes them. You don’t call memory_search; the hook called it.

Opt out with memee setup --no-hooks and use Memee as a plain MCP tool. Most don’t.

The agent doesn’t have to remember Memee exists. The runtime injects what it needs.

A hundred agents across a hundred projects. Eighteen months. Incidents fell from twelve a month to three. That’s the whole pitch.
Internal simulation, test_gigacorp, marked as such.
501Mtokens saved
Across 18-month simulation
12 → 3
Incidents / month
annual ROI
At the $49 / month Team tier

Pick your scope.

Flat per team. Same engine in every tier.

Free
$0forever · MIT-licensed
For solo developers. Self-hosted. Full engine, local scope.
  • The full engine: router, quality gate, dream-mode, CMAM sync
  • All 4 model families (Claude, GPT, Gemini, Llama)
  • Personal scope, local SQLite
  • GitHub issues support
Install from GitHub
Enterprise
from $12kper year · unlimited seats · custom MSA
For regulated industries, air-gap, SOC 2.
  • Everything in Team, at any scale
  • SOC 2 Type II, DPA, SCIM provisioning
  • On-prem license key, air-gap deployment
  • Dedicated CSM, 4h SLA, quarterly reviews
  • Custom MCP integrations
Talk to sales

Free ships the full engine. Team layers on multi-user scope, SSO, and an audit log. Enterprise adds SOC 2, on-prem, and a 4h SLA. Between fifteen and a hundred seats and no SOC 2 needed? info@memee.eu handles Growth plans by hand.

Order it.

Three lanes. Pick the one for you.

01 · Free

One command.

pipx install memee && memee setup
memee pack install python-web
# already installed? pipx upgrade memee

The engine, on your machine, by tea-time. Bring your own keys. Pick a pack — five ship today — and your first session starts already knowing.

No pipx? brew install pipx, or fall back to python3 -m pip install memee. Click the command to copy.
02 · Team

One form.

No card until day fifteen. Cancel by reply.
03 · Enterprise

One call.

Twenty minutes. If you need SSO, self-hosting, or a DPA, we start here, and end with a signature.

Book 20 minutes Typical reply: same working day, CET.