100% FREE · OPEN SOURCE · RUNS ON YOUR MACHINE

Memory that AI Agents Love !

Your agent forgets everything. Memanto fixes that.

Persistent memory for Claude Code, Cursor, Codex, and 14 other agents. 100% free, open source, and runs entirely on your machine · no API keys, no vector database, no backend.

Start Memorizing View on GitHub

89.8%LongMemEval

<90msrecall latency

17+integrations

›_memanto · bash

$pip install memanto

Collecting memanto...
Successfully installed memanto-0.2.0

memanto agent create dev-agent

█

> Agent namespace [dev-agent] created.
[OK] Memory nodes are listening.

remember · recall · answer

# 1. Install the CLI
$ pip install memanto

# 2. One-time setup — pick 2 for On-Prem
$ memanto
  Choose your backend
    1  Moorcheh Cloud    (instant, needs API key)
  > 2  Moorcheh On-Prem  (Docker, no API key)
  ✓ Setup complete — server on http://localhost:8080

# 3. Plug it into your agent
$ memanto connect claude-code

# 4. That's it — your agent now remembers
$ memanto remember "We deploy with Docker on port 8080"
$ memanto recall "how do we deploy?"

Runs in Docker on your machine. Embeddings and answers via local Ollama models — nothing leaves your laptop.

memanto · walkthrough

Instant IngestionConflict ResolutionMulti-AgentSemantic TypesBuilt-in RAGZero-CostFreshnessVerifiable SourcesDeterministic SearchTemporal QueriesInstant IngestionConflict ResolutionMulti-AgentSemantic TypesBuilt-in RAGZero-CostFreshnessVerifiable SourcesDeterministic SearchTemporal Queries

Info-Theoretic ScoringSub-90ms32x Compression17 IntegrationsCross-PlatformNo API KeyLocal EmbeddingsConfidence ScoringDaily SummariesAutonomous CategorizationInfo-Theoretic ScoringSub-90ms32x Compression17 IntegrationsCross-PlatformNo API KeyLocal EmbeddingsConfidence ScoringDaily SummariesAutonomous Categorization

context

Never re-explain your codebase

A new session starts where the last one ended - decisions, conventions, and gotchas already in mind.

token tax

0 LLM tokens per write

Others invoke an LLM on every save. Memanto's write path costs nothing - verify it yourself.

recall

Stored → searchable, instantly

No indexing queue. Memories are findable the moment they're written - in under 90 ms.

setup

One pip install. Nothing else.

No vector database, no schema, no rerankers, no backend service to keep alive.

Origin Story

Identifying the gaps in
AI agent memory

We built Moorcheh.aifirst, the only serverless vector search that delivers this level of recall efficiency at scale. While building it, we kept running into agents that forgot everything between sessions. We asked Claude what causes agent memory to fail, it pointed to passive, static context. Six gaps. We built MEMANTO around exactly those six problems, and it wouldn't be possible without Moorcheh.ai's serverless vector infrastructure underneath.

Irrelevant memory dumps

Memory arrives as a blob.

Outdated memories

Old notes weigh as much as new.

Unknown memory sources

Stated, inferred, or stale, unclear.

Representative model reply

“My memory exists as a static snapshot injected into context, useful, but fundamentally passive. I can't query it, update it mid-conversation, or distinguish ‘I know this’ from ‘I was told this once.’”

Tap a problem to see an example

All memories grouped together

All memory types collapsed flat.

Memory contradiction

Conflicts never reconcile.

Long overhead ingestion

Indexing lag and server overhead.

6 design principles

Relevant results only

Instead of overloading your agent with data, Memanto finds only the exact information needed for the current task.

✓

Prioritizes new info

✓

Verifiable sources

✓

Smartly categorized

✓

Resolves contradictions

✓

Instant memory ingestion

✓

Relevant results only

127.0.0.1:8000/ui

terminal

memanto v0.2.0on-prem · localhost:8080

memory core

Claude CodeCLI

CursorIDE

WindsurfIDE

Codex CLIOpenAI

ClineAgent

GitHub CopilotEditor

API AgentsREST·MCP

VS CodeEditor

Gemini CLIGoogle

Prefers TypeScript

Avoids recursive bug

Uses Supabase

Tab indent, 2 spaces

Tailwind v4

Vitest over Jest

Claude Code

Codex CLI

Cursor

Windsurf

Antigravity

Gemini CLI

Cline

Continue

OpenCode

Goose

Roo Code

GitHub Copilot

Augment Code

$memantoconnectclaude-code

Frameworks & Agent Platforms

Built-in MEMANTO memory agent on NemoClaw agents
Semantic retrieval across sessions with zero-cost ingestion
Agentic calls powered by Moorcheh's native LLM, no extra API keys needed
Open-source and self-hostable

View on GitHub How it's built

Feature	Mem0	Zep	Letta	LangMem	MemantoBest
RememberStore agent memories
RecallSemantic search & retrieval
AnswerMemanto onlyLLM-grounded response from memory
Instant IngestionMemories available instantly after write
Conflict ResolutionAutomated contradiction detection
Semantic Memory Types13 built-in memory categories
Multi-Agent NamespacesIsolated memory per agent
No External API KeyBuilt-in LLM proxy, no setup