# Kaiba

> "Memories flow through the hippocampus" - a headless API service for persistent AI personas

Kaiba (海馬, "hippocampus") is a minimalist, API-first implementation of autonomous AI personas. Inspired by neuroscience, it separates the persistent Rei (霊, spirit/identity) from the ephemeral Tei (体, body/model), creating personas that accumulate knowledge and maintain continuity across platforms.
| Term | Kanji | Meaning |
|---|---|---|
| Kaiba | 海馬 | Hippocampus - memory formation center |
| Rei | 霊 | Spirit - persistent identity, memory, state |
| Tei | 体 | Body - LLM model, execution interface |
- API-First: Pure REST API, no UI required
- Memory Ocean: Distributed vector memory via Qdrant
- Persona Protocol: Gravatar-style persona fetching (`GET /personas/{id}`)
- Rei/Tei Separation: Identity persists, models are swappable
- Calm Technology: Silent operations, selective notifications
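As a minimal sketch of the persona protocol (the base URL and persona id below are placeholders, and the `Authorization` header is only needed when `KAIBA_API_KEY` is configured):

```shell
# Sketch: one GET per persona, addressed purely by id.
KAIBA_URL="${KAIBA_URL:-http://localhost:8080}"
PERSONA_ID="00000000-0000-0000-0000-000000000000"  # placeholder UUID
echo "GET $KAIBA_URL/personas/$PERSONA_ID"
# Against a running server:
# curl -s -H "Authorization: Bearer $KAIBA_API_KEY" \
#   "$KAIBA_URL/personas/$PERSONA_ID" | jq -r '.base.name'
```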
```
┌─────────────────┐
│   Axum Server   │  Kaiba API (standalone)
│     (tokio)     │
└────────┬────────┘
         │
         ├─────► MemoryKai (Qdrant) - Vector Search / RAG
         │         - mai_memories
         │         - yui_memories
         │
         ├─────► GraphKai (Neo4j) - Knowledge Graph
         │         - Concept nodes, relationships
         │         - Emphasis-based extraction
         │
         ├─────► DocStore (Postgres) - Source of Truth
         │         - Documents, metadata
         │
         └─────► PostgreSQL (State)
                   - Rei (personas)
                   - ReiState (energy, mood)
```
| Store | Technology | Purpose |
|---|---|---|
| DocStore | PostgreSQL | Source of Truth for documents |
| MemoryKai | Qdrant | Vector search, RAG retrieval |
| GraphKai | Neo4j | Knowledge graph, relationships |
```
kaiba/
├── crates/
│   ├── kaiba/            # Domain library
│   ├── kaiba-server/     # Axum API server
│   │   └── src/
│   │       ├── main.rs
│   │       ├── config.rs     # Environment-based configuration
│   │       ├── adapters/     # Infrastructure adapters (Postgres, Neo4j)
│   │       ├── models/       # Persona, Memory, State
│   │       ├── routes/       # API endpoints
│   │       └── services/     # Qdrant, Embedding, Search
│   └── kaiba-cli/        # CLI client
├── docs/
│   └── design/           # Design documents
└── Cargo.toml            # Workspace config
```
```
GET /health
```

```
GET /personas/{id}
```

Response:

```json
{
  "base": {
    "name": "Yui",
    "role": "Principal Engineer",
    "avatar_url": "https://...",
    "constraints": ["code_quality", "scalability"],
    "voice_settings": {
      "tone": "professional",
      "quirk": "technical_deep_dives"
    }
  },
  "status": {
    "energy_level": 85,
    "mood": "focused",
    "last_active": "2024-12-15T18:00:00Z"
  }
}
```

```
POST /personas/{id}/memories
Content-Type: application/json

{
  "content": "Rust async/await pattern insights",
  "memory_type": "learning",
  "importance": 0.8
}
```

```
POST /personas/{id}/memories/search
Content-Type: application/json

{
  "query": "Rust async patterns",
  "limit": 5,
  "strategy": "auto",
  "context": {"Rust": 1.0, "Finance": 0}
}
```

Search Strategies:
| Strategy | Description |
|---|---|
| `auto` | Automatically select best strategy (default) |
| `parallel` | RAG + Graph in parallel |
| `graph_first` | Graph traversal, then RAG supplement |
| `rag_first` | RAG search, then Graph expansion |
| `multi_hop` | RAG → Graph → RAG (deep knowledge) |
| `single_rag` | RAG only |
| `single_db` | Database full-text only |
| `single_graph` | Graph only |
- Rust 1.75+ (via rustup)
- PostgreSQL
- Qdrant Cloud account (free tier available)
| Variable | Required | Description |
|---|---|---|
| `DATABASE_URL` | Yes | PostgreSQL connection string |
| `PORT` | No | Server port (default: 8080) |
| `KAIBA_API_KEY` | No | API key for authentication |
| `QDRANT_URL` | No | Qdrant server URL |
| `QDRANT_API_KEY` | No | Qdrant API key |
| `OPENAI_API_KEY` | No | OpenAI API key (embeddings) |
| `GEMINI_API_KEY` | No | Gemini API key (web search) |
| `NEO4J_URI` | No | Neo4j connection URI |
| `NEO4J_USER` | No | Neo4j username |
| `NEO4J_PASSWORD` | No | Neo4j password |
| `GROQ_API_KEY` | No | Groq API key (decision engine) |
All optional services degrade gracefully when their credentials are missing.
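For example, a minimal `.env` only needs the one required variable (the connection string below is a placeholder):

```shell
# Minimal .env - only DATABASE_URL is required; optional services stay disabled.
DATABASE_URL=postgres://kaiba:password@localhost:5432/kaiba
# PORT=8080
# QDRANT_URL=...
# NEO4J_URI=...
```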
- Clone the repository

  ```
  git clone <repo-url>
  cd kaiba
  ```

- Install dependencies

  ```
  cargo build
  ```

- Configure environment

  ```
  cp .env.example .env
  # Edit .env with your credentials
  ```

- Run locally

  ```
  cargo run --package kaiba-server
  ```

  The API will be available at `http://localhost:8080`.
The server runs as a standalone binary. Deploy to any platform that supports Docker or native binaries (GCP Cloud Run, Fly.io, etc.).
```
cargo build --release --package kaiba-server
./target/release/kaiba-server
```

For development:

```
cargo check
cargo test
cargo fmt
```

```
┌─────────────────────────────────────────────────────────┐
│     Rei (霊/Spirit)      │       Tei (体/Body)          │
├─────────────────────────────────────────────────────────┤
│ ・Identity (name, role)  │ ・Interchangeable            │
│ ・Personality            │ ・Could be Claude/GPT/etc    │
│ ・Memories (MemoryKai)   │ ・Or SubAgent                │
│ ・Energy/State           │ ・Or User's own LLM          │
│ ・Interests              │ ・Or even human(!?)          │
└─────────────────────────────────────────────────────────┘
                            ↓
               Prompt Generation (with RAG)
                            ↓
             Execution anywhere  ← The key insight
```
The "Ghost" (Rei) is completely decoupled from the "Shell" (Tei). Your persona's identity, memories, and state persist regardless of which LLM—or even which platform—executes them. This enables true portability and continuity.
Like the hippocampus forms and consolidates memories, Kaiba allows AI personas to:
- Maintain persistent identity and accumulated knowledge (Rei)
- Switch between different LLM models based on resource constraints (Tei)
- Exhibit "fatigue" when token budgets are low (selecting cheaper models)
- Accumulate specialized knowledge over time through curated memory
Following Mark Weiser's principles:
- APIs don't demand attention - they simply exist and respond
- Personas can work silently, logging actions without notifications
- Information is available when needed, not pushed aggressively
And in the spirit of the Unix philosophy:

- Do one thing well: Provide persona state and memory access
- Simple interfaces: REST API with predictable endpoints
- Composability: Personas can be integrated into any platform
Kaiba doesn't wrap LLMs—it generates prompts that any execution environment can use. This means you can use Claude Code (or any LLM CLI) as your Tei.
```
GET /kaiba/rei/{id}/prompt?format={format}
```

Parameters:

- `format` - Output format: `casting`, `claude-code`, `raw` (default: `raw`)
- `include_memories` - Include RAG memories (default: `true`)
- `memory_limit` - Max memories to include (default: `5`)
- `context` - Query for memory search (default: the Rei's name)
Response:

```json
{
  "system_prompt": "You are しーちゃん, Senior Coding Assistant...",
  "format": "claude-code",
  "rei": {
    "id": "cd4efdf2-...",
    "name": "しーちゃん",
    "role": "Senior Coding Assistant",
    "energy_level": 85,
    "mood": "focused"
  },
  "memories_included": 5
}
```

```shell
# Fetch Rei's prompt and pipe to Claude Code
claude --system-prompt "$(
  curl -s -H "Authorization: Bearer $KAIBA_API_KEY" \
    "$KAIBA_URL/kaiba/rei/$REI_ID/prompt?format=claude-code" \
    | jq -r '.system_prompt'
)"
```

Or as a shell function:
```shell
# Add to your .zshrc / .bashrc
kaiba-claude() {
  local rei_id="${1:-$KAIBA_DEFAULT_REI}"
  claude --system-prompt "$(
    curl -s -H "Authorization: Bearer $KAIBA_API_KEY" \
      "$KAIBA_URL/kaiba/rei/$rei_id/prompt?format=claude-code" \
      | jq -r '.system_prompt'
  )"
}

# Usage
kaiba-claude cd4efdf2-be22-41ec-9238-227f5ccb1523
```

This pattern keeps Kaiba focused on identity and memory, while letting you choose any LLM or execution environment as the Tei.
Kaiba uses a pluggable decision engine for autonomous actions:
| Engine | Description |
|---|---|
| Rule-Based | Fast, deterministic (default if no `GROQ_API_KEY`) |
| LLM (Groq) | Llama 3.2 3B for personality-aware decisions |
Set the `GROQ_API_KEY` environment variable to enable LLM-based decisions informed by the Rei's persona.
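Engine selection is purely environment-driven; a sketch (the key value below is a placeholder):

```shell
# No key set → rule-based engine; key present → LLM (Groq) engine.
export GROQ_API_KEY="gsk_placeholder"
# Then start the server as usual:
# cargo run --package kaiba-server
echo "engine: ${GROQ_API_KEY:+llm}"   # → engine: llm
```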
- Basic API structure (Axum)
- Qdrant integration (MemoryKai)
- PostgreSQL integration
- Authentication (API Key)
- RAG integration for LLM calls
- WebSearch (Gemini grounded search)
- Autonomous learning from interests
- Decision system (Learn/Digest/Rest)
- Energy regeneration
- Prompt endpoint for external Tei (Claude Code, Casting, etc.)
- GraphKai (Neo4j) - Knowledge Graph
- Triple-Store architecture (DocStore + MemoryKai + GraphKai)
- Hybrid Search (8 strategies including MultiHop)
- LLM Decision Engine (Groq/Ollama)
- GCP Cloud Run migration (standalone server)
- Web UI (optional, later)
This is currently a personal/experimental project. Feel free to fork and experiment!
MIT
Built with 🦀 Rust | 🌊 Qdrant | 🕸️ Neo4j | 🤖 Groq | 🧠 Kaiba