Busibox

A self-hosted AI platform that keeps your data on your infrastructure.

Busibox integrates document processing, semantic search, AI agents, and custom applications into a single platform — running on Docker or Proxmox LXC containers with enterprise-grade security baked in.

Think of it as a Linux distribution for AI: install it on your hardware, and you get a complete stack for building AI-powered workflows without sending a single byte to the cloud (unless you choose to).

Why Busibox?

You shouldn't have to choose between powerful AI and data sovereignty.

Most AI platforms require uploading your documents to third-party servers. Busibox runs entirely on infrastructure you control — a local server, a Proxmox cluster, or Docker on your laptop. Your documents, embeddings, conversations, and search indexes never leave your network.

Problem	How Busibox Solves It
Sensitive data can't go to cloud AI	Everything runs locally — LLMs, embeddings, vector search
AI tools are fragmented	One platform: documents, search, agents, and apps share auth and data
Building AI apps is slow	App template + shared library + deploy in minutes, not weeks
Access control is an afterthought	Zero Trust auth, RBAC, and PostgreSQL Row-Level Security from day one
Infra is painful to manage	Ansible IaC, unified `make` interface, one command to deploy

What You Get

Document Processing

Upload PDFs, Word, Excel, PowerPoint, images (with OCR), or Markdown. Busibox automatically extracts text, chunks it, generates embeddings, and indexes everything for search. Schema-driven extraction pulls structured fields (dates, names, amounts) from unstructured documents.
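
The chunking step can be pictured with a minimal sketch. It assumes fixed-size character windows with overlap; the source does not state Busibox's actual chunking strategy, and `chunk_text`, `max_chars`, and `overlap` are hypothetical names used only for illustration.

```python
def chunk_text(text: str, max_chars: int = 200, overlap: int = 40) -> list[str]:
    """Split text into fixed-size character windows that overlap.

    Illustrative only: the real pipeline may split on sentences,
    tokens, or document structure instead of raw characters.
    """
    if not 0 <= overlap < max_chars:
        raise ValueError("overlap must be non-negative and smaller than max_chars")
    step = max_chars - overlap  # how far each window advances
    chunks: list[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + max_chars])
        if start + max_chars >= len(text):
            break  # the last window already reaches the end of the text
    return chunks
```

The overlap means a sentence that straddles a chunk boundary still appears whole in at least one chunk, which tends to help retrieval quality.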

Hybrid Search

Natural language queries against your document library. Combines vector search (semantic), BM25 (keyword), graph-based retrieval, and LLM reranking — all filtered by the user's permissions. Ask "What are the budget assumptions for Q3?" instead of guessing keywords.
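
One common way to combine several ranked result lists is reciprocal rank fusion (RRF). The sketch below is an assumption made for illustration: the source does not say which fusion method Busibox uses, and `reciprocal_rank_fusion`, `allowed`, and the document IDs are hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(
    rankings: list[list[str]], allowed: set[str], k: int = 60
) -> list[str]:
    """Fuse ranked lists of document IDs, keeping only permitted documents.

    Each list contributes 1 / (k + rank) per document; documents the
    user may not see are dropped before any score is computed.
    """
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        visible = [doc_id for doc_id in ranking if doc_id in allowed]
        for rank, doc_id in enumerate(visible, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document ID for stability.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


vector_hits = ["d1", "d2", "d3"]   # hypothetical semantic results
keyword_hits = ["d3", "d4", "d1"]  # hypothetical BM25 results
fused = reciprocal_rank_fusion([vector_hits, keyword_hits], allowed={"d1", "d3", "d4"})
```

Here `d2` never appears in `fused` because it is outside the caller's permitted set, so the permission check happens before ranking rather than after.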

AI Agents

Conversational agents that search your documents (RAG), browse the web, accept file attachments, remember context, and stream responses with source citations. Configure agents with custom instructions, tools, and model routing per task.

Hybrid LLM Routing

LiteLLM gateway routes requests to local models (vLLM on NVIDIA GPUs, MLX on Apple Silicon) or cloud providers (OpenAI, Anthropic, AWS Bedrock) — per agent, per task. Use a fast local model for extraction and a frontier model for complex reasoning, all through one API.
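
Per-task routing can be thought of as a lookup table from task type to model. The following is a hedged sketch, not LiteLLM configuration: the task names, model names, and the `allow_cloud` switch are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    model: str   # hypothetical model alias, not a real deployment name
    local: bool  # True when the model runs on your own hardware


ROUTES = {
    "extraction": Route(model="local-fast-model", local=True),
    "reasoning": Route(model="cloud-frontier-model", local=False),
}
DEFAULT_ROUTE = ROUTES["extraction"]


def pick_route(task: str, allow_cloud: bool = True) -> Route:
    """Return the route for a task, falling back to a local model.

    Unknown tasks, and cloud routes when cloud use is disabled,
    both resolve to the local default.
    """
    route = ROUTES.get(task, DEFAULT_ROUTE)
    if not route.local and not allow_cloud:
        return DEFAULT_ROUTE
    return route
```

Keeping the cloud decision in one place makes it straightforward to run fully offline: pass `allow_cloud=False` and every task resolves to a local model.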

Custom Applications

Build and deploy Next.js apps that inherit Busibox auth, data access, and AI capabilities. A shared library (@jazzmind/busibox-app) provides SSO, data API clients, chat components, and search — so you write domain logic, not plumbing.

Bridge Channels

Connect agents to Telegram, Signal, Discord, WhatsApp, and email. Users interact with AI in the tools they already use.

Architecture

┌─────────────────────────────────────────────────────────┐
│                       Browser                           │
└────────────────────────┬────────────────────────────────┘
                         │
                    ┌────▼────┐
                    │  nginx  │  reverse proxy + SSL
                    └────┬────┘
            ┌────────────┼────────────────┐
            ▼            ▼                ▼
      ┌──────────┐ ┌──────────┐    ┌───────────┐
      │  Portal  │ │  Agents  │    │ User Apps │
      └────┬─────┘ └────┬─────┘    └─────┬─────┘
           └─────────────┼────────────────┘
                         ▼
        ┌────────────────────────────────┐
        │         API Layer              │
        │  AuthZ · Data · Agent · Search │
        │  Docs · Deploy · Embedding     │
        └───────────────┬────────────────┘
                        ▼
        ┌────────────────────────────────┐
        │       Infrastructure           │
        │  PostgreSQL · Milvus · MinIO   │
        │  Redis · LiteLLM · vLLM        │
        └────────────────────────────────┘

Each service runs in its own isolated container, so a compromise of one container does not grant access to another. All inter-service communication uses audience-scoped RS256 JWTs verified via JWKS, with no shared secrets and no static tokens.

Security Model

Busibox treats security as architecture, not a feature bolted on later.

Zero Trust Authentication — AuthZ is the sole token authority. RS256-signed JWTs verified via JWKS; subject token exchange scopes tokens per service.
Row-Level Security — PostgreSQL RLS enforces access at the database level. Even with an application bug, the database won't return unauthorized rows.
RBAC Everywhere — Documents, agents, and apps are assigned to roles. Users see only what their roles permit. Agents inherit the calling user's permissions.
Envelope Encryption — Files encrypted at rest with Master Key → Key Encryption Keys → Data Encryption Keys, per file.
Passwordless Auth — Passkeys (biometrics/security keys), TOTP, or magic links. No passwords by design.
Audit Trail — Auth events, token exchanges, and admin actions logged with timestamps, user IDs, and IP addresses.
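
The RBAC rule that agents inherit the calling user's permissions can be sketched as a set intersection. This is an illustrative model only: the role names, document IDs, and function names are hypothetical, and in Busibox the enforcement described above happens in the database, not in application code like this.

```python
def visible_documents(user_roles: set[str], doc_roles: dict[str, set[str]]) -> set[str]:
    """Return IDs of documents assigned to at least one of the user's roles."""
    return {doc_id for doc_id, roles in doc_roles.items() if roles & user_roles}


def agent_visible_documents(
    calling_user_roles: set[str], doc_roles: dict[str, set[str]]
) -> set[str]:
    """An agent sees exactly what the user who invoked it can see.

    In this sketch the agent has no roles of its own, so it cannot
    widen access beyond the caller's.
    """
    return visible_documents(calling_user_roles, doc_roles)


doc_roles = {
    "budget.pdf": {"finance"},
    "handbook.pdf": {"finance", "staff"},
    "contracts.pdf": {"legal"},
}
staff_view = agent_visible_documents({"staff"}, doc_roles)
```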

Who It's For

Enterprise teams that need AI but can't send sensitive data to third parties
Consultancies building AI solutions for clients on shared infrastructure
Regulated industries (legal, finance, healthcare, government) with data residency and audit requirements
AI-native organizations that want control without stitching together a dozen tools

Quick Start

All operations use the unified make interface. It handles secrets injection and environment detection, and it runs inside a manager container with guaranteed dependencies.

# Deploy infrastructure (PostgreSQL, Redis, MinIO, Milvus)
make install SERVICE=infrastructure

# Deploy all API services
make install SERVICE=apis

# Deploy the portal and agent manager
make install SERVICE=frontend

# Check status
make manage SERVICE=all ACTION=status

# View logs
make manage SERVICE=agent ACTION=logs

# Interactive menus
make            # Main launcher
make install    # Installation wizard
make manage     # Service management
make test       # Testing menu

See docs/administrators/ for full deployment and configuration guides.

For Developers

Build Apps on Busibox

The app template and @jazzmind/busibox-app library give you:

SSO out of the box — SessionProvider handles auth, token refresh, and 401 retry
Data API client — structured CRUD with automatic RLS enforcement
Chat components — SimpleChatInterface for agent-powered UIs
Search client — hybrid search with permission filtering
Deploy in one command — make install SERVICE=my-app

# Start from the template
git clone <template-repo> my-app
cd my-app && npm install && npm run dev

Apps are cloned and built at runtime — code changes deploy without rebuilding containers.

MCP Servers for Cursor

Three MCP servers provide structured access to documentation, testing, and deployment:

Server	For	What It Does
`mcp-core-dev`	Core developers	Docs, scripts, testing, container logs
`mcp-app-builder`	App developers	Auth patterns, template reference, service endpoints
`mcp-admin`	Operators	Deployment, SSH, container management

make mcp   # Build all servers and write Cursor config

Technology Stack

Layer	Components
Compute	Proxmox VE (LXC) or Docker
Provisioning	Ansible, Bash
APIs	FastAPI (Python 3.11+)
Apps	Next.js 16, React 19, TypeScript 5
Database	PostgreSQL 15+ with RLS
Vector Search	Milvus 2.3+
Object Storage	MinIO (S3-compatible)
Queue	Redis Streams
LLM Gateway	LiteLLM → vLLM, Ollama, OpenAI, Anthropic, Bedrock
Reverse Proxy	nginx with SSL
Auth	RS256 JWTs, JWKS, Zero Trust token exchange

Documentation

Audience	Location	Content
Administrators	docs/administrators/	Deployment, configuration, troubleshooting
Developers	docs/developers/	Architecture, APIs, security, app development
Users	docs/users/	Feature guides, document management, chat, search

Project Structure

busibox/                        # This repo — infrastructure, APIs, provisioning
├── docs/                       #   Documentation (by audience)
├── srv/                        #   Service source code
│   ├── agent/                  #     Agent API (FastAPI)
│   ├── data/                   #     Data API + Ingest Worker
│   ├── docs/                   #     Docs API
│   └── deploy/                 #     Deploy API
├── provision/
│   ├── ansible/                #     Ansible roles and inventory
│   └── pct/                    #     Proxmox container scripts
├── scripts/                    #   Admin workstation scripts
├── tools/                      #   MCP servers and utilities
└── specs/                      #   Project specifications

Related repositories:

Repo	What It Contains
busibox-frontend	All frontend apps (Portal, Agents, Admin, Chat, App Builder, Media, Documents) and the `@jazzmind/busibox-app` shared library
busibox-template	Starter template for building new apps on the Busibox platform

Contributing

Read the architecture docs in docs/developers/architecture/
Set up a local development environment with Docker: make install SERVICE=all
Run tests: make test-docker SERVICE=<service>
Follow the organization rules in .cursor/rules/ for file placement and naming

See CLAUDE.md for detailed development workflow and conventions.

