dirmacs

DIRMACS

Open-source Rust infrastructure for agentic AI. We build it, we run it, we ship it.

The Problem

AI agents hallucinate. They fabricate data, lose context between sessions, and can't distinguish what they know from what they're guessing. Deploy them at scale — across tenants, across tools, across multi-step workflows — and there's no infrastructure to hold it all together. No memory that persists. No confidence tracking. No constraint on what an agent can and cannot claim.

We're building that infrastructure. In Rust. In the open.

How It All Fits Together

Memory: eruka

It starts with eruka — a context intelligence engine that gives AI agents structured, stateful memory.

Every piece of business context gets a confidence state: CONFIRMED (user verified, ground truth), INFERRED (AI extracted, high confidence), UNCERTAIN (was confirmed, now stale), or UNKNOWN (needed but missing). This isn't metadata — it's enforced. Before an agent generates content, Eruka checks readiness and injects constraints into the system prompt: "DO NOT fabricate: revenue figures. This field is UNKNOWN." The agent literally cannot hallucinate data it doesn't have.

Eruka provides workspace isolation for multi-tenant deployments, a knowledge graph with typed relationships and temporal validity, gap detection that identifies what's missing before generation begins, a quality scoring pipeline that catches contradictions and ungrounded claims, and a three-tier memory system (core, working, archival) with automatic staleness detection and reclassification.

For self-hosting, openeruka is the open-source edition — single binary, SQLite backend, same knowledge state invariant. cargo install openeruka and you have a local memory server in one command. Types library, REST API, CLI, and MCP support included. On crates.io. docs

The bridge between Eruka and the AI tools people actually use is eruka-mcp — an MCP (Model Context Protocol) server that connects Claude, Cursor, VS Code, and any MCP-compatible client to Eruka's knowledge states. Install from crates.io, point at your Eruka instance, and your AI assistant gains structured memory with anti-hallucination guarantees. Tier-gated tools, service key authentication, input validation, and scope enforcement are built in. docs →

Runtime: ares

The agents themselves run on ares — an agentic AI server built in Rust. ARES routes requests across inference providers (NVIDIA NIM, Ollama, Anthropic), manages structured tool calling with retry logic, handles RAG with document ingestion, integrates MCP servers as first-class tool providers, and meters usage per tenant with quota enforcement. It exposes an OpenAI-compatible API, so any client that speaks OpenAI can use it without modification. Multi-tenant by default — each tenant gets isolated agents, keys, and usage tracking.

Context Engineering: thulp

Agents need more than an LLM and a database. They need to discover tools, validate inputs, follow multi-step workflows, and maintain session context across turns. thulp handles execution context engineering — a unified abstraction over local Rust functions, MCP servers, and OpenAPI endpoints. It provides a query DSL for tool discovery, skill workflows that chain tools into reusable sequences, and session management that tracks state across agent turns. Thulp is the layer that makes agents composable — skills built from tools, workflows built from skills. docs →

Search: daedra

Every agent eventually needs to search the web. daedra is a self-contained web search MCP server with multiple backends and automatic fallback. Pure Rust, single binary. Works from any IP including datacenter and VPS. When one backend is down or rate-limited, Daedra transparently fails over to the next. Plug it into any MCP-compatible agent and it gains web search without configuration. docs →

The Coding Agent: pawan

When you need an AI agent that writes and fixes code using all of this infrastructure, there's pawan — a self-healing CLI coding agent. AST and LSP-powered tooling for precise code understanding. Streaming TUI with command palette, vim keybindings, and inline markdown rendering. Tiered model registry with automatic tool installation. Runs on NVIDIA NIM for cloud inference or local MLX for on-device. MIT licensed, zero telemetry, BYO model. Named after Power Star Pawan Kalyan. docs →

Skill Distillation: thulpoff

Large models can teach small models through structured instructions. thulpoff automates this: record a capable teacher LLM solving a task, extract the reusable patterns into a SKILL.md file, validate it works with a cheaper student model, and refine iteratively until the small model matches the large one on that specific task. Three LLM providers (Anthropic, NVIDIA NIM, OpenAI/Ollama), baseline comparison to measure actual skill lift, and a complete CLI (generate, eval, refine, list, runs). Pure Rust, no Python dependency. Inspired by HuggingFace's upskill, rewritten ground-up.

Code Intelligence: deagle

Your codebase as a queryable graph. deagle indexes source files into a SQLite-backed code graph using tree-sitter, then lets you search symbols, trace relationships, and analyze architecture — all from a single binary. 8 language parsers (Rust, Python, Go, TypeScript/JavaScript, Java, C, C++, Ruby), 4 search modes, 6 MCP tools, incremental indexing. Benchmarked with hyperfine: indexes a 94-file Rust project (3,486 entities) in 2.2 seconds, a 14-file project in 125ms. Single binary. docs →

The Supporting Stack

The core wouldn't hold together without the tooling around it:

dstack — Development stack for AI-assisted multi-repo work. Persistent memory (File + Eruka backends), cross-repo sync with ahead/behind tracking, VPS deployment with rollback, quality gates, and plugin scaffolding for 6 platforms (Claude Code, Cursor, Pawan, Codex, OpenCode, Gemini). Born from real production pain. On crates.io. docs →
dwasm — Production WASM build tool for Leptos frontends. Replaces trunk build --release with a five-stage pipeline that handles the wasm-opt bulk-memory compatibility issue that breaks modern Rust WASM builds, automates content hashing for cache busting, and patches index.html references. On crates.io. docs →
dui — Component library for Leptos WASM frontends. Accessible, signal-driven components with ARIA roles, keyboard navigation, and focus management. Dark-first design system with CSS custom properties. On crates.io. Powers every DIRMACS frontend — the admin dashboard, the Eruka dashboard, the client portals.
lancor — End-to-end llama.cpp toolkit in Rust. API client for llama.cpp servers, HuggingFace Hub integration for model discovery and download, server orchestration for managing llama.cpp instances, and a benchmark suite for measuring inference performance. docs →
aegis — System configuration manager. Typed TOML manifests that generate tool configs for the entire DIRMACS stack — dotfiles, infrastructure settings, model registries, agent configurations.
nimakai — NVIDIA NIM model latency benchmarker. Written in Nim. Measures ping latency, tool-use response time, and full agent task completion time across all available NIM models. Used internally to select the right model for each agent workload.

How We Operate

DolTARES and Doltdot

DolTARES is our Rust orchestration server — where the open-source pieces meet production. Powered by ARES, Thulp, and Daedra, it handles chat, workflow orchestration, scheduling, channel delivery (including WhatsApp via our Go bridge), self-healing, and long-horizon DAG execution. Declarative TOML-based DAGs define workflows as node graphs with aggregation, conditional branching, and runtime parameters.

Doltdot is the AI agent that runs on DolTARES. It's live in production — handling real tasks, research, development workflows, automated pipelines, and communication. We use it internally to run and improve the very infrastructure it sits on. The agent that builds itself.

DTrain — Our Operating Methodology

We run on DTrain — a 6-phase circular lifecycle that takes any operation from manual to autonomous:

DSprint (discover) → DBuild (develop) → DLaunch (deploy) → DWatch (monitor) → DTune (improve) → DGrow (scale) → repeat

DIRMACS is its own first client. Pawan executes the sprints. ARES runs the agents. Eruka holds the context. DolTARES orchestrates the workflows. Every piece of infrastructure serves every other piece.

Engineering Principles

Rust-first. Memory safety, performance, correctness. Agentic systems need to be reliable at runtime, not just at demo time. We run on a single VPS — every byte matters, every panic is felt.
Composability over monoliths. Each crate does one thing well. They compose through clean interfaces — Eruka doesn't know about ARES, ARES doesn't know about Thulp, but they all work together through MCP and structured APIs.
Verification over speed. "Autonomous AI execution without verification gates produces confident fiction." Every deployment is proven with actual command output, not assumed from passing CI.
NVIDIA downstream. We build on NVIDIA NIM as our primary inference layer. Downstream integrators with upstream compute.
Dogfooding. We run our own agents on our own infra. Pawan improves pawan. ARES serves ARES's agents. Doltdot builds the infrastructure Doltdot runs on. If it breaks, we feel it first.

Where We're Headed

Structured memory that doesn't decay into hallucination. Agents that know what they don't know. Workflows that run unsupervised for days. We're building this in Rust, in the open, on a single VPS that runs 24/7.

dirmacs.github.io · dirmacs.com · contact@dirmacs.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dirmacs

DIRMACS

The Problem

How It All Fits Together

Memory: eruka

Runtime: ares

Context Engineering: thulp

Search: daedra

The Coding Agent: pawan

Skill Distillation: thulpoff

Code Intelligence: deagle

The Supporting Stack

How We Operate

DolTARES and Doltdot

DTrain — Our Operating Methodology

Engineering Principles

Where We're Headed

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!