- Austin, TX
- www.joshkasuboski.com
Starred repositories
FastContext: Training Efficient Repository Explorer for Coding Agents
Makes your AI agent think like the laziest senior dev in the room. The best code is the code you never wrote.
High-performance, graph-based code-intelligence engine for AI agents and IDEs. Supports 257 languages and multiple repositories, with access via CLI, MCP server, and API. AI coding agents teammate - ex…
HelixDB is an OLTP graph-vector database built in Rust.
How to Build Robots and Make Them Move
🛠️ The meta-harness for AI agents — scaffold your own focused, branded agent harness with its own npx CLI, MCP server, memory, learning loop, and witness-signed releases. Works with Claude Code, Co…
A fast, helpful, and open-source document parser
Semantic version control => entity-level diffs, blame, and impact analysis on top of git. 28 languages via tree-sitter. Built for coding agents.
A minimal codebase to generate synthetic coding agent session traces
CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies
Distributed event stream server over HTTP, backed by S3.
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
A Python framework for self-hosted LLM tool-calling and multi-step agentic workflows
🌱 Private, quiet space for thinking. Simple app for .md files. LLM-friendly.
Fast and Accurate Code Search for Agents. Uses ~98% fewer tokens than grep+read
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Orchestrate multiple coding agents from desktop and mobile
Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level
Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…
Distributed WASM runtime. Workloads place themselves over a zero-trust mesh. One static binary.