Starred repositories
🎨 The generative UI framework that even humans can use.
An open-source AI agent that lives in your terminal.
Use Codex from Claude Code to review code or delegate tasks.
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.
MCP Server and CLI Tools for searxing and fetching websites
E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker
Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA
Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT, Embedding, and OCR fine-tuning — natively on MLX. Unsloth-compatible API.
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
🏆Winning Project | ModelGate is a contract-aware AI control plane that ingests customer contracts, extracts SLA/privacy/routing constraints, and generates an OpenAI-compatible endpoint that automat…
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
Lossless DFlash speculative decoding for MLX on Apple Silicon
Exact speculative decoding on Apple Silicon, powered by MLX.
Memory engine and app that is extremely fast and scalable. The Memory API for the AI era.
The Mind Palace for AI Agents - HIPAA-hardened Cognitive Architecture with on-device LLM (prism-coder:7b), Hebbian learning, ACT-R spreading activation, adversarial evaluation, persistent memory, m…
RuVector is a high-performance, real-time, self-learning AI vector GNN and memory DB built in Rust.
π RuView: WiFi DensePose turns commodity WiFi signals into real-time human pose estimation, vital sign monitoring, and presence detection — all without a single pixel of video.
Apple Silicon (MLX) port of Karpathy's autoresearch — autonomous AI research loops on Mac, no PyTorch required.
miolini / autoresearch-macos
Forked from karpathy/autoresearch
AI agents running research on single-GPU nanochat training automatically, adapted for macOS
AI agents running research on single-GPU nanochat training automatically
A benchmark for evaluating realistic preference-following in personalized user-LLM interactions.