Starred repositories
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
Repo for SwiftVR: Real-Time One-Step Generative Video Restoration
Fully open reproduction of DeepSeek-R1
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web, then synthesizes a grounded summary
A Python interface to k-Wave GPU accelerated binaries
Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
Generative Acoustic Metamaterial – Design and Optimisation Pipeline
Official implementation of LittleBit (NeurIPS 2025) and its follow-up LittleBit-2 (ICML 2026)
ADHD — a skill for coding agents. Tree-of-thought with pruning, built on the Claude & Codex Agent SDK. Fans out parallel divergent thoughts under different cognitive frames, scores, prunes traps, d…
EdgeRazor: A Lightweight Framework for Large Language Models via Mixed-Precision Quantization-Aware Distillation
Mobile-first web terminal for monitoring AI agents and SSH sessions from your phone. Touch-optimized with soft keys, gestures, and session persistence. Self-hosted, source-available.
Code for "L2P: Unlocking Latent Potential for Pixel Generation"
Ghostty for the web with xterm.js API compatibility
Whispered-to-Normal Speech Conversion via Conditional Flow Matching (arXiv:2603.04296) — PyTorch reproduction with VAE, DiT, multilingual support, and Heun/Euler ODE solvers
First super-resolution model designed for Apple Neural Engine. 2x upscale, real-time, on-device. Built by Ben Racicot.
Pre-indexed code knowledge graph, auto syncs on code changes, for Claude Code, Codex, Gemini, Cursor, OpenCode, AntiGravity, Kiro, and Hermes Agent — fewer tokens, fewer tool calls, 100% local
Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.
Fast, lossless LLM inference via dual-view diffusion decoding.
26M function-calling model that runs on incredibly small devices
A sandboxed execution environment for AI agents via WASM
Local generative video panel for Apple Silicon. Wraps LTX-2 MLX, joint audio+video, one-click Pinokio install.
Headless CLI client for stateful Agent Client Protocol (ACP) sessions
MLX native implementations of state-of-the-art generative image models
[SIGGRAPH 2025] LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Chronos: Pretrained Models for Time Series Forecasting