Starred repositories
llama.cpp fork with additional SOTA quants and improved performance
Fast and memory efficient c++ flat hash table/map/set
Decompilation of 3D Pinball for Windows – Space Cadet
Library for reducing tail latency in RAM reads
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
State-of-the-Art Embeddings, Retrieval, and Reranking
Animated sprite editor & pixel art tool -- Fork of the last GPLv2 commit of Aseprite
Entity-level git merge driver. Resolves false conflicts git invents when independent agents edit the same file. ~95% reduction vs. line-based merge.
16 bytes fixed size image placeholder, an alternative to blurhash and thumbhash
Ultra fast and portable Parakeet implementation for on-device inference in C++ using Axiom with MPS+Unified Memory
The Swiss Army Knife of Offline AI. Chat, Speak, and Generate Images - Privacy First, Zero Internet. Download an LLM and use it on your mobile device. No data ever leaves your phone. Supports text-…
A simple, high-quality voice conversion tool focused on ease of use and performance.
Inference server for MioTTS, a lightweight and fast LLM-based TTS model.
Complete end-to-end setup for maximizing DGX Spark compute for AI Workloads
A tiny, single-header <canvas>-like 2D rasterizer for C++
QR designer web app with a novel method of designing qr codes that does not take advantage of error correction
Software for decoding classical and quantum codes
Web Extension for saving a faithful copy of a complete web page in a single HTML file