6 6

264 followers · 393 following

Maine

Achievements

x3 x2 x4

Achievements

x3 x2 x4

Organizations

Stars

pithings / voipi

🎙️ Give your apps, CLIs, and agents a voice. VoiPi is a universal, zero-dependency, free text-to-speech library for JavaScript.

TypeScript 181 12 Updated Apr 26, 2026

acronjob / DS4_SM120_Flash_VLLM_Experiment

Cuda 3 1 Updated Apr 26, 2026

cloudflare / artifact-fs

ArtifactFS is a filesystem driver designed to mount large git repos as quickly as possible, hydrating file contents on-the-fly instead of blocking on the initial clone. It's ideal for agents, sandb…

Go 742 28 Updated Apr 23, 2026

huggingface / transformers-to-mlx

Agent Skill to help convert transformer LLMs to mlx-lm

Python 17 1 Updated Apr 16, 2026

scrya-com / rotorquant

KV cache compression via block-diagonal rotation. Beats TurboQuant: better PPL (6.91 vs 7.07), 28% faster decode, 5.3x faster prefill, 44x fewer params. Drop-in llama.cpp integration.

Python 921 78 Updated Apr 23, 2026

TheTom / turboquant_plus

Python 6,572 880 Updated Apr 25, 2026

jundot / omlx

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 11,647 1,011 Updated Apr 24, 2026

togethercomputer / saw-int4

Official implementation of Paper "System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving"

Shell 13 1 Updated Apr 17, 2026

Luce-Org / lucebox-hub

Lucebox optimization hub: hand-tuned LLM inference, built for specific consumer hardware.

C++ 1,009 80 Updated Apr 26, 2026

bstnxbt / dflash-mlx

Lossless DFlash speculative decoding for MLX on Apple Silicon

Python 585 33 Updated Apr 24, 2026

lukealonso / b12x

Python 55 8 Updated Apr 25, 2026

voipmonitor / rtx6kpro

RTX 6000 Pro Wiki — Running Large LLMs (Qwen3.5-397B, Kimi-K2.5, GLM-5) on PCIe GPUs without NVLink

Python 220 19 Updated Apr 27, 2026

rhysd / actionlint

Static checker for GitHub Actions workflow files

Go 3,818 215 Updated Apr 19, 2026

siddharthvaddem / openscreen

Create stunning demos for free. Open-source, no subscriptions, no watermarks, and free for commercial use. An alternative to Screen Studio.

TypeScript 33,038 2,224 Updated Apr 27, 2026

modular / modular

The Modular Platform (includes MAX & Mojo)

Mojo 25,910 2,802 Updated Apr 27, 2026

always-further / nono

nono - a capability-based, multiplexing sandbox tool, built for developers - lift'n'shift seamless path to prod. Run agents securely without needing any additional infra, zero setup, zero latency.

Rust 2,129 150 Updated Apr 25, 2026