Lists (12)
Sort Name ascending (A-Z)
Stars
Supercharge Your LLM with the Fastest KV Cache Layer
Ultra-Sparse Adaptation of 1-Bit LLMs via XOR Patches
NUMA-Aware Contention-Free Dynamically-Auto-Tuning Bash-Native Streaming Parallelization Engine
The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm
Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model Runner, etc but with less moving parts and simple deployments b…
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels
I replicated Ng's RYS method and found that duplicating 3 specific layers in Qwen2.5-32B boosts reasoning by 17% and duplicating layers 12-14 in Devstral-24B improves logical deduction from 0.22→0.…
Magical utilities for your Svelte applications.
Reduce Claude Code, Codex, OpenCode wall clock and token use by 50% with open source, local semantic search. Works for small and large codebases and monorepos! Enterprise-ready and fully compliant …
Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity
Ready-to-use and customizable users management for FastAPI
Pure C inference of Mistral Voxtral Realtime 4B speech to text model
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
Compiler-based i18n library that emits tree-shakable translations, leading to up to 70% smaller bundle sizes.
Svelte AI Elements is a custom registry built on top of shadcn-svelte to help you build AI-native applications faster.
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Turso is an in-process SQL database, compatible with SQLite.
The most comprehensive authentication framework for TypeScript
Go bindings for WebRTC AudioProcessing module (echo cancellation, noise suppression, AGC, VAD)