Lists (1)
Sort Name ascending (A-Z)
Stars
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
slime is an LLM post-training framework for RL Scaling.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Aya is an eBPF library for the Rust programming language, built with a focus on developer experience and operability.
DeepEP: an efficient expert-parallel communication library
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
Ongoing research training transformer models at scale
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…
Public repository for the BeeGFS Parallel File System
Manage your Hevy workouts, routines, folders, and exercise templates. Create and update sessions faster, organize plans, and search exercises to build workouts quickly. Stay synced with changes so …
Lightweight coding agent that runs in your terminal
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
Asterinas is a secure, fast, and general-purpose OS kernel, written in Rust and providing Linux-compatible ABI.
Deploy a Production Ready Kubernetes Cluster
A self-hosted dashboard that puts all your feeds in one place
CUDA Python: Performance meets Productivity
SGLang is a fast serving framework for large language models and vision language models.
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
Model Context Protocol Servers
⚓ A collection of high-performance JavaScript tools.
Build smaller, faster, and more secure desktop and mobile applications with a web frontend.
Highlight and capture the web in your favorite browser. The official Web Clipper extension for Obsidian.
A Datacenter Scale Distributed Inference Serving Framework