Skip to content
View kzh's full-sized avatar
🌳
🌳

Organizations

@Team334

Block or report kzh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,392 220 Updated Dec 23, 2025

slime is an LLM post-training framework for RL Scaling.

Python 3,015 367 Updated Dec 26, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,002 781 Updated Dec 23, 2025

Aya is an eBPF library for the Rust programming language, built with a focus on developer experience and operability.

Rust 4,140 371 Updated Dec 26, 2025

Quantized LLM training in pure CUDA/C++.

C++ 224 14 Updated Dec 19, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,834 1,038 Updated Dec 24, 2025

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 433 15 Updated Dec 16, 2025

A Quirky Assortment of CuTe Kernels

Python 723 64 Updated Dec 23, 2025

Ongoing research training transformer models at scale

Python 14,717 3,415 Updated Dec 25, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,360 3,243 Updated Dec 25, 2025

NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…

C++ 427 48 Updated Dec 25, 2025

Public repository for the BeeGFS Parallel File System

C++ 185 33 Updated Dec 18, 2025

Manage your Hevy workouts, routines, folders, and exercise templates. Create and update sessions faster, organize plans, and search exercises to build workouts quickly. Stay synced with changes so …

TypeScript 80 19 Updated Dec 25, 2025

Lightweight coding agent that runs in your terminal

Rust 54,688 6,960 Updated Dec 26, 2025

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

Rust 5,058 370 Updated Dec 26, 2025

Asterinas is a secure, fast, and general-purpose OS kernel, written in Rust and providing Linux-compatible ABI.

Rust 4,068 253 Updated Dec 26, 2025

Deploy a Production Ready Kubernetes Cluster

Jinja 18,070 6,821 Updated Dec 25, 2025

A self-hosted dashboard that puts all your feeds in one place

Go 30,578 1,140 Updated Dec 10, 2025

CUDA Python: Performance meets Productivity

Cython 3,101 235 Updated Dec 24, 2025

Official Notion MCP Server

TypeScript 3,625 416 Updated Dec 24, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,983 3,873 Updated Dec 26, 2025

KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale

Go 1,037 127 Updated Dec 25, 2025

JavaScript animation engine

JavaScript 65,651 4,403 Updated Dec 3, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 5,150 581 Updated Aug 6, 2025

Model Context Protocol Servers

TypeScript 75,021 9,099 Updated Dec 19, 2025

⚓ A collection of high-performance JavaScript tools.

Rust 18,022 760 Updated Dec 26, 2025

Build smaller, faster, and more secure desktop and mobile applications with a web frontend.

Rust 100,419 3,247 Updated Dec 25, 2025

Highlight and capture the web in your favorite browser. The official Web Clipper extension for Obsidian.

TypeScript 2,793 278 Updated Nov 18, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,685 755 Updated Dec 25, 2025
Next