-
-
-
mlsys-notes Public
Learning notes for understanding modern Machine Learning System.
-
thread-chat Public
Forked from assistant-ui/assistant-uimodern chat interface with thread feature.
TypeScript MIT License UpdatedApr 4, 2026 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedApr 3, 2026 -
-
vllm-omni Public
Forked from vllm-project/vllm-omniA framework for efficient model inference with omni-modality models
Python Apache License 2.0 UpdatedMar 31, 2026 -
ssd Public
Forked from tanishqkumar/ssdA lightweight inference engine supporting speculative speculative decoding (SSD).
Python MIT License UpdatedMar 25, 2026 -
-
LMCache Public
Forked from LMCache/LMCacheSupercharge Your LLM with the Fastest KV Cache Layer
Python Apache License 2.0 UpdatedFeb 23, 2026 -
-
nanochat Public
Forked from karpathy/nanochatThe best ChatGPT that $100 can buy.
Python MIT License UpdatedFeb 20, 2026 -
compaction Public
Forked from adamzweiger/compactionAlgorithms for latent compaction
Python MIT License UpdatedFeb 19, 2026 -
-
-
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
MLIR MIT License UpdatedJan 23, 2026 -
-
sglang Public
Forked from sgl-project/sglangSGLang is a high-performance serving framework for large language models and multimodal models.
Python Apache License 2.0 UpdatedJan 17, 2026 -
-
spinningup-with-gui Public
Forked from openai/spinningupAn educational resource to help anyone learn deep reinforcement learning.
Python MIT License UpdatedAug 21, 2025 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedAug 16, 2025 -
AMD ROCm™ Software - GitHub Home
Shell MIT License UpdatedJul 10, 2025 -
homework_fall2023 Public
Forked from berkeleydeeprlcourse/homework_fall2023Jupyter Notebook MIT License UpdatedDec 6, 2024 -
RouteLLM Public
Forked from lm-sys/RouteLLMA framework for serving and evaluating LLM routers - save LLM costs without compromising quality
Python Apache License 2.0 UpdatedAug 10, 2024 -
annotated-transformer Public
Forked from harvardnlp/annotated-transformerAn annotated implementation of the Transformer paper.
Jupyter Notebook MIT License UpdatedApr 7, 2024