- bench_serving (Public)
  Forked from kimbochen/bench_serving
  Python · Apache License 2.0 · Updated Dec 11, 2025
- sglang (Public)
  Forked from sgl-project/sglang
  SGLang is a fast serving framework for large language models and vision language models.
  Python · Apache License 2.0 · Updated Nov 14, 2025
- Mooncake (Public)
  Forked from kvcache-ai/Mooncake
  Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
  C++ · Apache License 2.0 · Updated Nov 14, 2025
- TinyZero (Public)
  Forked from Jiayi-Pan/TinyZero
  Clean, minimal, accessible reproduction of DeepSeek R1-Zero
  Python · Apache License 2.0 · Updated Apr 2, 2025
- exllamav2 (Public)
  Forked from turboderp-org/exllamav2
  A fast inference library for running LLMs locally on modern consumer-class GPUs
  Python · MIT License · Updated Feb 29, 2024
- transformers (Public)
  Forked from huggingface/transformers
  🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
  Python · Apache License 2.0 · Updated Dec 19, 2023