Pinned
-
vllm-project/vllm (Public): A high-throughput and memory-efficient inference and serving engine for LLMs
-
vllm-project/vllm-omni (Public): A framework for efficient model inference with omni-modality models
-
kvcache-ai/Mooncake (Public): Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
-
dragonflydb/dragonfly (Public): A modern replacement for Redis and Memcached
-
RL-Align/RL-Kernel (Public): Modern RL post-training infrastructure, optimized for NVIDIA/AMD GPUs with a focus on vLLM and DeepSpeed integration, CUDA/ROCm/Triton kernels, and transparent hardware-aware scaling.