Stars
Research prototype of PRISM — a cost-efficient multi-LLM serving system with flexible time- and space-based GPU sharing.
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
GIR-Bench: Versatile Benchmark for Generating Images with Reasoning
[ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.