PhD @ Princeton &
声豚
-
Princeton
- SF, CA
-
09:26
(UTC -07:00) - https://oasis-git.github.io/
Stars
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
High-speed Large Language Model Serving for Local Deployment