Stars
2
results
for sponsorable starred repositories
written in Python
Clear filter
A high-throughput and memory-efficient inference and serving engine for LLMs
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization