ApostaC

7 stars written in Python

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 70,159 stars · 13,414 forks · Updated Feb 12, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python · 6,885 stars · 898 forks · Updated Feb 12, 2026

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python · 2,162 stars · 366 forks · Updated Feb 12, 2026

Python · 164 stars · 24 forks · Updated Jul 15, 2025

Python · 151 stars · 23 forks · Updated Oct 9, 2024

The driver for LMCache core to run in vLLM

Python · 60 stars · 32 forks · Updated Feb 4, 2025

the frontend of lmcache

Python · 18 stars · 7 forks · Updated Dec 31, 2025