Starred repositories
4
results
for source starred repositories
written in C
Clear filter
Dynamic Memory Management for Serving LLMs without PagedAttention
Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.