Skip to content
View cs-qyzhang's full-sized avatar

Highlights

  • Pro

Block or report cs-qyzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

107 stars written in Python
Clear filter

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 218 11 Updated Jul 24, 2025
Python 218 17 Updated Jan 23, 2025

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Python 168 14 Updated Sep 23, 2025

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python 157 29 Updated Jul 10, 2024

Implementation of BTree part for paper 'The Case for Learned Index Structures'

Python 149 42 Updated Dec 20, 2018

Persist and reuse KV Cache to speedup your LLM.

Python 112 35 Updated Nov 7, 2025

Modular and structured prompt caching for low-latency LLM inference

Python 102 10 Updated Nov 9, 2024

A simple Django app to render LaTeX templates and compile them into PDF files.

Python 86 30 Updated Aug 23, 2025

Python wrappers for calling LaTeX/building LaTeX documents.

Python 78 32 Updated Dec 13, 2023

NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading

Python 67 20 Updated Jun 16, 2025

Download & install fonts from Adobe Creative Cloud

Python 54 11 Updated Aug 7, 2017

eBPF Standard Documentation

Python 50 6 Updated Sep 14, 2024

[HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System

Python 49 7 Updated Jul 21, 2025

[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo

Python 48 5 Updated Aug 5, 2025

Open Programmable Infrastructure API and Behavioral Model

Python 33 41 Updated Oct 24, 2025

SmartSSD related benchmarks and toy applications

Python 10 1 Updated Nov 1, 2023

Provide example code for machine learning class

Python 9 5 Updated Jun 3, 2019