WangXuan95

Stars

AI&LLM

15 repositories

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Cuda 229 22 Updated Sep 24, 2023

Using LLM to evaluate MMLU dataset.

Python 41 3 Updated Mar 8, 2024

A collection of benchmarks and datasets for evaluating LLM.

535 34 Updated Jul 13, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,218 8,580 Updated Nov 12, 2025

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

623 18 Updated Sep 30, 2025

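AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.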

Jupyter Notebook 15,878 2,274 Updated Sep 3, 2025

Official implementation for Yuan & Liu & Zhong et al., KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches. EMNLP Findings 2024

Python 87 4 Updated Feb 27, 2025

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 732 55 Updated Aug 6, 2025

Generative Models by Stability AI

Python 26,721 3,009 Updated Dec 16, 2025

Quantization (QAT) Demo on CIFAR10

Python 17 1 Updated Jan 29, 2024
Python 296 26 Updated Jul 10, 2025

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

404 25 Updated Mar 3, 2025

Daily updated LLM papers. Subscriptions are welcome 👏 If you like it, please give it a star 🌟

1,210 53 Updated Jul 31, 2024

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Python 111 19 Updated Oct 15, 2024