- Univ. of Sci. & Tech. of China (USTC)
- China
- https://gitee.com/wangxuan95
- https://www.zhihu.com/people/wang-xuan-12-89/posts
AI & LLM
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
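Unstructured sparsity prunes individual weights with no block pattern, so the speedup comes from storing and multiplying only the nonzeros. Flash-LLM does this with custom GPU kernels; the SciPy sketch below only illustrates the store-and-skip idea on CPU, with illustrative names and shapes that are not from the repo.

```python
import numpy as np
from scipy.sparse import csr_matrix

rng = np.random.default_rng(0)

# Dense weight matrix with ~80% of entries pruned to zero (no block pattern).
W = rng.standard_normal((1024, 1024)).astype(np.float32)
W[rng.random(W.shape) < 0.8] = 0.0

W_sparse = csr_matrix(W)            # store only the ~20% nonzeros
x = rng.standard_normal((1024, 1)).astype(np.float32)

y_dense = W @ x                     # reference dense matmul
y_sparse = W_sparse @ x             # SpMM touches only nonzero weights

assert np.allclose(y_dense, y_sparse, atol=1e-4)
print(f"stored nonzeros: {W_sparse.nnz} of {W.size}")
```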
Evaluating LLMs on the MMLU dataset.
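A common MMLU recipe formats each question with its four options, scores each answer letter by model log-likelihood, and takes the argmax. A minimal sketch, assuming a scoring callable `loglikelihood(prompt, continuation)` that a real harness would supply; the names here are hypothetical, not this repo's API.

```python
import random

CHOICES = "ABCD"

def answer_mmlu(question, options, loglikelihood):
    """Score each choice letter with the model and pick the argmax."""
    prompt = question + "\n" + "\n".join(
        f"{c}. {o}" for c, o in zip(CHOICES, options)
    ) + "\nAnswer:"
    scores = [loglikelihood(prompt, f" {c}") for c in CHOICES]
    return CHOICES[scores.index(max(scores))]

def accuracy(dataset, loglikelihood):
    """dataset: iterable of (question, options, gold_letter) triples."""
    hits = [answer_mmlu(q, o, loglikelihood) == g for q, o, g in dataset]
    return sum(hits) / len(hits)

# Placeholder scorer so the sketch runs; swap in a real model's
# log-likelihood to get meaningful numbers.
demo = [("2 + 2 = ?", ["3", "4", "5", "6"], "B")]
print(accuracy(demo, lambda p, c: random.random()))
```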
A collection of benchmarks and datasets for evaluating LLMs.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
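At its core, such training is a plain next-token cross-entropy loop; nanoGPT's actual train.py layers AMP, gradient accumulation, and LR scheduling on top of this shape. A minimal sketch, assuming a model whose forward returns logits of shape (batch, seq, vocab); the tiny stand-in model is illustrative only.

```python
import torch
import torch.nn.functional as F

def train_step(model, optimizer, x, y):
    """One next-token training step.
    x, y: (batch, seq) int64 tensors; y is x shifted left by one token."""
    logits = model(x)                                  # (batch, seq, vocab)
    loss = F.cross_entropy(logits.view(-1, logits.size(-1)), y.view(-1))
    optimizer.zero_grad(set_to_none=True)
    loss.backward()
    optimizer.step()
    return loss.item()

vocab, seq, batch = 256, 32, 4
model = torch.nn.Sequential(                # stand-in for a GPT; real use
    torch.nn.Embedding(vocab, 64),          # would pass a transformer here
    torch.nn.Linear(64, vocab),
)
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)
x = torch.randint(0, vocab, (batch, seq))
y = torch.roll(x, shifts=-1, dims=1)        # next-token targets
print(train_step(model, opt, x, y))
```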
📰 Must-read papers on KV Cache Compression (constantly updated 🤗).
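For context, the KV cache stores every past token's key/value tensors, so memory grows linearly with context length; compression methods evict or quantize entries to cap that growth. Below is a toy sketch of one eviction policy from this literature (attention sinks plus a recent window, in the spirit of StreamingLLM); shapes and parameters are made up.

```python
import torch

def evict(keys, values, n_sink=4, window=1024):
    """keys, values: (seq_len, n_heads, head_dim) cached tensors.
    Keep the first n_sink tokens plus the most recent `window` tokens;
    drop everything in between."""
    seq_len = keys.size(0)
    if seq_len <= n_sink + window:
        return keys, values                      # nothing to evict yet
    keep = torch.cat([
        torch.arange(n_sink),                    # attention-sink tokens
        torch.arange(seq_len - window, seq_len)  # recent window
    ])
    return keys[keep], values[keep]

k = torch.randn(5000, 8, 64)
v = torch.randn(5000, 8, 64)
k2, v2 = evict(k, v)
print(k.shape, "->", k2.shape)   # [5000, 8, 64] -> [1028, 8, 64]
```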
AISystem refers to AI systems, covering full-stack low-level technologies such as AI chips, AI compilers, and AI inference and training frameworks.
Official implementation of Yuan, Liu, Zhong et al., "KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches" (EMNLP Findings 2024).
BitBLAS is a library supporting mixed-precision matrix multiplication, especially for quantized LLM deployment.
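Mixed precision here typically means, e.g., fp16 activations against int4 weights, and the numerics reduce to per-group dequantization folded into the matmul. The sketch below shows those numerics in plain PyTorch; it is not BitBLAS's API, which generates fused GPU kernels instead, and the group size is an assumed example value.

```python
import torch

def quantize_int4(w, group=128):
    """Symmetric per-group int4 quantization of a (out, in) weight matrix."""
    g = w.reshape(-1, group)
    scale = g.abs().amax(dim=1, keepdim=True) / 7     # int4 range [-8, 7]
    q = torch.clamp(torch.round(g / scale), -8, 7)
    return q, scale

def matmul_w4a16(x, q, scale, out_features):
    """Dequantize int4 weights to the activation dtype, then matmul.
    A fused mixed-precision kernel does this inline, never materializing
    the full-precision weight in memory."""
    w = (q * scale).reshape(out_features, -1).to(x.dtype)
    return x @ w.t()

x = torch.randn(2, 512)             # activations (fp16 on GPU in practice)
w = torch.randn(256, 512)
q, s = quantize_int4(w)
y = matmul_w4a16(x, q, s, out_features=256)
print(y.shape, (y - x @ w.t()).abs().max())   # shape and quantization error
```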
Generative Models by Stability AI
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache papers with code.
Daily updated LLM papers. Subscribe if interested 👏, and leave a 🌟 if you like it.
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
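The idea is to reparameterize each weight as a few signed powers of two, so multiplying by it needs only exponent shifts and adds. The toy sketch below illustrates that arithmetic only; the paper's actual post-training method (binary-coded quantization with lookup tables) is more involved, and the greedy fit here is an assumption for illustration.

```python
import math

def to_shift_add(w, terms=3):
    """Approximate weight w as a sum of signed powers of two:
    w ~= sum(sign_i * 2**k_i). Greedy fit on the residual."""
    pairs, r = [], w
    for _ in range(terms):
        if r == 0:
            break
        k = round(math.log2(abs(r)))
        s = 1 if r > 0 else -1
        pairs.append((s, k))
        r -= s * 2.0 ** k
    return pairs

def shift_add_mul(x, pairs):
    """Multiply x by the reparameterized weight using only exponent
    shifts (math.ldexp) plus sign flips and adds; no general multiply."""
    return sum(s * math.ldexp(x, k) for s, k in pairs)

w = 0.8125                          # example weight
pairs = to_shift_add(w)             # [(1, 0), (-1, -2), (1, -4)]
x = 3.0
print(shift_add_mul(x, pairs), x * w)   # both 2.4375 for this weight
```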