- Univ. of Sci. & Tech. of China (USTC)
- China
- https://orcid.org/0000-0003-3065-4606
- https://gitee.com/wangxuan95
- https://www.zhihu.com/people/wang-xuan-12-89/posts
Stars
Scalable long-context LLM decoding that leverages sparsity by treating the KV cache as a vector storage system.
High performance block-sorting data compression library
High-speed lossless data compression of 16 to 512 bytes: better average compression than QuickLZ on 512-byte blocks, and td512 maintains good compression down to 16-byte blocks.
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
Daily updated LLM papers. LLM-related papers updated every day; subscriptions welcome 👏 If you like it, give it a star 🌟
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
MQSim is a fast & accurate simulator for modern multi-queue (MQ) and SATA SSDs. MQSim faithfully models new high-bandwidth protocol implementations, steady-state SSD conditions, and full end-to-end…
Open Source SSD Controller. NVMe and Lightstor variants
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
Using an LLM to evaluate the MMLU dataset.
A collection of benchmarks and datasets for evaluating LLM.
qoi and qoi-like implementations optionally using simd
The simplest, fastest repository for training/finetuning medium-sized GPTs.
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
AISystem mainly covers AI systems, including full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.
Official implementation for Yuan & Liu & Zhong et al., KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches. EMNLP Findings 2024
Insane(ly slow but wicked good) PNG image optimization
cmix is a lossless data compression program aimed at optimizing compression ratio at the cost of high CPU/memory usage.
A random event driven text-based game engine.