Skip to content
View WangXuan95's full-sized avatar

Block or report WangXuan95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ModelEngine 项目群的社区管理规范。

42 Updated Nov 17, 2025

Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.

Python 106 19 Updated Sep 17, 2025

High performance block-sorting data compression library

C 337 62 Updated Oct 1, 2025

High-speed lossless data compression of 16 to 512 bytes--get better average compression than QuickLZ for 512-byte blocks. td512 maintains good compression down to 16-byte blocks.

C 25 1 Updated Feb 14, 2022

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Python 111 19 Updated Oct 15, 2024

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,210 53 Updated Jul 31, 2024

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

404 25 Updated Mar 3, 2025
Python 296 26 Updated Jul 10, 2025

MQSim is a fast & accurate simulator for modern multi-queue (MQ) and SATA SSDs. MQSim faithfully models new high-bandwidth protocol implementations, steady-state SSD conditions, and full end-to-end…

C++ 341 174 Updated Aug 25, 2025
C++ 75 13 Updated May 30, 2023

Open Source SSD Controller. NVMe and Lightstor variants

Bluespec 16 22 Updated May 21, 2014

现代图形引擎入门指南

C++ 452 52 Updated Dec 16, 2025

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Cuda 229 22 Updated Sep 24, 2023

Opensource DDR3 Controller

Verilog 401 57 Updated Jun 14, 2025

Using LLM to evaluate MMLU dataset.

Python 41 3 Updated Mar 8, 2024

A collection of benchmarks and datasets for evaluating LLM.

535 34 Updated Jul 13, 2024

qoi and qoi-like implementations optionally using simd

C 11 1 Updated Nov 28, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,209 8,576 Updated Nov 12, 2025

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

623 18 Updated Sep 30, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,874 2,273 Updated Sep 3, 2025

Official implementation for Yuan & Liu & Zhong et al., KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches. EMNLP Findings 2024

Python 87 4 Updated Feb 27, 2025

Fast LZMA2 Library

C 318 31 Updated Oct 26, 2025

翻译systemverilog assertion部分

8 1 Updated Jul 15, 2024

Insane(ly slow but wicked good) PNG image optimization

Python 3,424 147 Updated Jun 18, 2022

cmix is a lossless data compression program aimed at optimizing compression ratio at the cost of high CPU/memory usage.

C++ 677 53 Updated Dec 7, 2025

Trabalho de Graduação

C++ 18 3 Updated Nov 2, 2014

A random event driven text-based game engine.

TypeScript 256 39 Updated Aug 19, 2024

state-of-the-art lossless audio compression

C++ 60 5 Updated Jul 24, 2025
Next