A Ph.D. student in the Database Group @ THU
- Tsinghua University
- Beijing, China
Stars
2 stars written in Cuda
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.
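A minimal sketch of the general idea behind quantized attention, in plain PyTorch: round Q and K to INT8 with per-token scales, compute the score matmul on integer values, then dequantize before the softmax so accuracy-sensitive steps stay in floating point. This illustrates the technique only; it is not the repository's actual CUDA kernels, which run the integer matmul on INT8 tensor cores. The names `quantize_int8` and `int8_attention` are hypothetical.

```python
import torch

def quantize_int8(x):
    # Symmetric per-token quantization: the largest |value| per row maps to 127.
    scale = x.abs().amax(dim=-1, keepdim=True).clamp(min=1e-8) / 127.0
    q = torch.round(x / scale).clamp(-127, 127).to(torch.int8)
    return q, scale

def int8_attention(q, k, v):
    # q, k, v: (batch, heads, seq_len, head_dim) floating-point tensors.
    qi, qs = quantize_int8(q)
    ki, ks = quantize_int8(k)
    # Integer score matmul, emulated in int32 on CPU; a real kernel would
    # use INT8 tensor cores on the GPU.
    scores = qi.to(torch.int32) @ ki.to(torch.int32).transpose(-1, -2)
    # Dequantize with the saved scales, then softmax in floating point.
    scores = scores.float() * qs * ks.transpose(-1, -2)
    scores = scores / q.shape[-1] ** 0.5
    probs = torch.softmax(scores, dim=-1)
    return probs @ v

q, k, v = (torch.randn(1, 8, 128, 64) for _ in range(3))
out = int8_attention(q, k, v)  # close to torch.softmax(q @ k.mT / 8, -1) @ v
```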
[ICML2025] SpargeAttention: a training-free sparse attention that accelerates inference for any model.
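In the same spirit, a toy block-sparse attention sketch: pool queries and keys into blocks, keep only the highest-scoring key blocks for each query block, and mask out the rest. The selection criterion here (pooled dot products with a fixed top-k) is a hedged stand-in, not SpargeAttention's actual prediction method, and for brevity the mask is applied to a dense score matrix; a real kernel skips the masked blocks entirely, which is where the speedup comes from.

```python
import torch

def block_sparse_attention(q, k, v, block=32, keep=2):
    # Toy block-sparse attention: score pooled (query block, key block)
    # pairs, keep the top-`keep` key blocks per query block, mask the rest.
    b, h, n, d = q.shape
    nb = n // block                              # assumes n % block == 0
    qb = q.view(b, h, nb, block, d).mean(dim=3)  # pooled query blocks
    kb = k.view(b, h, nb, block, d).mean(dim=3)  # pooled key blocks
    block_scores = qb @ kb.transpose(-1, -2)     # (b, h, nb, nb)
    top = block_scores.topk(keep, dim=-1).indices
    block_mask = torch.zeros_like(block_scores, dtype=torch.bool)
    block_mask.scatter_(-1, top, True)
    # Expand the block-level mask to token resolution: (b, h, n, n).
    mask = block_mask.repeat_interleave(block, dim=-2)
    mask = mask.repeat_interleave(block, dim=-1)
    scores = q @ k.transpose(-1, -2) / d ** 0.5
    scores = scores.masked_fill(~mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

q, k, v = (torch.randn(1, 8, 128, 64) for _ in range(3))
out = block_sparse_attention(q, k, v)
```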