jan-www

Jan Wang jan-www

focus on web machine learning

9 followers · 13 following

alibaba-inc
Hangzhou

Stars

1 star written in Cuda

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda · 3,301 stars · 398 forks · Updated Jan 17, 2026
