LaurenSpiegel

Lauren LaurenSpiegel

77 followers · 45 following

Achievements

Stars

1 result for source starred repositories written in Cuda

Clear filter

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,626 258 Updated Nov 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lauren LaurenSpiegel

Achievements

Achievements

Block or report LaurenSpiegel

Stars

thu-ml / SageAttention