btakita

Brian Takita btakita

148 followers · 110 following

Bedford, NH
https://briantakita.me
in/briantakita

Achievements

x3 x2

Achievements

x3 x2

Highlights

Lists (1)

Sort

🔮 Future ideas

1 repository

Stars

1 star written in Cuda

Clear filter

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,628 259 Updated Nov 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Brian Takita btakita

Achievements

Achievements

Highlights

Block or report btakita

Lists (1)

🔮 Future ideas

Stars

thu-ml / SageAttention