🤕
hungry
- University of Information Technology
- Ho Chi Minh City, Viet Nam
- 21:16 (UTC +07:00)
- https://orcid.org/0009-0007-1788-4155
- nhtuan.2712
- in/htuann2712
Stars: 2 repositories written in CUDA
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.