🚫
And it’s the road that leads to nowhere. But all I want to do is go there.
@ossdao-org•AIRDROP-0x648BD98c408E8dCAcf31Ffdb77C7F9dCF57348dB
-
HNA Group
- Guangzhou
Stars
2
stars
written in Cuda
Clear filter
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.