charleschetty

charles chetty charleschetty

12 followers · 47 following

Math.SDU
Jinan, Shandong, China

Stars

4 stars written in Cuda

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda · 2,628 stars · 259 forks · Updated Nov 6, 2025

Tongkaio / CUDA_Kernel_Samples

Hand-written CUDA kernels and interview guide

Cuda · 673 stars · 75 forks · Updated Aug 23, 2025

Eddie-Wang1120 / Professional-CUDA-C-Programming-Code-and-Notes

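Code implementations for Professional CUDA C Programming, covering most of the code from chapters 2 through 8 of the book, together with the author's notes. Everything was implemented by hand by the author, so mistakes are inevitable; please use it with caution, and corrections are very welcome. If it helps you, please give it a Star; it means a lot to the author. Thank you!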

Cuda · 369 stars · 24 forks · Updated Oct 20, 2022

lzyrapx / LeetGPU

Solutions of LeetGPU

Cuda · 44 stars · 3 forks · Updated Oct 31, 2025
