🐒
PhD student @ CMU Safe AI Lab
-
Carnegie Mellon University
- Pittsburgh, PA
- https://willxxy.github.io/
Lists (2)
Sort Name ascending (A-Z)
Stars
8
stars
written in Cuda
Clear filter
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
CUDA accelerated rasterization of gaussian splatting
Flash Attention in ~100 lines of CUDA (forward pass only)