📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Updated Dec 4, 2025 - Cuda
CUDA kernel author's tools
🍒 cuRBLAS (Randomized BLAS) is a GPU-accelerated library for AI and HPC applications.
CUDA Finite Difference Library
CUDA Programming Practices
A CUDA concurrency library designed to simplify concurrency programming, offering C++-style wrappers for selected CUDA Runtime APIs
CUSL: CUDA port of GNU Scientific Library (GSL)
A beginner's guide to CUDA programming
CUDA library for irregular tasks using a dynamic block-internal balancing mechanism
Experiments with CUDA and GPU stuff