DoubleRedX

Starred repositories

9 results for source starred repositories written in Cuda

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda · 8,325 stars · 825 forks · Updated Nov 6, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda · 4,020 stars · 558 forks · Updated Nov 6, 2025

CUDA Library Samples

Cuda · 2,161 stars · 416 forks · Updated Oct 31, 2025

GPU Accelerated t-SNE for CUDA with Python bindings

Cuda · 1,892 stars · 136 forks · Updated Oct 2, 2024

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda · 636 stars · 133 forks · Updated Oct 20, 2025

Distributed multigrid linear solver library on GPU

Cuda · 614 stars · 162 forks · Updated Oct 15, 2025

🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.

Cuda · 226 stars · 10 forks · Updated Aug 8, 2025

Sample code from the book "Professional CUDA C Programming"

Cuda · 40 stars · 20 forks · Updated May 23, 2023