V1ammer

🦀

Vlad Gerasov V1ammer

🦀

15 followers · 73 following

team-73

Achievements

Lists (1)

Sort

ferrous-systems

ferrous-systems repos

11 repositories

Starred repositories

10 results for source starred repositories written in Cuda

Clear filter

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 10,831 1,093 Updated Apr 20, 2026

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 2,956 272 Updated Apr 22, 2026

Tony-Tan / CUDA_Freshman

Cuda 2,739 509 Updated Jan 16, 2024

NVIDIA / CUDALibrarySamples

CUDA Library Samples

Cuda 2,384 457 Updated Apr 20, 2026

tspeterkim / flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 1,127 111 Updated Dec 30, 2024

66RING / tiny-flash-attention

flash attention tutorial written in python, triton, cuda, cutlass

Cuda 507 53 Updated Jan 20, 2026

wangzyon / NVIDIA_SGEMM_PRACTICE

Step-by-step optimization of CUDA SGEMM

Cuda 458 59 Updated Mar 30, 2022

xlite-dev / ffpa-attn

FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA.

Cuda 276 16 Updated Apr 30, 2026

Enigmatisms / cuda-pt

Writing a CUDA software ray tracing renderer with Analysis-Driven Optimization from scratch: a python-importable, distributed parallel renderer.

Cuda 37 2 Updated Apr 12, 2026

rishisankar / flashattention2

Flash Attention 2 CUDA implementations

Cuda 13 1 Updated Apr 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vlad Gerasov V1ammer

Achievements

Achievements

Block or report V1ammer

Lists (1)

ferrous-systems

Starred repositories

xlite-dev / LeetCUDA

BBuf / how-to-optim-algorithm-in-cuda

Tony-Tan / CUDA_Freshman

NVIDIA / CUDALibrarySamples

tspeterkim / flash-attention-minimal

66RING / tiny-flash-attention

wangzyon / NVIDIA_SGEMM_PRACTICE

xlite-dev / ffpa-attn

Enigmatisms / cuda-pt

rishisankar / flashattention2

Starred topics

music-server

low-level-design

Terminal

IPFS

Linux

Rust