an0308x

Follow

🎯

Focusing

Akshay an0308x

🎯

Focusing

Follow

Machine Learning

9 followers · 123 following

New York
07:43 (UTC -05:00)

Achievements

Achievements

Stars

5 stars written in Cuda

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,824 1,035 Updated Dec 5, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,989 778 Updated Dec 21, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 4,325 613 Updated Dec 22, 2025

a-hamdi / GPU

100 days of building GPU kernels!

Cuda 555 61 Updated Apr 27, 2025

simveit / effective_transpose

Effective transpose on Hopper GPU

Cuda 27 3 Updated Sep 6, 2025