Skip to content
View MachineGunLin's full-sized avatar

Block or report MachineGunLin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

9 stars written in Cuda
Clear filter

DeepEP: an efficient expert-parallel communication library

Cuda 9,591 1,217 Updated Apr 29, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,149 955 Updated Apr 24, 2026

how to optimize some algorithm in cuda.

Cuda 2,956 272 Updated Apr 22, 2026

CUDA Library Samples

Cuda 2,384 457 Updated Apr 20, 2026

Sample codes for my CUDA programming book

Cuda 2,042 384 Updated Dec 14, 2025

Introduction to Parallel Programming class code

Cuda 1,349 1,146 Updated Jun 27, 2022

Learn CUDA Programming, published by Packt

Cuda 1,243 260 Updated Dec 30, 2023

cuda编程学习资料

Cuda 37 10 Updated Apr 4, 2020