Skip to content
View Systemcluster's full-sized avatar
🏛️
Creating something ...
🏛️
Creating something ...

Sponsors

@Silic0nS0ldier

Sponsoring

@daxpedda

Highlights

  • Pro

Block or report Systemcluster

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
12 results for source starred repositories written in Cuda
Clear filter

LLM training in simple, raw C/CUDA

Cuda 28,418 3,333 Updated Jun 26, 2025

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 17,147 2,038 Updated Dec 14, 2025

A massively parallel, optimal functional runtime in Rust

Cuda 11,177 426 Updated Nov 21, 2024

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 1,023 100 Updated Dec 30, 2024

Pytorch Bindings for warp-ctc

Cuda 761 265 Updated Jul 2, 2023

llama3.cuda is a pure C/CUDA implementation for Llama 3 model.

Cuda 349 26 Updated Apr 27, 2025

an implementation of parallel linear BVH (LBVH) on GPU

Cuda 240 32 Updated Jun 8, 2020

High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.

Cuda 123 7 Updated Jul 13, 2024

GPU-Accelerated Lossless Data Compressors Survey

Cuda 121 11 Updated Sep 10, 2020

High-Performance SGEMM on CUDA devices

Cuda 113 5 Updated Jan 21, 2025

A GPU Accelerated Binary Vector Store

Cuda 47 2 Updated Feb 17, 2025
Cuda 4 Updated Apr 8, 2024