Skip to content
View fire's full-sized avatar

Sponsors

@benbot
@aaronfranke
@salty-godzilla
Private Sponsor
@MerlinVR
@RevoluPowered

Organizations

@godotengine @V-Sekai

Block or report fire

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

6 stars written in Cuda
Clear filter

Reference implementation of Megalodon 7B model

Cuda 526 54 Updated May 17, 2025

GPU-accelerated Levenberg-Marquardt curve fitting in CUDA

Cuda 338 102 Updated Mar 12, 2026

GPU-accelerated triangle mesh processing

Cuda 297 41 Updated Apr 15, 2026

Official code release for "Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency"

Cuda 140 11 Updated Feb 18, 2026

Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)

Cuda 88 7 Updated Feb 10, 2026

[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.

Cuda 12 5 Updated Jun 22, 2025