Skip to content
View roumenguha's full-sized avatar
💭
Looking for projects to contribute to!
💭
Looking for projects to contribute to!

Block or report roumenguha

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

8 stars written in Cuda
Clear filter

LLM training in simple, raw C/CUDA

Cuda 28,074 3,264 Updated Jun 26, 2025

Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189

Cuda 6,063 614 Updated Aug 2, 2021

Implementation of the KinectFusion approach in modern C++14 and CUDA

Cuda 486 81 Updated May 5, 2021

State of the art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity style compute shaders. Theoretically portable to all wave/warp/subgroup sizes.

Cuda 396 21 Updated Dec 14, 2024
Cuda 123 16 Updated Oct 22, 2025

Code base for CUDA Masterclass course

Cuda 40 27 Updated Dec 12, 2020

CUDA C is all you need

Cuda 1 Updated Mar 30, 2025