Skip to content
View sytelus's full-sized avatar

Block or report sytelus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

8 stars written in Cuda
Clear filter

LLM training in simple, raw C/CUDA

Cuda 28,430 3,333 Updated Jun 26, 2025

Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189

Cuda 6,057 615 Updated Aug 2, 2021

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,982 778 Updated Dec 8, 2025

GPU database engine

Cuda 1,173 120 Updated Jan 30, 2017

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 1,023 100 Updated Dec 30, 2024

Facebook's CUDA extensions.

Cuda 285 57 Updated Mar 27, 2019

CUDA implementation of the Blocked Floyd Warshall All pairs shortest path graph algorithm

Cuda 42 13 Updated Mar 31, 2018

CUDA implementation of the Floyd-Warshall All pairs shortest path graph algorithm(with path reconstruction)

Cuda 39 15 Updated Sep 18, 2014