Skip to content
View edawson's full-sized avatar

Highlights

  • Pro

Block or report edawson

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
11 results for source starred repositories written in Cuda
Clear filter

LLM training in simple, raw C/CUDA

Cuda 28,081 3,264 Updated Jun 26, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,019 558 Updated Nov 6, 2025

Tile primitives for speedy kernels

Cuda 2,868 192 Updated Nov 4, 2025

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 961 97 Updated Dec 30, 2024

Fast, gpu-based CSV parser

Cuda 562 27 Updated Jan 23, 2017

A simple GPU hash table implemented in CUDA using lock free techniques

Cuda 400 44 Updated Feb 7, 2024

SDK for GPU accelerated genome assembly and analysis

Cuda 297 74 Updated May 3, 2024

Instructions, Docker images, and examples for Nsight Compute and Nsight Systems

Cuda 134 22 Updated May 19, 2020

Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.

Cuda 72 9 Updated Nov 4, 2015

LOGAN: High-Performance Multi-GPU X-Drop Long-Read Alignment.

Cuda 29 4 Updated Sep 23, 2022

A project when I was internship at the University of Washington in St. Louis under the guidance of Prof. Buhler.

Cuda 2 1 Updated Aug 18, 2018