tumbleintoyourheart

Michael Scofield tumbleintoyourheart

24 followers · 88 following

Stars

7 stars written in Cuda

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda · 28,091 stars · 3,266 forks · Updated Jun 26, 2025

NVlabs / instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

Cuda · 17,036 stars · 2,018 forks · Updated Oct 8, 2025

HigherOrderCO / HVM

A massively parallel, optimal functional runtime in Rust

Cuda · 11,149 stars · 428 forks · Updated Nov 21, 2024

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda · 8,696 stars · 973 forks · Updated Nov 6, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda · 4,022 stars · 558 forks · Updated Nov 6, 2025

Bruce-Lee-LY / cuda_hgemm

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Cuda · 490 stars · 86 forks · Updated Sep 8, 2024

jundaf2 / INT8-Flash-Attention-FMHA-Quantization

Cuda · 158 stars · 16 forks · Updated Sep 15, 2023

Lists (20)

API

Browsing

Crawler

DB

FE

GenCV

Knowledge

LLM platform

MOOC

OCR

Ops

PDF

Productivity

Project management

Prompt engineering

RAG

Rust

Social media

Speech

VPN
