KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 1,061 173 Updated Mar 24, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

174,744 17,820 Updated Apr 20, 2026

A kernel library written in tilelang

Python 1,586 138 Updated Apr 23, 2026
C 98 22 Updated Jun 6, 2026

OpenCode plugin that uses your existing Claude Code credentials — no separate login needed.

TypeScript 1,079 138 Updated May 15, 2026

The open source coding agent.

TypeScript 174,042 20,998 Updated Jun 13, 2026

Open-source CUDA, Triton and HIP compiler targeting multiple GPU and CPU architectures.

C 1,697 87 Updated Jun 13, 2026

Documentation for the Mainboard and printable mechanical parts in the Framework Desktop

OpenSCAD 302 20 Updated Nov 30, 2025

A project trying to build a hoverboard controller without semiconductors

20 1 Updated Nov 24, 2025

A machine learning accelerator core designed for energy-efficient AI at the edge.

Emacs Lisp 2,376 295 Updated Jun 12, 2026

The best ChatGPT that $100 can buy.

Python 54,989 7,492 Updated May 5, 2026

Memory Optimizations for Deep Learning (ICML 2023)

Python 122 15 Updated Mar 13, 2024

Exocompilation for productive programming of hardware accelerators

Python 730 54 Updated May 16, 2026

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

Cuda 2,306 219 Updated Jun 13, 2026
Rocq Prover 370 12 Updated Sep 20, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 13,161 1,585 Updated Feb 27, 2026

NanoGPT (124M) in 90 seconds

Python 5,391 809 Updated Jun 13, 2026

Open-source high-performance RISC-V processor

Scala 7,070 909 Updated Jun 13, 2026

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,825 90 Updated Jun 13, 2026

the official Rust and C implementations of the BLAKE3 cryptographic hash function

Assembly 6,276 460 Updated May 21, 2026

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to large…

Python 357 27 Updated Jul 29, 2024

Entropy Based Sampling and Parallel CoT Decoding

Python 3,435 321 Updated Nov 13, 2024

A free and strong UCI chess engine

C++ 15,816 2,919 Updated Jun 10, 2026

parallelized hyperdimensional tictactoe

Python 127 2 Updated Aug 25, 2024

Nvidia Instruction Set Specification Generator

Python 339 23 Updated Jul 9, 2024