Lists (21)
Sort Name ascending (A-Z)
Agent
Algorithms
Blog
Book
C/C++
CG
ChatGPT
CUDA
FullStack
Interview
Java
Javascript
Leetcode
LLM
ML/AI
Networking/C
Python
Rust
SD
Skills
System Design
Starred repositories
Learn and Master Node.js. V8, libuv, event loop, buffers, streams, async internals, performance, security, deployment, and production architecture.
A contact solver for physics-based simulations involving 👚 shells, 🪵 solids and 🪢 rods.
Lightweight 3d physics engine for javascript
DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm
Local-first code intelligence graph for MCP and CLI. Builds a persistent map of your codebase so AI coding tools read only what matters, with benchmarked context reductions on reviews and large-rep…
Become a cracked AI/ML Research Engineer
Website for Dynamic Programming TextBook
🌱Light and powerful C++ web framework for highly scalable and resource-efficient web application. It's zero-dependency and easy-portable.
Optimizing diffusion for production-ready speeds
Flash Attention in ~100 lines of CUDA (forward pass only)
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Machine Learning Engineering Open Book
Dive into CPython internals, trying to illustrate every detail of CPython implementation
Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.
CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through w…
Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
mini cli search engine for your docs, knowledge bases, meeting notes, whatever. Tracking current sota approaches while being all local
GLake: optimizing GPU memory management and IO transmission.
This repo contains my solutions to “Introduction to Machine Learning Interviews” by Chip Huyen.
Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1
A hardware-aware guide to data structures for system software engineers.
Native and Compact Structured Latents for 3D Generation
Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…