Highlights
- Pro
Starred repositories
A pure-Python implementation of the Nvidia CuTe layout algebra intended to be approachable and easy to learn.
bladeRF USB 3.0 Superspeed Software Defined Radio Source Code
Submit stacked diffs to GitHub on the command line
get things from one computer to another, safely
Complete cell formatting support for Google spreadsheets via gspread package.
NVIDIA Math Libraries for the Python Ecosystem
A tool for bandwidth measurements on NVIDIA GPUs.
MIT IAP short course: Matrix Calculus for Machine Learning and Beyond
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
Templight is a Clang-based tool to profile the time and memory consumption of template instantiations and to perform interactive debugging sessions to gain introspection into the template instantia…
Loop Habit Tracker, a mobile app for creating and maintaining long-term positive habits
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Jupyter notebooks and documentation for SageManifolds
Open CS Application | 开源CS申请
Intermediate Representation for Binary analysis and transformation
Wind power visualization with WebGL particles
C++ Insights - See your source code with the eyes of a compiler
Hacky scripts to fixup stack strings in Ghidra's decompiler.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧