Stars
llama3 implementation one matrix multiplication at a time
Simple to install, powerful command-line based AI agent system for coding.
Demonstrates how to protect your OpenAI API key by using a Cloudflare Worker to serve an ephemeral token, then performing client-side tool calling
Revisiting a known normal transformation in computer graphics
The book "Performance Analysis and Tuning on Modern CPUs"
State-of-the-art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity-style compute shaders. Theoretically portable to all wave/warp/subgroup sizes.
Master programming by recreating your favorite technologies from scratch.
Physically based renderer demo written using Vulkan.
An example of a GPU-powered Debug Draw system in WebGPU.
System headers for interfacing WebGPU from C programs compiled via Emscripten to WebAssembly
Caldera data set from Call of Duty®: Warzone™
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
🎛️ Compact GUI for fine-tuning parameters and monitoring value changes
A lightweight controller and GUI library
Modern, flexible computer graphics and rendering engine, written in C++23 with support for Vulkan 🌋 and DirectX 12 ❎.
Excalibur Hash is a high-speed hash map and hash set, ideal for performance-critical uses like video games
GPU Reshape (GRS) is an API- and vendor-agnostic instrumentation framework with instruction-level validation.
Native WebGPU implementation. Mirror of https://dawn.googlesource.com/dawn. File bugs here: https://crbug.com/dawn/new
A set of barebones utilities for playing with WebGL
The main repository for the FidelityFX SDK.
A FREE comprehensive reverse engineering tutorial covering x86, x64, 32-bit/64-bit ARM, 8-bit AVR and 32-bit RISC-V architectures.
A sample demonstrating hybrid ray tracing and rasterisation for reflections.
A WebGPU Shading Language parser and reflection library for JavaScript.
Sample code for compute shader 101 training