Highlights
- Pro
Stars
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
Next-Generation AI-Assisted Kernel Engineering for Multi-Chip Systems
A scalable PyTorch training framework with built-in optimizer, scheduler, and distributed backend support.
AccelOpt: Self-improving Agents for AI Accelerator Kernel Optimization
A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching
SymEngine is a fast symbolic manipulation library, written in C++
Reference Code Implementation of paper "Evolution of Kernels: Automated RISC-V Kernel Optimization with Large Language Models"
Simulator for LLM inference on an abstract 3D AIMC-based accelerator
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
A Simulation Framework for Memristive Deep Learning Systems
Memory Array Simulation Testbed for Organization, Data, Operations, and Networks
Verilog used to evaluate the FASED dot product hardware unit [IEEE CAL 2026]
🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
Find shape errors before you run your code!
Artifact of MICRO'25 paper Characterizing and Optimizing Realistic Workloads on a Commercial Compute-in-SRAM Device
An end-to-end Transformer fusion integrating DAG-based pipeline scheduling and whole encoder and decoder fusion.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)