Stars
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
ChampSim is an open-source trace-based simulator maintained at Texas A&M University with the support of the computer architecture community.
Verilog evaluation benchmark for large language models
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
A scheduler for spatial DNN accelerators that generates high-performance schedules in one shot using mixed integer programming (MIP)
A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…
Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples
Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.
A reference implementation of the Mind Mappings Framework.
Implementation of the paper: "High-dimensional Bayesian optimization using low-dimensional feature spaces".
Exercises for exploring the Fibertree, Timeloop and Accelergy tools
Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop
[FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Designed deformable convolution.
Intermediate Language (IL) for Hardware Accelerator Generators
Black-box Optimizer based on Bayesian Optimization
Algorithm-hardware Co-design for Deformable Convolution
Privacy-preserving voluntary COVID-19 self-reporting platform. Share your location history and status, get alerts when you are in high-risk areas, and identify high-risk regions.
Implementations of few-shot object detection benchmarks
Brevitas: neural network quantization in PyTorch
SystemC/C++ library of commonly-used hardware functions and components for HLS.