-
UC Berkeley
- https://people.eecs.berkeley.edu/~ysshao/
Stars
Configurable low-precision floating-point and microscaling hardware in Chisel
ucb-bar / XPU-RT
Forked from minhn02/RobotScheduleGenerating Schedules for Robotic Workloads
This repo contains the skeleton scripts for running a full RTL2GDS flow using Cadence tools, as demonstrated in the Full RTL2GDS Demo prepared and delivered by Prof. Adam (Adi) Teman.
CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…
Allo Accelerator Design and Programming Framework (PLDI'24)
A hardware–software co-design framework for developing and characterizing extended reality (XR) workloads on embedded systems-on-chip (SoCs).
EE194 Lab 0: Chisel Crash Course
Lab manual EECS151 Tapeout Decal. Public view - release to main branch only once ready.
A machine learning accelerator core designed for energy-efficient AI at the edge.
ucb-bar / Accelerated-TinyMPC
Forked from TinyMPC/TinyMPCModel-predictive control for microcontrollers (fork mapping tinyMPC to gemmini or other HW accelerators)
A Heterogeneous GPU Platform for AI and Neural Graphics
A submodule of Chipyard https://github.com/ucb-bar/chipyard
Cluster-level matrix unit integration into GPUs, implemented in Chipyard SoC
Chisel RISC-V Vector 1.0 Implementation
Tool for converting PyTorch models into raw C codes with minimal dependency and some performance optimizations.