Stars
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
AcadHomepage: A Modern and Responsive Academic Personal Homepage
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Allo Accelerator Design and Programming Framework (PLDI'24)
Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.
IC implementation of Systolic Array for TPU
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
synthesiseable ieee 754 floating point library in verilog
This is a simple crawler to grasp pages from search engine.
Course project of wifi communication of the Embedded System course in SUSTech
yHoweM / aanet
Forked from haofeixu/aanetAANet: Adaptive Aggregation Network for Efficient Stereo Matching, CVPR 2020
[CVPR'20] AANet: Adaptive Aggregation Network for Efficient Stereo Matching
yHoweM / simple_net
Forked from LiuXiaolong19920720/simple_netA simple deep neural network implemented in C++,based with OpenCV Mat matrix class
A simple deep neural network implemented in C++,based with OpenCV Mat matrix class
C++ implementation of the Python Numpy library
SGM,立体匹配StereoMatching最经典应用最广泛算法,4000+引用,兼顾效率和效果。完整实现,代码规范,注释清晰,博客教学!
SOS IROS 2018 GOOGLE; StereoNet ECCV2018 GOOGLE; ActiveStereoNet ECCV2018 Oral GOOGLE; HITNET CVPR2021 GOOGLE;PLUME Uber ATG