Skip to content
View ChiRuiChen's full-sized avatar
  • National Yang Ming Chiao Tung University
  • Taiwan

Block or report ChiRuiChen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.

Python 138 14 Updated Apr 28, 2022

QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

C++ 165 17 Updated Nov 11, 2025
Python 19 1 Updated Mar 21, 2023

BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing

Scala 149 32 Updated Dec 25, 2019

Caffe implementation of accurate low-precision neural networks

C++ 119 34 Updated Oct 25, 2018

LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks

Python 246 69 Updated Aug 30, 2022

From Pytorch model to C++ for Vitis HLS

C++ 20 5 Updated Feb 2, 2026

This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"

Python 20 5 Updated Aug 30, 2021

Simple PYNQ KV260 tutorial: Porting C-based design into FPGA via Xilinx HLS

Jupyter Notebook 9 3 Updated Oct 31, 2023

PYNQ-Torch: a framework to develop PyTorch accelerators on the PYNQ platform

VHDL 76 7 Updated Jul 2, 2020

Simulator for BitFusion

Python 102 26 Updated Aug 6, 2020

AMD University Program HLS tutorial

Jupyter Notebook 123 24 Updated Oct 28, 2024
Python 8 3 Updated May 11, 2024

Quantization of Convolutional Neural networks.

Python 250 60 Updated Aug 5, 2024

Papers and codes about Quantized Networks for easier survey and reference.

19 Updated Dec 3, 2021

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Python 6,298 1,791 Updated Aug 6, 2023

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-…

Python 2,272 478 Updated May 6, 2025

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

Python 682 143 Updated Feb 3, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,935 254 Updated Feb 5, 2026

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 22,744 4,394 Updated Feb 4, 2026

Manually implemented quantization-aware training

Python 23 4 Updated Oct 12, 2022

PYNQ driver for fpgaconvnet

Python 6 1 Updated Oct 12, 2023

Official pytorch Implementation of Relational Knowledge Distillation, CVPR 2019

Python 414 50 Updated May 17, 2021

This repository provides an FPGA-based solution for executing object detection, focusing specifically on the popular YOLOv5 model architecture.

Python 50 12 Updated Jan 12, 2026

Fast low-bit matmul kernels in Triton

Python 426 31 Updated Feb 1, 2026

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 749 57 Updated Aug 6, 2025

PyTorch implementation of Towards Efficient Training for Neural Network Quantization

Python 16 2 Updated Jan 16, 2020

BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)

Python 42 9 Updated Jan 12, 2021
Next