13 results for source starred repositories

A general 2-8 bit quantization toolbox supporting GPTQ/AWQ/HQQ/VPTQ, with easy export to ONNX and ONNX Runtime.

Python 180 19 Updated Apr 2, 2025

A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

Python 611 62 Updated Nov 5, 2025

Simple Fast Virtual Machine

C 10 Updated Apr 3, 2024

This is a Chinese translation of the CUDA programming guide

1,733 254 Updated Nov 13, 2024

[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization

Python 94 15 Updated May 5, 2022

Useful CMake Examples

CMake 13,001 2,549 Updated Feb 28, 2024

Repository containing notebooks of my posts on Medium

Jupyter Notebook 2,131 985 Updated Sep 22, 2024
Jupyter Notebook 281 166 Updated Jun 28, 2022

Dockerfile to build TensorFlow-GPU v1.10 with a native CUDA driver (e.g. CUDA 8.0/CUDA 9.0/CUDA 9.2/CUDA 10.0)

Shell 19 5 Updated Jan 11, 2025

PyTorch implementation of "SlowFast Networks for Video Recognition".

Python 348 81 Updated Mar 13, 2019

Pretrained Image & Video ConvNets and GANs for PyTorch: NASNet, ResNeXt (2D + 3D), ResNet (2D + 3D), InceptionV4, InceptionResnetV2, Xception, DPN, NonLocalNets, R(2+1)D nets, MultiView CNNs, Tempo…

Python 331 49 Updated Jan 8, 2022

Build your neural network easily and quickly: 莫烦Python (Morvan Python) tutorials in Chinese

Jupyter Notebook 8,369 3,103 Updated Mar 23, 2023