A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,728 401 Updated May 20, 2026

apple / ml-ane-transformers

Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)

Python 2,716 94 Updated Apr 25, 2023

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 104,384 13,753 Updated May 20, 2026

quic / efficient-transformers

This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transformers library) into inference-ready formats that run efficien…

Python 89 87 Updated May 20, 2026

facebookresearch / parq

Piecewise-Affine Regularized Quantization

Python 19 4 Updated Feb 5, 2026

altair199797 / LowFormer

Python 43 2 Updated May 19, 2026

Tianfang-Zhang / CAS-ViT

Official repository of paper titled "CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications"

Python 93 12 Updated Jan 15, 2026

hacksider / Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Python 93,205 13,568 Updated May 20, 2026

wandb / wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 11,075 872 Updated May 20, 2026

microsoft / geta

[CVPR 2025] Official repository for GETA

Python 42 5 Updated Nov 5, 2025

ModelTC / LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 4,070 328 Updated May 20, 2026

ModelTC / LightCompress

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

Python 715 80 Updated May 14, 2026

Hsu1023 / DuQuant

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Python 180 17 Updated Apr 24, 2026

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 6,088 613 Updated May 9, 2026

huawei-noah / Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Python 4,414 737 Updated Mar 15, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,229 376 Updated Apr 20, 2026

nndeploy / nndeploy

一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework

C++ 1,820 219 Updated Apr 25, 2026

yunjey / pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

Python 32,359 8,246 Updated Aug 15, 2023

BBuf / how-to-learn-deep-learning-framework

how to learn PyTorch and OneFlow

496 30 Updated May 20, 2026

zjhellofss / KuiperInfer

校招、秋招、春招、实习好项目！带你从零实现一个高性能的深度学习推理库，支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 3,430 363 Updated Jun 22, 2025

alibaba / TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Python 878 134 Updated Mar 3, 2026

yoshitomo-matsubara / torchdistill

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at TPAMI, CVPR, ICLR, ECCV, NeurIPS, ICCV, AAAI, etc…

Python 1,616 145 Updated Mar 31, 2026

faif / python-patterns

A collection of design patterns/idioms in Python

Python 42,753 7,035 Updated Mar 13, 2026

VainF / Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Python 3,309 382 Updated Sep 7, 2025

cvlab-yonsei / EWGS

An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.

Python 96 17 Updated Jul 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dzy 666DZY666

Achievements