Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,779 1,076 Updated Dec 22, 2025

yuhuixu1993 / qa-lora

Official PyTorch implementation of QA-LoRA

Python 145 11 Updated Mar 13, 2024

facebookresearch / ParetoQ

This repository contains the training code of ParetoQ introduced in our work "ParetoQ Scaling Laws in Extremely Low-bit LLM Quantization"

Python 116 8 Updated Oct 15, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,327 7,793 Updated Dec 21, 2025

ruikangliu / Quantized-Reasoning-Models

[COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"

Python 61 5 Updated Jul 8, 2025

OpenGVLab / EfficientQAT

[ACL 2025 Main] EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 319 25 Updated Nov 26, 2025

zjq0455 / PTQ-Bench

Python 2 1 Updated May 16, 2025

cookiedoth / nestquant

Python 5 1 Updated Jun 6, 2025

chenzx921020 / MoEQuant

Python 11 1 Updated Apr 7, 2025

SamsungLabs / BOA

Python 5 Updated Nov 28, 2025

OptimAI-Lab / RoSTE

[ICML 2025] Official code for the paper "RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models"

Python 6 Updated May 29, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,460 1,913 Updated Jun 3, 2025

ModelCloud / GPTQModel

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 940 141 Updated Dec 22, 2025

Intelligent-Computing-Lab-Panda / GPTAQ

Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)

Python 79 1 Updated Jul 28, 2025

huawei-csl / SINQ

Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model smaller while preserving accuracy.

Python 585 50 Updated Dec 19, 2025

pprp / Awesome-LLM-Quantization

Awesome list for LLM quantization

Python 371 20 Updated Oct 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cccpr brisker

Achievements

Achievements

Block or report brisker

Starred repositories

Thecommonirin / Qresafe

LeapLabTHU / limit-of-RLVR

Xingyu-Zheng / FOEM

jjyaoao / HelloAgents

24mlight / A_Share_investment_Agent

NVlabs / QeRL

OpenBitSys / BitDistiller

ylsung / rsq

jongwooko / distillm-2

CURRENTF / LowRankClone

yuezhouhu / adaspec

ustcwhy / BitVLA

huggingface / trl

IST-DASLab / MoE-Quant

modelscope / ms-swift