Stars
The AI developer platform. Use Weights & Biases to train and fine-tune models and manage them from experimentation to production.
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
Accessible large language models via k-bit quantization for PyTorch.
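The core idea behind k-bit quantization libraries like this one can be sketched in a few lines. This is an illustrative absmax int8 round-trip in plain numpy, not the library's actual kernels; all function names here are my own.

```python
import numpy as np

def quantize_int8(x: np.ndarray):
    """Absmax 8-bit quantization: scale to [-127, 127], round, keep the scale."""
    scale = np.abs(x).max() / 127.0
    q = np.round(x / scale).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((4, 4)).astype(np.float32)
q, s = quantize_int8(w)
w_hat = dequantize_int8(q, s)
# Reconstruction error is bounded by half a quantization step.
assert np.abs(w - w_hat).max() <= s / 2 + 1e-6
```

The real library adds outlier handling, block-wise scales, and fused GPU kernels on top of this basic scheme.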
Utilities intended for use with Llama models.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
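The streaming trick in that paper is a KV-cache eviction policy: keep the first few "attention sink" tokens plus a sliding window of recent tokens. A minimal sketch of the kept indices, with hypothetical defaults (the paper's settings differ):

```python
def streaming_keep_indices(seq_len: int, n_sink: int = 4, window: int = 8):
    """Indices of KV-cache entries to keep: the first n_sink 'attention sink'
    tokens plus a sliding window of the most recent tokens."""
    if seq_len <= n_sink + window:
        return list(range(seq_len))
    return list(range(n_sink)) + list(range(seq_len - window, seq_len))

# With 20 cached tokens: keep sinks 0-3 and the last 8 positions.
assert streaming_keep_indices(20) == [0, 1, 2, 3] + list(range(12, 20))
```

The cache thus stays at a fixed size regardless of stream length, which is what makes generation over millions of tokens feasible.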
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Efficient Triton Kernels for LLM Training
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
A PyTorch native platform for training generative AI models
🚀 Efficient implementations of state-of-the-art linear attention models
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
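AWQ's key observation is that per-channel scaling chosen from activation statistics is mathematically transparent but protects salient weight channels from quantization error. A toy sketch of the equivalence (the real method searches the scaling exponent to minimize error; the helper name is mine):

```python
import numpy as np

def awq_style_scales(acts: np.ndarray, alpha: float = 0.5) -> np.ndarray:
    """Per-input-channel scales from activation magnitudes (illustrative;
    AWQ proper searches alpha to minimize quantization error)."""
    return np.abs(acts).mean(axis=0) ** alpha

rng = np.random.default_rng(0)
X = rng.standard_normal((16, 8))
W = rng.standard_normal((8, 4))
s = awq_style_scales(X)
# Scaling is exact in full precision: (X / s) @ (diag(s) W) == X @ W.
# Quantizing s[:, None] * W instead of W preserves the salient channels better.
assert np.allclose((X / s) @ (s[:, None] * W), X @ W)
```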
Open source process design kit for usage with SkyWater Technology Foundry's 130nm node.
Sparsity-aware deep learning inference runtime for CPUs
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
PyTorch native quantization and sparsity for training and inference
Minimalistic large language model 3D-parallelism training
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
dstack is an open-source control plane for running development, training, and inference jobs on GPUs across hyperscalers, neoclouds, or on-prem.
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch
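The 1-bit idea can be sketched simply: binarize weights to their sign and rescale by the mean absolute value, keeping activations in full precision. This is an illustrative forward pass in numpy, not the paper's exact training recipe (which quantizes through a straight-through estimator):

```python
import numpy as np

def binarize(w: np.ndarray):
    """1-bit weights in the spirit of BitNet: sign(W) times mean |W|."""
    return np.sign(w), np.abs(w).mean()

def bit_linear(x: np.ndarray, w: np.ndarray) -> np.ndarray:
    wb, alpha = binarize(w)
    return alpha * (x @ wb)   # full-precision activations, 1-bit weights

rng = np.random.default_rng(0)
x = rng.standard_normal((2, 8))
w = rng.standard_normal((8, 4))
y = bit_linear(x, w)
assert y.shape == (2, 4)
```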
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
Machine learning on FPGAs using HLS
Implementing DeepSeek R1's GRPO algorithm from scratch
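GRPO's central simplification is replacing a learned value baseline with a group-relative one: sample several completions per prompt and normalize each reward against the group's mean and standard deviation. A minimal sketch of that advantage computation (the clipping/KL parts of the objective are omitted):

```python
import numpy as np

def grpo_advantages(rewards: np.ndarray) -> np.ndarray:
    """Group-relative advantage: normalize each completion's reward against
    the mean/std of its group (no learned value function needed)."""
    return (rewards - rewards.mean()) / (rewards.std() + 1e-8)

group = np.array([1.0, 0.0, 0.5, 0.5])   # rewards for 4 completions of one prompt
adv = grpo_advantages(group)
assert abs(adv.mean()) < 1e-6            # advantages center at zero
assert adv[0] > 0 > adv[1]               # above-average sample gets positive advantage
```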
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
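SmoothQuant migrates activation outliers into the weights with a per-channel factor s_j = max|X_j|^α / max|W_j|^(1−α); the rescaling is exact in full precision but leaves both tensors easier to quantize. A toy demonstration (the synthetic outlier channel and helper name are mine):

```python
import numpy as np

def smoothquant_scales(X: np.ndarray, W: np.ndarray, alpha: float = 0.5):
    """Per-channel smoothing factor from the paper's formula:
    s_j = max|X_j|^alpha / max|W_j|^(1-alpha)."""
    return np.abs(X).max(axis=0) ** alpha / np.abs(W).max(axis=1) ** (1 - alpha)

rng = np.random.default_rng(0)
X = rng.standard_normal((16, 8))
X[:, 0] *= 50                             # one outlier activation channel
W = rng.standard_normal((8, 4))
s = smoothquant_scales(X, W)
X_s, W_s = X / s, s[:, None] * W
assert np.allclose(X_s @ W_s, X @ W)      # exact equivalence in full precision
assert np.abs(X_s).max() < np.abs(X).max()  # outlier magnitude migrated into W
```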
Modular hardware build system