Super superigni

Stars

ModelCloud / GPTQModel

LLM quantization (compression) toolkit with hardware acceleration support for Nvidia, AMD, and Intel GPUs and Intel, AMD, and Apple CPUs via HF, vLLM, and SGLang.

Python · 1,177 stars · 187 forks · Updated Jun 15, 2026

microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python · 766 stars · 59 forks · Updated Aug 6, 2025
