-
imagequant
Convert 24/32-bit images to 8-bit palette with alpha channel. For lossy PNG compression and high-quality GIF images Dual-licensed like pngquant. See https://pngquant.org for details.
-
ruvector-temporal-tensor
Temporal tensor compression with tiered quantization for RuVector
-
quantette
Fast and high quality image quantization and palette generation
-
kenosis-cli
CLI for ONNX model quantization, casting, inspection, and comparison — powered by kenosis-core
-
float4
MXFP4-compatible 4-bit floating point types and block formats for Rust
-
pngquant
Convert 24/32-bit PNG images to efficient 8-bit format with alpha channel
-
a-sixel
A small sixel + palette selection + dithering library
-
qntz
Vector quantization primitives (RaBitQ, ternary, bit packing) for ANN systems
-
boostr
ML framework built on numr - attention, quantization, model architectures
-
cubek
CubeCL Kernels
-
qjl-sketch
QJL sign-based vector compression and scoring with near-optimal distortion rate
-
dithr
Buffer-first rust dithering and halftoning library
-
ternlang-compress
LLM-to-ternary compression pipeline — quantize float models to {-1,0,+1}, build sparse zero-index, export .tern files for ternlang-ml inference
-
lnmp-quant
Quantization and compression for LNMP embedding vectors with minimal accuracy loss
-
kenosis-core
Pure-Rust ONNX model optimization toolkit — static INT8 quantization, precision casting, and model analysis
-
rabitq-rs
Advanced vector search: RaBitQ quantization with IVF and MSTG (Multi-Scale Tree Graph) index
-
bitnet-core
Core BitNet implementation with fundamental data structures and algorithms
-
bitnet-quant
1.58-bit quantization engine for BitNet neural networks
-
voxtral-micro
Voxtral Micro - Minimal text-to-speech with Q4 GGUF quantization
-
diskann-quantization
DiskANN is a fast approximate nearest neighbor search library for high dimensional data
-
minimum_ml
Experimental Machine Learning Library
-
ohms-adaptq
NOVAQ: Normalized Outlier-Vector Additive Quantization - Democratic 93-100x LLM compression with 99%+ accuracy retention. No restrictions, no gatekeeping.
-
bitpolar
near-optimal vector quantization with zero training overhead — 3-bit precision, provably unbiased inner products (ICLR 2026)
-
microcnn
A minimal CNN framework in Rust with Quantization
-
ruvector-fpga-transformer
FPGA Transformer backend with deterministic latency, quantization-first design, and coherence gating
-
quantize-rs
Neural network quantization toolkit for ONNX models
-
bitnet-quantize
Microsoft BitNet b1.58 quantization and inference for Rust
-
vq
A vector quantization library for Rust
-
embedvec
Fast, lightweight, in-process vector database with HNSW indexing, E8/H4 lattice quantization (up to 24.8x compression), metadata filtering, and PyO3 bindings
-
turbovec
Fast vector quantization with 2-4 bit compression and SIMD search
-
zeta-reticula-server
GPU-accelerated ML inference server with Stripe billing, Hugging Face model caching, and SSE streaming
-
cubecl-quant
CubeCL Quantization Library
-
rust-ai-core
Unified AI engineering toolkit: orchestrates peft-rs, qlora-rs, unsloth-rs, axolotl-rs, bitnet-quantize, trit-vsa, vsa-optim-rs, and tritter-accel
-
cubek-quant
CubeK: Quantization Library
-
torsh-quantization
Model quantization for ToRSh neural networks
-
axonml-quant
Model quantization for the Axonml ML framework
-
oxillama-quant
Quantization kernels for all GGUF quantization types
-
frankensearch-index
FSVI vector index, SIMD dot product, and top-k search for frankensearch
-
ruvector-dither
Deterministic low-discrepancy dithering for low-bit quantization: golden-ratio and π-digit sequences for blue-noise error shaping
-
sevensense-embedding
Embedding bounded context for 7sense bioacoustics - Perch 2.0 ONNX integration
-
amv_decoder
Experimental AMV parser and decoder for KiriKiri2 / KiriKiriZ engine videos
-
imagequant-sys
Convert 24/32-bit images to 8-bit palette with alpha channel. C API/FFI libimagequant that powers pngquant lossy PNG compressor. Dual-licensed like pngquant. See https://pngquant.org for details.
-
turbo-vec
TurboQuant-based vector quantization and search library
-
turbo-quant
TurboQuant, PolarQuant, and QJL — zero-overhead vector quantization for semantic search and KV cache compression (ICLR 2026)
-
kizzasi-tokenizer
Signal quantization and tokenization for Kizzasi AGSP - VQ-VAE, μ-law, continuous embeddings
-
whisperforge-core
GPU-accelerated Whisper model inference with streaming audio, quantization, and KV-cached decoding
-
gollum-compute
Machine Learning and other Compute related support for Gollum
-
oxicuda-quant
GPU-accelerated quantization and model compression engine for OxiCUDA
-
ruvector-rabitq
RaBitQ: rotation-based 1-bit quantization for ultra-fast approximate nearest-neighbor search with theoretical error bounds
-
rotorvec
Vector index using Clifford-rotor block-diagonal quantization (RotorQuant)
-
oxify-vector
In-memory vector search and similarity operations for OxiFY (ported from OxiRS)
-
kwaai-compression
Compression utilities for KwaaiNet - 8-bit quantization, gradient compression
-
cp-compress
DCT-based embedding compression for Canon Protocol (CP-016)
-
aitna
A local LLM inference platform with model pulling, quantization optimization, and high-performance serving
-
pixelization
An image quantization and pixelization library implementing K-Means and PIA (Pixelated Image Abstraction)
-
haagenti-mobile
Mobile deployment support for iOS (CoreML) and Android (NNAPI)
-
haagenti-adaptive
Adaptive precision scheduling for diffusion inference
-
rvf-quant
RuVector Format temperature-tiered vector quantization (f32/f16/u8/binary)
-
rscolorq
Spatial color quantization, a Rust port of
scolorq -
zeta-quantization
Advanced quantization engine for efficient LLM inference
-
polarquant
Walsh-Hadamard rotation + polar coordinate quantization for LLM weight and KV cache compression
-
vector_quantizer
vector quantization utilities and functions
-
numquant
Quantize numbers to a smaller range to save bandwidth or memory data types and back again
-
greenfield
images
-
hanzo-llm
Hanzo AI - Llm Library
-
exoquant
Very high quality image quantization
-
palette_extract
port of Leptonica's modified media cut quantization algorithm
-
haagenti-neural
Neural compression using learned codebooks for 10x model compression
-
numcodecs-linear-quantize
Linear Quantization codec implementation for the numcodecs API
-
whisper-cpp-plus-sys
Low-level FFI bindings for whisper.cpp
-
mnemonist-quant
TurboQuant vector quantization for mnemonist — near-optimal MSE and inner-product quantizers
-
vil_quantized
D13 - Model Quantization Runtime for VIL
-
quantized-pathfinding
Quantization before pathfinding
-
rabitq
vector search algorithm
-
ashares
中国股市A股股票行情实时数据最简封装API接口,包含日线,分时分钟线,以及均线数据,可用来研究,量化分析,证券股票程序化自动化交易系统。目前提供新浪腾讯接口,…
-
ctp-sys
ctp rust binding
-
aprender-quant
K-quantization formats (Q4_K, Q5_K, Q6_K) for GGUF/APR model weights
-
iris-lib
that creates color palettes from images using the median cut algorithm
-
cliris
A cli tool that creates color palettes from images using the median cut algorithm
-
csc411_arith
Quantization of floating-point chroma values for URI CSC 411
-
qshare
量化数据:股票、期货等
-
mcq
port of Java implementation of Median Cut Quantization algorithm
-
ggml
Semi-idiomatic Rust bindings for the ggml library (from
ggml-sys) -
color-theme
A CLI tool to extract a colour 'palette' and theme color from an image
-
entrenar-inspect
SafeTensors model inspection and format conversion
-
image-reducer
Reduce image size by quantization
Try searching with DuckDuckGo.