duyihuacn

duyihuacn

Stars

intel / auto-round

A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

Python 1,484 143 Updated Jun 23, 2026

jonny-d / openai_reproduction

Code to train a mLSTM language model using multiple GPUs

Python 7 4 Updated Mar 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

duyihuacn

Block or report duyihuacn

Stars

intel / auto-round

jonny-d / openai_reproduction