Skip to content

Pull requests: intel/auto-round

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

add inc integration unit test
#1820 opened May 15, 2026 by xin3he Contributor Loading…
4 tasks
fix few merge errors
#1817 opened May 14, 2026 by n1ck-guo Contributor Loading…
4 tasks
Fix forwarding of ExtraConfig overrides
#1816 opened May 14, 2026 by dhruvil237 Loading…
2 of 4 tasks
0.13.0
add mimo-audio, Qwen-TTS model backbone quantization
#1810 opened May 13, 2026 by WeiweiZhang1 Contributor Loading…
3 of 4 tasks
0.13.0
add auto_round_rtn cli and remove fast
#1808 opened May 13, 2026 by n1ck-guo Contributor Loading…
4 tasks
0.13.0
Support Exporting Block-Wise FP8 AR Format
#1798 opened May 11, 2026 by Zhenzhong1 Contributor Loading…
support quarot/spinquant rotation before quantization
#1797 opened May 11, 2026 by lkk12014402 Contributor Loading… 0.13.0
Refine device by vibecoding
#1790 opened May 8, 2026 by wenhuach21 Contributor Loading…
4 tasks
Reduce VRAM usage of quantizing VLM models
#1777 opened May 4, 2026 by lvliang-intel Contributor Loading…
1 of 4 tasks
0.13.0
Fix QDQ inference OOM issue.
#1763 opened Apr 29, 2026 by changwangss Loading…
Awq algorithm
#1749 opened Apr 28, 2026 by WeiweiZhang1 Contributor Loading…
3 of 4 tasks
0.13.0
fix qwen3.6 vllm infer bug ready only add when the PR is ready to merge
#1746 opened Apr 27, 2026 by n1ck-guo Contributor Loading…
4 tasks
0.13.0
Fix rotation
#1724 opened Apr 23, 2026 by wenhuach21 Contributor Loading…
2 of 9 tasks
feat: support Nemotron-H / Nemotron-Cascade-2 (#1711)
#1712 opened Apr 20, 2026 by michael-rabe Loading…
4 of 9 tasks
Continuously optimize AutoScheme RAM consumption
#1703 opened Apr 17, 2026 by lvliang-intel Contributor Loading…
2 of 9 tasks
Fix Qwen Omni quantization model issue for long form audio generation
#1698 opened Apr 17, 2026 by lvliang-intel Contributor Loading…
2 of 9 tasks
Feats: Quantize/save/evaluate the Wan-AI/WAN2.2 models in w4a16 format
#1678 opened Apr 14, 2026 by lvliang-intel Contributor Loading…
2 of 9 tasks
adjust gguf tuning algorithm
#1649 opened Apr 2, 2026 by wenhuach21 Contributor Loading…
2 of 9 tasks
0.13.0
ProTip! Type g i on any issue or pull request to go back to the issue listing page.