Skip to content

Pull requests: modelscope/ms-swift

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Improve vLLM examples regarding vllm_engine_kwargs use
#7133 opened Dec 19, 2025 by 3manifold Loading…
1 task done
[bugfix] fix response_prefix
#7126 opened Dec 19, 2025 by Jintao-Huang Loading…
[megatron] support megatron fsdp
#7117 opened Dec 18, 2025 by Jintao-Huang Loading…
[template] refactor thinking template
#7096 opened Dec 17, 2025 by Jintao-Huang Loading…
[template] support mimo-v2 template
#7095 opened Dec 17, 2025 by Jintao-Huang Loading…
[feat] support TiledMLP in Deepspeed and FSDP2
#7090 opened Dec 17, 2025 by kevssim Loading…
2 of 4 tasks
[bugfix] fix missing generate method for InternVL-2.5
#7019 opened Dec 12, 2025 by xwy-bit Loading…
1 of 4 tasks
[feat] Add Support Cut-Cross-Entropy (CCE)
#6971 opened Dec 9, 2025 by w1ida Loading…
[feat] support deepspeed elastic
#6955 opened Dec 8, 2025 by meichangsu1 Loading…
2 of 4 tasks
[WIP] [v4] refactor model_type & template
#6944 opened Dec 8, 2025 by Jintao-Huang Loading…
add muon clip optimizer
#6662 opened Nov 19, 2025 by vx120 Loading…
1 task
Add conditional distillation support for GKD trainer
#6542 opened Nov 11, 2025 by woshixiaobai2019 Loading…
3 tasks
[WIP][Exp]Support ray dpo
#6395 opened Nov 1, 2025 by tastelikefeet Loading…
1 of 4 tasks
[megatron] update megatron_args default_val
#6252 opened Oct 22, 2025 by Jintao-Huang Loading…
feat: Enable for exporting unmerged HF Lora Adapter
#6225 opened Oct 20, 2025 by jason9693 Loading…
1 of 4 tasks
[WIP] refactor template
#6085 opened Oct 11, 2025 by Jintao-Huang Loading…
update docs
#5691 opened Sep 6, 2025 by Jintao-Huang Loading…
[model] update minicpmv-4.5 video processor stale
#5679 opened Sep 5, 2025 by hjh0119 Loading…
Bug fix: eval OOM due to deepcopy of torch model stale
#5607 opened Aug 29, 2025 by hellopahe Loading…
1 task done
[init]support gptq grpo in colocate mode stale
#5569 opened Aug 27, 2025 by ItGirls Loading…
1 of 4 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.