### Reproduction ✅ use_liger_loss=true + zero2 ✅ use_liger_loss=false + zero3 ❌ use_liger_loss=true + zero3 error:size mismatch ### System Info pytorch=2.6 deepspeed=0.16.4 ### Checklist - [x] I have checked that my issue isn't already filed (see [open issues](https://github.com/huggingface/trl/issues?q=is%3Aissue)) - [x] I have included my system information - [x] Any code provided is minimal, complete, and reproducible ([more on MREs](https://docs.github.com/en/get-started/writing-on-github/working-with-advanced-formatting/creating-and-highlighting-code-blocks)) - [x] Any code provided is properly formatted in code blocks, (no screenshot, [more on code blocks](https://docs.github.com/en/get-started/writing-on-github/working-with-advanced-formatting/creating-and-highlighting-code-blocks)) - [x] Any traceback provided is complete
Reproduction
✅ use_liger_loss=true + zero2
✅ use_liger_loss=false + zero3
❌ use_liger_loss=true + zero3
error:size mismatch
System Info
pytorch=2.6
deepspeed=0.16.4
Checklist