Pull requests: OpenRLHF/OpenRLHF

Pull requests list

Fix overlong penalty action token length
#1246 opened May 30, 2026 by Jiang020609
Add TokenSpeed-backed PPO rollout engine
#1237 opened May 7, 2026 by 4teven
Add Trackio logger backend
#1230 opened Apr 27, 2026 by abidlabs
Add SFT tools field support for chat templates
#1228 opened Apr 26, 2026 by taivu1998
Replace Deepspeed backend with Automodel
#1226 opened Apr 26, 2026 by hijkzzz (Collaborator)
feat: full async PPO training with partial rollout agent support
#1218 opened Apr 11, 2026 by LYMDLUT (Collaborator)
fix: true loss aggregation across dp ranks
#1216 opened Apr 10, 2026 by alek6kun
Fast Evolutionary Algorithm Support
#1214 opened Apr 5, 2026 by DavidKoplow
feat: add --from_scratch option to initialize model with random weights
#1209 opened Apr 1, 2026 by konghw-git (Contributor), 2 tasks done
adding CFPO to OpenRLHF
#1184 opened Feb 9, 2026 by asparius
feat: Switch vLLM rollout sampling to oversampling.
#1179 opened Jan 20, 2026 by Freder-chen (Contributor)
Default overlap_comm on for ZeRO-2+ RLHF runs
#1154 opened Nov 26, 2025 by MagellaX
CLI support for top_k
#1104 opened Aug 13, 2025 by JoNeedsSleep