Skip to content

Renaming of internal variables: async_reward_X to async_X#5616

Merged
qgallouedec merged 8 commits into
mainfrom
reduce-diff-with-grpo
Apr 24, 2026
Merged

Renaming of internal variables: async_reward_X to async_X#5616
qgallouedec merged 8 commits into
mainfrom
reduce-diff-with-grpo

Conversation

@qgallouedec

@qgallouedec qgallouedec commented Apr 21, 2026

Copy link
Copy Markdown
Member

This reduces a bit the number of line diff between grpo and rloo, and is harmless


Note

Low Risk
Low risk rename-only refactor of internal async reward-loop attributes and helper coroutine names, with no logic changes to reward computation.

Overview
Renames RLOOTrainer’s internal async reward-loop fields from async_reward_* to shorter async_* (e.g., _has_async_funcs, async_loop, async_loop_thread, async_loop_ready_event) and updates the async invocation helper used by asyncio.gather accordingly.

No functional changes are introduced; this is a consistency/maintenance refactor around async reward execution and shutdown registration.

Reviewed by Cursor Bugbot for commit caec6bf. Bugbot is set up for automated code reviews on this repo. Configure here.

@cursor cursor Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Fix All in Cursor

❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.

Reviewed by Cursor Bugbot for commit e7ad7c7. Configure here.

Comment thread trl/trainer/rloo_trainer.py Outdated
@HuggingFaceDocBuilderDev

Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qgallouedec qgallouedec requested review from AmineDiro and kashif April 21, 2026 15:44

@albertvillanova albertvillanova left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks.

@qgallouedec qgallouedec merged commit 43cbd78 into main Apr 24, 2026
13 checks passed
@qgallouedec qgallouedec deleted the reduce-diff-with-grpo branch April 24, 2026 14:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants