Skip to content

feat(gr00t): add n1.6 training documentation and FSDP configs#1079

Open
yangzhongii wants to merge 71 commits into
RLinf:mainfrom
yangzhongii:feature/gr00t_N1.6_sft_ppo
Open

feat(gr00t): add n1.6 training documentation and FSDP configs#1079
yangzhongii wants to merge 71 commits into
RLinf:mainfrom
yangzhongii:feature/gr00t_N1.6_sft_ppo

Conversation

@yangzhongii
Copy link
Copy Markdown

This PR supersedes #975 and fixes the branch naming convention.

Description

Motivation and Context

How has this been tested?

Additional information (optional, e.g., figures and logs):

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Documentation update (Document-only update)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

yangzhongii and others added 30 commits March 21, 2026 05:57
Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
@yangzhongii yangzhongii force-pushed the feature/gr00t_N1.6_sft_ppo branch 2 times, most recently from a4a9825 to 93b5cf0 Compare April 30, 2026 17:10
@zoeyuchao
Copy link
Copy Markdown
Collaborator

results SFT, compare with official gr00t repo

Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
@yangzhongii yangzhongii force-pushed the feature/gr00t_N1.6_sft_ppo branch from 48209d7 to 886ccbb Compare May 8, 2026 02:50
Signed-off-by: zhonghaoyang <zhy26399@outlook.com>
@yangzhongii yangzhongii force-pushed the feature/gr00t_N1.6_sft_ppo branch from 87c3d0c to f7f8090 Compare May 8, 2026 07:19
Signed-off-by: Haoyang Zhong <zhy26399@outlook.com>
Signed-off-by: Haoyang Zhong <zhy26399@outlook.com>
Signed-off-by: Haoyang Zhong <zhy26399@outlook.com>
@yangzhongii yangzhongii force-pushed the feature/gr00t_N1.6_sft_ppo branch 2 times, most recently from a238990 to 50ae482 Compare May 11, 2026 05:45
Signed-off-by: Haoyang Zhong <zhy26399@outlook.com>
@yangzhongii yangzhongii force-pushed the feature/gr00t_N1.6_sft_ppo branch from 0f64aa6 to 1a1ff47 Compare May 11, 2026 05:52
@Iron-Wph
Copy link
Copy Markdown
Collaborator

@yangzhongii
Hello !

I ran the SFT e2e test, and the result was normal, as shown in the screenshot below
PixPin_2026-05-14_21-41-24

I ran the PPO e2e test, and the result was normal, as shown in the screenshot below
PixPin_2026-05-14_21-41-55

I ran the gr00t evaluation using the following command with RLinf-Gr00t-N1.6-RL-Spatial-Step50 ckpt ,and the result was normal, as shown in the screenshot below

bash examples/embodiment/eval_embodiment.sh libero_spatial_ppo_gr00t_16
PixPin_2026-05-14_21-42-16

I ran the gr00t evaluation with RLinf-Gr00t-N1.6-RL-Spatial ckpt, and the result was normal, as shown in the screenshot below
PixPin_2026-05-14_21-43-35

I ran the gr00t PPO training using the following command ,and the result was normal, as shown in the screenshot below

bash examples/embodiment/run_embodiment.sh libero_spatial_ppo_gr00t_16
PixPin_2026-05-14_21-43-47

I ran the gr00t SFT training using the following command ,and the result was normal, as shown in the screenshot below

bash examples/sft/run_vla_sft.sh libero_sft_gr00t_16
PixPin_2026-05-14_21-44-39

@Iron-Wph Iron-Wph added review-reproduced This label is for PRs which has been verified and reproduced by a reviewer. and removed Student Review labels May 14, 2026
@zoeyuchao zoeyuchao added Needs Review! run-ci Enable CI tests labels May 15, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Embodied AI Needs Review! new-feature New feature review-reproduced This label is for PRs which has been verified and reproduced by a reviewer. run-ci Enable CI tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants