Skip to content
View OrangeSodahub's full-sized avatar
🎣
Fishing
🎣
Fishing

Block or report OrangeSodahub

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

RL

13 repositories

Scalable toolkit for efficient model reinforcement

Python 1,145 198 Updated Dec 19, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,678 76 Updated May 11, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,126 62 Updated Oct 13, 2025

VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning

Python 107 1 Updated Oct 6, 2025

[CVPR 2025] ReconDreamer

Python 196 17 Updated Dec 9, 2024

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,347 66 Updated Oct 16, 2025

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

920 26 Updated Nov 14, 2025

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,757 167 Updated Dec 19, 2025

Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

Python 83 4 Updated Dec 9, 2025
Python 17 Updated Oct 31, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,629 2,853 Updated Dec 19, 2025
Python 26 Updated Dec 9, 2025