OrangeSodahub

🎣

Fishing

xiuyu yang OrangeSodahub

🎣

Fishing

78 followers · 59 following

SJTU

Achievements

x2 x2

Achievements

x2 x2

Stars

RL

13 repositories

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,328 60 Updated Sep 5, 2025

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,145 198 Updated Dec 19, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,678 76 Updated May 11, 2025

PRIME-RL / SimpleVLA-RL

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,126 62 Updated Oct 13, 2025

OpenHelix-Team / VLA-RFT

VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning

Python 107 1 Updated Oct 6, 2025

GigaAI-research / ReconDreamer

[CVPR 2025] ReconDreamer

Python 196 17 Updated Dec 9, 2024

XueZeyue / DanceGRPO

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,347 66 Updated Oct 16, 2025

yaotingwangofficial / Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

920 26 Updated Nov 14, 2025

RLinf / RLinf

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,757 167 Updated Dec 19, 2025

WM-PO / WMPO

Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

Python 83 4 Updated Dec 9, 2025

amap-cvlab / world-env

Python 17 Updated Oct 31, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,629 2,853 Updated Dec 19, 2025

Richard-Zhang-AI / MIND-V

Python 26 Updated Dec 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xiuyu yang OrangeSodahub

Achievements

Achievements

Block or report OrangeSodahub

RL

xhyumiracle / Awesome-AgenticLLM-RL-Papers

NVIDIA-NeMo / RL

BytedTsinghua-SIA / DAPO

PRIME-RL / SimpleVLA-RL

OpenHelix-Team / VLA-RFT

GigaAI-research / ReconDreamer

XueZeyue / DanceGRPO

yaotingwangofficial / Awesome-MCoT

RLinf / RLinf

WM-PO / WMPO

amap-cvlab / world-env

volcengine / verl

Richard-Zhang-AI / MIND-V