-
RLinf Public
Forked from RLinf/RLinf
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Python Apache License 2.0 Updated Dec 31, 2025 -
RLinf_backup Public
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Python Apache License 2.0 Updated Dec 22, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 Updated Dec 16, 2024 -
so-vits-svc Public
Forked from svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Python GNU Affero General Public License v3.0 Updated Aug 3, 2023 -
Muiti-Fidelity-Simulator Public
Forked from darkrush/Multi-Fidelity-Simulator
Python Updated Apr 21, 2020 -
reinforcement-learning Public
Forked from dennybritz/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Jupyter Notebook MIT License Updated Nov 9, 2019 -
ROS_stage_test Public
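Forked from Cathy10162013/ROS_stage_test
A simple ROS Stage simulation using the turtlebot robot model. The goal is to package Stage as a standalone component so that processes not running under ROS can also use the Stage simulation.
C++ Updated Oct 9, 2018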