dandan-DoubleD

Follow

dandan-DoubleD

Follow

1 follower · 1 following

Popular repositories Loading

OpenRLHF OpenRLHF Public

Forked from OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python
RL_arm_under_sparse_reward RL_arm_under_sparse_reward Public

Forked from PiggyCh/RL_arm_under_sparse_reward

A reinforcement learning project for robotic arm under sparse reward

Python
RLMujoco RLMujoco Public

Forked from Jitu0110/RLMujoco

SAC, PPO, A2C implementation on Mujoco environments : Humanoid-v4, Ant-v4, Cheetah-v4 . Includes reward manipulation.

Python
stable-baselines3 stable-baselines3 Public

Forked from DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python
rlcard rlcard Public

Forked from datamllab/rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Python
open_spiel open_spiel Public

Forked from google-deepmind/open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++