RL
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Massively Parallel Deep Reinforcement Learning. 🔥
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.
An elegant PyTorch deep reinforcement learning library.
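PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: eight lessons to help you sort out the algorithm theory, straighten out the code logic, and master practical decision AI applications)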
Fine-tune LLM agents with online reinforcement learning
Train transformer language models with reinforcement learning.
A library that provides dual dexterous hand manipulation tasks through Isaac Gym.
Tracks recent arXiv research at the intersection of LLMs and RL for control. PRs adding good papers are welcome.
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.