RL
Massively Parallel Deep Reinforcement Learning. 🔥
A library of reinforcement learning components and agents
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
Minimal and Clean Reinforcement Learning Examples
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
A high-performance distributed training framework for Reinforcement Learning
A curated list of reinforcement learning with human feedback resources (continually updated)
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
A minimalist environment for decision-making in autonomous driving
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
A modular RL library to fine-tune language models to human preferences
Chess reinforcement learning by AlphaGo Zero methods.
An End-To-End, Lightweight and Flexible Platform for Game Research
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Evolutionary Algorithm using Python, 莫烦Python 中文AI教学