Poet-LiBai

24 followers · 359 following

Stars

RL

Reinforcement learning

341 repositories

AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

Python · 4,256 stars · 966 forks · Updated Dec 6, 2025

google-deepmind / acme

A library of reinforcement learning components and agents

Python · 3,877 stars · 518 forks · Updated Dec 2, 2025

pytorch / ELF

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

C++ · 3,413 stars · 571 forks · Updated Jun 21, 2019

rlcode / reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

Python · 3,610 stars · 741 forks · Updated Mar 24, 2023

google-research / football

Check out the new game server:

Python · 3,531 stars · 1,342 forks · Updated Jun 17, 2025

junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python · 3,561 stars · 1,022 forks · Updated Apr 24, 2024

PaddlePaddle / PARL

A high-performance distributed training framework for Reinforcement Learning

Python · 3,433 stars · 822 forks · Updated Sep 13, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

4,245 stars · 250 forks · Updated Dec 9, 2025

seungeunrho / minimalRL

Implementations of basic RL algorithms with minimal lines of code! (PyTorch-based)

Python · 3,121 stars · 485 forks · Updated Apr 22, 2023

datamllab / rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Python · 3,335 stars · 714 forks · Updated Jun 26, 2024

tensorflow / agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Python · 2,981 stars · 745 forks · Updated Dec 7, 2025

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB · 13,897 stars · 1,303 forks · Updated Oct 28, 2025

Farama-Foundation / HighwayEnv

A minimalist environment for decision-making in autonomous driving

Python · 3,116 stars · 840 forks · Updated Oct 18, 2025

Farama-Foundation / PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python · 3,240 stars · 465 forks · Updated Nov 25, 2025

werner-duvaud / muzero-general

MuZero

Python · 2,743 stars · 666 forks · Updated Sep 3, 2024

google-deepmind / mctx

Monte Carlo tree search in JAX

Python · 2,573 stars · 209 forks · Updated Sep 2, 2025

tirthajyoti / Papers-Literature-ML-DL-RL-AI

Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning

2,773 stars · 786 forks · Updated Feb 18, 2023

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

Python · 2,376 stars · 203 forks · Updated Mar 1, 2024

Zeta36 / chess-alpha-zero

Chess reinforcement learning by AlphaGo Zero methods.

Jupyter Notebook · 2,201 stars · 479 forks · Updated Mar 24, 2023

facebookresearch / ELF

An End-To-End, Lightweight and Flexible Platform for Game Research

C++ · 2,094 stars · 285 forks · Updated Aug 30, 2021

pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python · 3,219 stars · 425 forks · Updated Dec 18, 2025

DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python · 2,671 stars · 580 forks · Updated Dec 15, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python · 8,635 stars · 838 forks · Updated Dec 18, 2025

facebookresearch / habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python · 2,749 stars · 614 forks · Updated Oct 12, 2025

Curt-Park / rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Jupyter Notebook · 2,003 stars · 349 forks · Updated Sep 26, 2025

Yvictor / TradingGym

Trading and backtesting environment for training reinforcement learning agents or simple rule-based algorithms.

Python · 1,776 stars · 357 forks · Updated Feb 11, 2024

tigerneil / awesome-deep-rl

For deep RL and the future of AI.

HTML · 1,498 stars · 221 forks · Updated Mar 1, 2024

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python · 1,568 stars · 128 forks · Updated Nov 24, 2025

kengz / SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Python · 1,324 stars · 280 forks · Updated Dec 21, 2025

MorvanZhou / Evolutionary-Algorithm

Evolutionary Algorithm using Python, 莫烦Python (Morvan Python) Chinese-language AI tutorials

Python · 1,231 stars · 629 forks · Updated Nov 26, 2023