Poet-LiBai

Stars

RL

Reinforcement learning
341 repositories

Deep Reinforcement Learning: Zero to Hero!

Jupyter Notebook 2,231 101 Updated Oct 27, 2025

🔬 A curated list of awesome LLMs, deep learning strategies, and tools for financial markets.

4,682 506 Updated Nov 3, 2025

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

Jupyter Notebook 4,452 631 Updated Jun 30, 2020

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,885 8,716 Updated Oct 11, 2024

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 5,811 2,825 Updated Dec 20, 2025
Python 969 111 Updated Jan 23, 2025

The reinforcement learning training code for AgiBot X1.

Python 1,619 501 Updated Oct 23, 2024

Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695

Python 1,769 235 Updated Jan 26, 2025

Isaac Gym Environments for Legged Robots

Python 2,566 529 Updated May 29, 2025

The unitree_il_lerobot open-source project is a modification of the LeRobot open-source training framework, enabling the training and testing of data collected using the dual-arm dexterous hands of…

Python 499 67 Updated Dec 11, 2025
Python 1,039 129 Updated Oct 27, 2025
Python 8 2 Updated Oct 15, 2024

[IROS 2025] Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies

Python 211 11 Updated Jun 16, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,932 139 Updated Dec 6, 2024

Code for training locomotion policies with RL

Python 302 34 Updated Sep 30, 2023

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 788 94 Updated Dec 1, 2025

Recipes to train reward models for RLHF.

Python 1,490 107 Updated Apr 24, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,537 80 Updated May 30, 2025

Repository for most of the code from my YouTube channel

Python 933 492 Updated Jul 24, 2023

Deep RL for MPC control of Quadruped Robot Locomotion

Python 867 87 Updated Jul 18, 2025

NMPC, WBC, state estimation, and sim2real framework for legged robots based on OCS2 and ros-controls

C++ 1,513 321 Updated Feb 13, 2025

Software tools for agile quadrupeds, developed by the Robomechanics Lab at Carnegie Mellon University.

C++ 893 162 Updated Nov 14, 2025

An open source implementation of MIT Cheetah 3 controllers

C++ 795 150 Updated Dec 5, 2022

Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"

Python 1,370 314 Updated Mar 24, 2023

Deep reinforcement learning without experience replay, target networks, or batch updates.

Python 272 31 Updated Mar 18, 2025

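PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons that help you sort out the algorithm theory, straighten out the code logic, and master practical decision-AI applications)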

Python 2,459 204 Updated Mar 13, 2025

The world's smallest desktop-class two-wheeled legged robot!

C++ 2,537 380 Updated Dec 12, 2024

Natural Language Reinforcement Learning

Python 100 7 Updated Jul 30, 2025

Unified Reinforcement Learning Framework

Python 799 79 Updated Sep 6, 2024