Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, Brax and other environments

Python 897 113 Updated Oct 22, 2025

amirhossein-kz / Awesome-Diffusion-Models-in-Medical-Imaging

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,977 167 Updated Aug 26, 2025

opendilab / Mastermind

[ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis

Python 11 Updated Apr 27, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,700 7,520 Updated Jun 4, 2025

Notify-ctrl / motalang

臸娥粂陆亩竟

JavaScript 10 2 Updated May 11, 2024

microsoft / WSL

Windows Subsystem for Linux

C++ 30,276 1,509 Updated Nov 6, 2025

kshitijsanghvi / tetris

A deep reinforcement learning agent learning to play the classical game of Tetris.

Python 4 1 Updated Dec 12, 2020

opendilab / GenerativeRL

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

Python 159 12 Updated Feb 18, 2025

frinkleko / AutoHajimiMosaic

一款自动为你的色图进行哈基米马赛克处理的打码器😎再也不用担心家里请不到高人了|自动哈基米打码器

Python 278 15 Updated Jun 18, 2025

datawhalechina / distil-rl-introduction

An reconstruction of RL Introduction and its course materials for a more efficient entry

16 3 Updated May 23, 2025

Rotwall72 / Threatmixer

A webpage centered around Rain World and its many threat themes.

JavaScript 16 6 Updated Nov 2, 2025

AgileRL / AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.

Python 841 67 Updated Nov 4, 2025

CleanDiffuserTeam / CleanDiffuser

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 654 63 Updated Apr 20, 2025

uoe-agents / epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 647 169 Updated Sep 24, 2024

computerhistory / AlexNet-Source-Code

This package contains the original 2012 AlexNet code.

Cuda 2,765 358 Updated Mar 12, 2025

EdanToledo / Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 368 42 Updated Oct 29, 2025

Mai-with-u / MaiBot

麦麦bot，一款专注于群组聊天的赛博网友（比较专注）多平台智能体

Python 3,532 382 Updated Nov 6, 2025

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 10,563 1,177 Updated Nov 6, 2025

StarCycle / Awesome-Embodied-AI-Job

Lumina Robotics Talent Call | Lumina社区具身智能招贤榜 | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, full-time, etc

1,042 22 Updated Oct 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0Pinky0

Block or report 0Pinky0

Stars

BUAA-TrustworthyMARL / adv_marl_benchmark

openpsi-project / srl

google-deepmind / disco_rl

TsinghuaC3I / Awesome-RL-for-LRMs

clashverge-dev / clash

Reytuag / transformerXL_PPO_JAX

ziwenhahaha / Code-of-RL-Beginning

HW-whistleblower / True-Story-of-Pangu

hzwer / WritingAIPaper

kexinoh / kaiwu_obs_auto

patrick-kidger / jaxtyping

Toni-SM / skrl