Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, Brax and other environments

Python 895 113 Updated Oct 22, 2025

amirhossein-kz / Awesome-Diffusion-Models-in-Medical-Imaging

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,976 166 Updated Aug 26, 2025

opendilab / Mastermind

[ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis

Python 11 Updated Apr 27, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,645 7,514 Updated Jun 4, 2025

Notify-ctrl / motalang

臸娥粂陆亩竟

JavaScript 10 2 Updated May 11, 2024

microsoft / WSL

Windows Subsystem for Linux

C++ 30,269 1,508 Updated Nov 5, 2025

kshitijsanghvi / tetris

A deep reinforcement learning agent learning to play the classical game of Tetris.

Python 4 1 Updated Dec 12, 2020

opendilab / GenerativeRL

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

Python 158 12 Updated Feb 18, 2025

frinkleko / AutoHajimiMosaic

一款自动为你的色图进行哈基米马赛克处理的打码器😎再也不用担心家里请不到高人了|自动哈基米打码器

Python 277 15 Updated Jun 18, 2025

datawhalechina / distil-rl-introduction

An reconstruction of RL Introduction and its course materials for a more efficient entry

16 3 Updated May 23, 2025

Rotwall72 / Threatmixer

A webpage centered around Rain World and its many threat themes.

JavaScript 16 6 Updated Nov 2, 2025

AgileRL / AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.

Python 840 66 Updated Nov 4, 2025

CleanDiffuserTeam / CleanDiffuser

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 654 63 Updated Apr 20, 2025

uoe-agents / epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 646 169 Updated Sep 24, 2024

computerhistory / AlexNet-Source-Code

This package contains the original 2012 AlexNet code.

Cuda 2,762 356 Updated Mar 12, 2025

EdanToledo / Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 366 42 Updated Oct 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0Pinky0

Block or report 0Pinky0

Stars

BUAA-TrustworthyMARL / adv_marl_benchmark

openpsi-project / srl

google-deepmind / disco_rl

svc-develop-team / so-vits-svc

TsinghuaC3I / Awesome-RL-for-LRMs

google / evojax

clashverge-dev / clash

Reytuag / transformerXL_PPO_JAX

facebookresearch / ConvNeXt

ziwenhahaha / Code-of-RL-Beginning

HW-whistleblower / True-Story-of-Pangu

hzwer / WritingAIPaper

kexinoh / kaiwu_obs_auto

patrick-kidger / jaxtyping

Toni-SM / skrl