trl Public
Forked from huggingface/trl
Train transformer language models with reinforcement learning.
stable-baselines3 Public
Forked from DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Python MIT License Updated Dec 29, 2024
open_x_embodiment Public
Forked from google-deepmind/open_x_embodiment
Jupyter Notebook Apache License 2.0 Updated Nov 27, 2024
simba Public
Forked from SonyResearch/simba
synthetic-corpus-vocoder Public
Official repository for the paper "A SYNTHETIC CORPUS GENERATION METHOD FOR NEURAL VOCODER TRAINING"
ModuMorph Public
Forked from MasterXiong/ModuMorph
Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023
Python Updated Jul 3, 2023
DI-engine Public
Forked from opendilab/DI-engine
OpenDILab Decision AI Engine
Python Apache License 2.0 Updated Jul 3, 2023
DI-adventure Public
Forked from opendilab/DI-adventure
Decision Intelligence Adventure for Beginners
Python Apache License 2.0 Updated Jun 16, 2023
OfflineRL-Kit Public
Forked from yihaosun1124/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
Python MIT License Updated May 22, 2023
minRLHF Public
Forked from thomfoster/minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
RL4LMs Public
Forked from allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
homework_fall2022 Public
Forked from berkeleydeeprlcourse/homework_fall2022
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)
Jupyter Notebook Updated Feb 2, 2023
DI-engine-docs Public
Forked from opendilab/DI-engine-docs
DI-engine docs (Chinese and English)
Python Apache License 2.0 Updated Dec 27, 2022
MARLlib Public
Forked from Replicable-MARL/MARLlib
The MARL extension for RLlib. A benchmark for research and industry.
Python MIT License Updated Dec 10, 2022
drqv2 Public
Forked from facebookresearch/drqv2
DrQ-v2: Improved Data-Augmented Reinforcement Learning
Python MIT License Updated Dec 5, 2022
DA-in-visualRL Public
Forked from Guozheng-Ma/DA-in-visualRL
Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).
Updated Nov 29, 2022
off-policy Public
Forked from marlbenchmark/off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
Python MIT License Updated Nov 29, 2022
on-policy Public
Forked from marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Python MIT License Updated Nov 22, 2022
cleanrl Public
Forked from vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Python Other Updated Nov 19, 2022
s2client-proto Public
Forked from Blizzard/s2client-proto
StarCraft II Client - protocol definitions used to communicate with StarCraft II.
Python MIT License Updated Nov 16, 2022
Mask-based-Latent-Reconstruction Public
Forked from microsoft/Mask-based-Latent-Reconstruction
This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).
Python MIT License Updated Nov 1, 2022
pymarl2 Public
Forked from hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Python Apache License 2.0 Updated Oct 22, 2022
MoTIF Public
Forked from aburns4/MoTIF
Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments
Jupyter Notebook Updated Oct 20, 2022