Stars
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
(CoRL 2019) Driving in CARLA using waypoint prediction and two-stage imitation learning
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
一款自动为你的色图进行哈基米马赛克处理的打码器😎再也不用担心家里请不到高人了|自动哈基米打码器
Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication
Code for the paper "Phasic Policy Gradient"
Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch
An easier PyTorch deep reinforcement learning library.
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Clean baseline implementation of PPO using an episodic TransformerXL memory
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and e…
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
Adaptive Attention Span for Reinforcement Learning
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
Challenging Memory-based Deep Reinforcement Learning Agents
2D Simulator for Smart Decision in ICRA 2019 RoboMaster AI Challenge
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
Baselines for Neural MMO -- new users should treat this repo as a starter project
A3C-LSTM algorithm tested on CartPole OpenAI Gym environment
Official implementation of the paper "Learning Coverage Paths in Unknown Environments with Deep Reinforcement Learning"
Benchmarks of different devices I have come across
qiwihui / spinningup
Forked from openai/spinningupOpenAI团队的深度强化学习教程中文版
[S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".
An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.