Skip to content
View RLAgent's full-sized avatar

Highlights

  • Pro

Block or report RLAgent

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020

Python 45 8 Updated Jul 19, 2023

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Python 238 37 Updated Apr 16, 2018

Efficient Exploration via State Marginal Matching (2019)

Python 69 12 Updated Jun 30, 2019

Gated Path Planning Networks (ICML 2018)

Python 180 36 Updated Jan 23, 2019

Weakly-Supervised RL for Controllable Behavior (NeurIPS 2020)

Python 8 3 Updated Oct 3, 2020

hanabi_learning_environment is a research platform for Hanabi experiments.

Python 666 162 Updated Feb 14, 2023

A collection of video resources for machine learning

1,555 212 Updated Jan 23, 2021

TensorFlow Reinforcement Learning

Python 3,134 384 Updated Dec 8, 2022

ICML 2018 Self-Imitation Learning

Python 275 43 Updated Apr 18, 2020

Soft Actor-Critic

Python 1,242 249 Updated Nov 29, 2023

Collection of reinforcement learning algorithms

Python 2,895 571 Updated Jun 17, 2024

Implementation of TRPO and related algorithms

Python 649 163 Updated May 20, 2018

A continuous action space version of A3C LSTM in pytorch plus A3G design

Python 259 58 Updated Oct 11, 2024

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,891 843 Updated May 29, 2022

A modern and intuitive terminal-based text editor

Go 28,402 1,301 Updated Apr 12, 2026

DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.

Python 2,167 490 Updated Apr 2, 2023

Publicly releasable baselines for the Retro contest

Python 130 57 Updated Nov 22, 2018

An End-To-End, Lightweight and Flexible Platform for Game Research

C++ 2,092 284 Updated Aug 30, 2021

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 33,952 8,054 Updated Aug 3, 2024

Variational Auto-encoder with Non-parametric Bayesian Prior

Python 44 17 Updated May 18, 2017

code for R-C3D

Jupyter Notebook 257 95 Updated Dec 22, 2019

This repo allocate DAPs code of our ECCV 2016 publication

Python 77 27 Updated Sep 22, 2018

SST: Single-Stream Temporal Action Proposal

Jupyter Notebook 68 25 Updated Jun 4, 2017

Segment-CNN: A Framework for Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs

Jupyter Notebook 234 102 Updated Mar 2, 2019

A starter agent that can solve a number of universe environments.

Python 1,103 313 Updated Apr 7, 2018

A list of resources for all invited talks, tutorials, workshops and presentations at NIPS 2016

223 35 Updated Jan 7, 2017

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Python 39,478 7,309 Updated Nov 27, 2022

Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…

Java 4,250 607 Updated Sep 3, 2025

Reinforcement Learning environments based on the 1993 game Doom :godmode:

C++ 2,005 440 Updated Mar 4, 2026

The Arcade Learning Environment (ALE) -- a platform for AI research.

C++ 2,414 464 Updated Apr 6, 2026
Next