PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,891 843 Updated May 29, 2022

micro-editor / micro

A modern and intuitive terminal-based text editor

Go 28,402 1,301 Updated Apr 12, 2026

openai / roboschool

DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.

Python 2,167 490 Updated Apr 2, 2023

openai / retro-baselines

Publicly releasable baselines for the Retro contest

Python 130 57 Updated Nov 22, 2018

facebookresearch / ELF

An End-To-End, Lightweight and Flexible Platform for Game Research

C++ 2,092 284 Updated Aug 30, 2021

CMU-Perceptual-Computing-Lab / openpose

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 33,952 8,054 Updated Aug 3, 2024

bobchennan / VAE_NBP

Variational Auto-encoder with Non-parametric Bayesian Prior

Python 44 17 Updated May 18, 2017

VisionLearningGroup / R-C3D

code for R-C3D

Jupyter Notebook 257 95 Updated Dec 22, 2019

escorciav / daps

This repo allocate DAPs code of our ECCV 2016 publication

Python 77 27 Updated Sep 22, 2018

ranjaykrishna / SST

SST: Single-Stream Temporal Action Proposal

Jupyter Notebook 68 25 Updated Jun 4, 2017

zhengshou / scnn

Segment-CNN: A Framework for Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs

Jupyter Notebook 234 102 Updated Mar 2, 2019

openai / universe-starter-agent

A starter agent that can solve a number of universe environments.

Python 1,103 313 Updated Apr 7, 2018

hindupuravinash / nips2016

A list of resources for all invited talks, tutorials, workshops and presentations at NIPS 2016

223 35 Updated Jan 7, 2017

floodsung / Deep-Learning-Papers-Reading-Roadmap

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Python 39,478 7,309 Updated Nov 27, 2022

microsoft / malmo

Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…

Java 4,250 607 Updated Sep 3, 2025

Farama-Foundation / ViZDoom

Reinforcement Learning environments based on the 1993 game Doom

C++ 2,005 440 Updated Mar 4, 2026

Farama-Foundation / Arcade-Learning-Environment

The Arcade Learning Environment (ALE) -- a platform for AI research.

C++ 2,414 464 Updated Apr 6, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lisa Lee RLAgent

Achievements

Achievements

Highlights

Block or report RLAgent

Stars

twni2016 / f-IRL

devendrachaplot / DeepRL-Grounding

RLAgent / state-marginal-matching

RLAgent / gated-path-planning-networks

google-research / weakly_supervised_control

google-deepmind / hanabi-learning-environment

dustinvtran / ml-videos

google-deepmind / trfl

junhyukoh / self-imitation-learning

haarnoja / sac

rail-berkeley / rlkit

joschu / modular_rl

dgriff777 / a3c_continuous

ikostrikov / pytorch-a2c-ppo-acktr-gail