Skip to content
View GeminiLight's full-sized avatar

Highlights

  • Pro

Block or report GeminiLight

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

RL

Reinforcement Learning
45 repositories
Python 33 4 Updated Nov 21, 2022

Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

Python 148 27 Updated Jan 12, 2019

Curiosity-driven Exploration by Self-supervised Prediction

Python 147 32 Updated Mar 12, 2023

PyTorch implementation of deep reinforcement learning algorithms

Python 487 57 Updated Nov 19, 2021

Reinforcement Learning Algorithms Based on PyTorch

Python 453 93 Updated Oct 21, 2021

Series of deep reinforcement learning algorithms 🤖

Jupyter Notebook 29 12 Updated Jun 19, 2021
C++ 43 5 Updated Feb 8, 2026

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…

Python 368 60 Updated Nov 9, 2022

StarCraft II Learning Environment

Python 8,296 1,159 Updated Jul 23, 2024

An offline deep reinforcement learning library

Python 1,664 265 Updated Sep 10, 2025

Multi-Agent Reinforcement Learning (MARL) papers

299 41 Updated Sep 19, 2022

A library of reinforcement learning components and agents

Python 4,003 540 Updated Apr 8, 2026

Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning

Python 17 5 Updated Mar 11, 2020

Learn to Steer through Deep Reinforcement Learning

Python 5 1 Updated Aug 22, 2019

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

Python 807 181 Updated May 29, 2022

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python 3,617 1,023 Updated Apr 24, 2024

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,340 974 Updated Feb 20, 2026

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python 3,023 678 Updated May 7, 2026

A pack of reinforcement learning algorithms.

Python 84 13 Updated Oct 26, 2021

This project is implementation code of AlphaStar

Python 207 28 Updated Jan 19, 2024

(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning

Python 122 22 Updated Feb 3, 2023

This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit

Python 21 3 Updated Sep 10, 2016

Multi-Objective Reinforcement Learning

Python 307 57 Updated Aug 10, 2021
Python 1,053 308 Updated Jan 29, 2023
Python 163 47 Updated May 3, 2019

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Python 605 92 Updated Oct 28, 2020

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,631 899 Updated Mar 24, 2023
Python 4 Updated Mar 3, 2021

Environments for OR and RL Research

Python 446 99 Updated Oct 12, 2023