GeminiLight

Tianfu Wang GeminiLight

PhD in AI @ HKUST

102 followers · 21 following

USTC
Guangzhou, China
tianfuwang.tech

Stars

RL

Reinforcement Learning

45 repositories

wangyuhuix / TRGPPO

Python · 33 stars · 4 forks · Updated Nov 21, 2022

adik993 / ppo-pytorch

Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)

Python · 148 stars · 27 forks · Updated Jan 12, 2019

jcwleo / curiosity-driven-exploration-pytorch

Curiosity-driven Exploration by Self-supervised Prediction

Python · 147 stars · 32 forks · Updated Mar 12, 2023

MadryLab / implementation-matters

Python · 136 stars · 18 forks · Updated Jul 25, 2024

dongminlee94 / deep_rl

PyTorch implementation of deep reinforcement learning algorithms

Python · 487 stars · 57 forks · Updated Nov 19, 2021

StepNeverStop / RLs

Reinforcement Learning Algorithms Based on PyTorch

Python · 453 stars · 93 forks · Updated Oct 21, 2021

DarylRodrigo / rl_lib

Series of deep reinforcement learning algorithms 🤖

Jupyter Notebook · 29 stars · 12 forks · Updated Jun 19, 2021

google-research / tf-opt

C++ · 43 stars · 5 forks · Updated Feb 8, 2026

liuruoze / mini-AlphaStar

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…

Python · 368 stars · 60 forks · Updated Nov 9, 2022

google-deepmind / pysc2

StarCraft II Learning Environment

Python · 8,296 stars · 1,159 forks · Updated Jul 23, 2024

takuseno / d3rlpy

An offline deep reinforcement learning library

Python · 1,664 stars · 265 forks · Updated Sep 10, 2025

TimeBreaker / Multi-Agent-Reinforcement-Learning-papers

Multi-Agent Reinforcement Learning (MARL) papers

299 stars · 41 forks · Updated Sep 19, 2022

google-deepmind / acme

A library of reinforcement learning components and agents

Python · 4,003 stars · 540 forks · Updated Apr 8, 2026

renweiya / RFQ-RFAC

Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning

Python · 17 stars · 5 forks · Updated Mar 11, 2020

KerryWu16 / BND-DDQN

Learn to Steer through Deep Reinforcement Learning

Python · 5 stars · 1 fork · Updated Aug 22, 2019

shariqiqbal2810 / MAAC

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

Python · 807 stars · 181 forks · Updated May 29, 2022

junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python · 3,617 stars · 1,023 forks · Updated Apr 24, 2024

AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

Python · 4,340 stars · 974 forks · Updated Feb 20, 2026

facebookresearch / habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python · 3,023 stars · 678 forks · Updated May 7, 2026

liber145 / rlpack

A pack of reinforcement learning algorithms.

Python · 84 stars · 13 forks · Updated Oct 26, 2021

kimbring2 / AlphaStar_Implementation

This project is an implementation of AlphaStar

Python · 207 stars · 28 forks · Updated Jan 19, 2024

atavakol / action-branching-agents

(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning

Python · 122 stars · 22 forks · Updated Feb 3, 2023

jvking / reddit-RL-simulator

This repository provides simulator code for predicting and tracking popular discussion threads on Reddit

Python · 21 stars · 3 forks · Updated Sep 10, 2016

RunzheYang / MORL

Multi-Objective Reinforcement Learning

Python · 307 stars · 57 forks · Updated Aug 10, 2021

louisnino / RLcode

Python · 1,053 stars · 308 forks · Updated Jan 29, 2023

rcheng805 / RL-CBF

Python · 163 stars · 47 forks · Updated May 3, 2019

MishaLaskin / curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Python · 605 stars · 92 forks · Updated Oct 28, 2020

sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python · 4,631 stars · 899 forks · Updated Mar 24, 2023

CherryPieSexy / learn_to_move

Python · 4 stars · Updated Mar 3, 2021

hubbs5 / or-gym

Environments for OR and RL Research

Python · 446 stars · 99 forks · Updated Oct 12, 2023
