Zilin Wang zerlinwang

Reinforcement Learning. CS PhD@Oxford

52 followers · 104 following

Oxford
zerlinwang.github.io

Lists (1)

MARL

1 repository

Starred repositories

210 stars written in Python

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python · 969 stars · 81 forks · Updated Sep 9, 2024

PRIME-RL / SimpleVLA-RL

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python · 957 stars · 48 forks · Updated Oct 13, 2025

MrYxJ / calculate-flops.pytorch

calflops is designed to calculate FLOPs, MACs, and parameters for a wide variety of neural networks, such as Linear, CNN, RNN, GCN, and Transformer models (BERT, LLaMA, and other large language models)

Python · 893 stars · 36 forks · Updated Jun 27, 2024

google-deepmind / chex

Python · 888 stars · 59 forks · Updated Nov 5, 2025

RobertTLange / gymnax

RL Environments in JAX 🌍

Python · 826 stars · 88 forks · Updated May 30, 2025

Kautenja / gym-super-mario-bros

An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES

Python · 808 stars · 162 forks · Updated Aug 1, 2023

tencent-ailab / hok_env

Honor of Kings AI Open Environment of Tencent

Python · 773 stars · 95 forks · Updated Jul 17, 2024

instadeepai / jumanji

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python · 771 stars · 93 forks · Updated Nov 6, 2025

tinyzqh / light_mappo

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Python · 752 stars · 99 forks · Updated Oct 23, 2025

hijkzzz / pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python · 691 stars · 132 forks · Updated May 18, 2024

OpenRL-Lab / Wandb_Tutorial

How to use wandb?

Python · 682 stars · 55 forks · Updated Sep 5, 2023

RobertTLange / evosax

Evolution Strategies in JAX 🦎

Python · 680 stars · 58 forks · Updated Sep 20, 2025

FLAIROx / JaxMARL

Multi-Agent Reinforcement Learning with JAX

Python · 679 stars · 127 forks · Updated Nov 5, 2025

google-deepmind / hanabi-learning-environment

hanabi_learning_environment is a research platform for Hanabi experiments.

Python · 654 stars · 163 forks · Updated Feb 14, 2023

uoe-agents / epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python · 647 stars · 169 forks · Updated Sep 24, 2024

sfujim / BCQ

Author's PyTorch implementation of BCQ for continuous and discrete actions

Python · 643 stars · 143 forks · Updated Apr 6, 2021

opendilab / DI-drive

Decision Intelligence Platform for Autonomous Driving simulation.

Python · 618 stars · 58 forks · Updated Mar 13, 2025

google-deepmind / distrax

Python · 605 stars · 39 forks · Updated Nov 1, 2025

corl-team / CORL

Forked from tinkoff-ai/CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python · 603 stars · 37 forks · Updated Feb 10, 2024