Zilin Wang zerlinwang

Reinforcement Learning. CS PhD@Oxford

52 followers · 104 following

Oxford
zerlinwang.github.io

Achievements

Lists (1)

MARL

Starred repositories

210 stars written in Python

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python · 2,254 stars · 539 forks · Updated Jul 27, 2024

nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python · 2,225 stars · 403 forks · Updated Jul 9, 2024

fatchord / WaveRNN

WaveRNN Vocoder + TTS

Python · 2,173 stars · 697 forks · Updated Jul 2, 2022

google / gin-config

Gin provides a lightweight configuration framework for Python

Python · 2,132 stars · 119 forks · Updated Sep 22, 2025

oxwhirl / pymarl

Python Multi-Agent Reinforcement Learning framework

Python · 2,106 stars · 405 forks · Updated Dec 8, 2022

ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python · 2,102 stars · 604 forks · Updated Oct 27, 2023

google-deepmind / optax

Optax is a gradient processing and optimization library for JAX.

Python · 2,076 stars · 275 forks · Updated Nov 7, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python · 2,059 stars · 119 forks · Updated Jun 2, 2025

gotcha / ipdb

Integration of IPython pdb

Python · 1,947 stars · 151 forks · Updated Jul 28, 2025

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python · 1,782 stars · 108 forks · Updated Sep 27, 2024

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python · 1,750 stars · 349 forks · Updated Jul 18, 2024

geek-ai / MAgent

A Platform for Many-Agent Reinforcement Learning

Python · 1,748 stars · 335 forks · Updated Oct 22, 2022

Thinklab-SJTU / Bench2Drive

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

Python · 1,690 stars · 104 forks · Updated Feb 18, 2025

JunweiLiang / awesome_lists

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (Survival guide for assistant professors and PhD students)

Python · 1,611 stars · 93 forks · Updated Feb 1, 2024

XiaomiMiMo / MiMo

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python · 1,606 stars · 68 forks · Updated Jun 5, 2025

Farama-Foundation / D4RL

A collection of reference environments for offline reinforcement learning

Python · 1,602 stars · 301 forks · Updated Nov 18, 2024

takuseno / d3rlpy

An offline deep reinforcement learning library

Python · 1,583 stars · 260 forks · Updated Sep 10, 2025

yifan123 / flow_grpo

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python · 1,560 stars · 85 forks · Updated Nov 4, 2025

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python · 1,459 stars · 178 forks · Updated Nov 5, 2025

BytedanceSpeech / seed-tts-eval

Python · 1,457 stars · 133 forks · Updated Jun 14, 2024

Ceruleanacg / Personae

📈 Personae is a repo of implementations and environments of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading.

Python · 1,396 stars · 341 forks · Updated Nov 29, 2018

opendilab / DI-star

An artificial intelligence platform for StarCraft II with large-scale distributed training and grand-master agents.

Python · 1,309 stars · 122 forks · Updated Mar 13, 2025

marl / crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Python · 1,304 stars · 171 forks · Updated Aug 19, 2024

oxwhirl / smac

SMAC: The StarCraft Multi-Agent Challenge

Python · 1,280 stars · 238 forks · Updated Feb 18, 2024

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python · 1,272 stars · 161 forks · Updated Aug 3, 2023

zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

Python · 1,265 stars · 73 forks · Updated Jun 8, 2025

Replicable-MARL / MARLlib

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python · 1,202 stars · 186 forks · Updated Nov 28, 2024

metadriverse / metadrive

MetaDrive: Lightweight driving simulator for everyone

Python · 1,037 stars · 163 forks · Updated Aug 15, 2025

waymo-research / waymax

A JAX-based simulator for autonomous driving research.

Python · 983 stars · 120 forks · Updated Oct 23, 2025

danijar / dreamerv2

Mastering Atari with Discrete World Models

Python · 969 stars · 207 forks · Updated Jan 21, 2023

Starred topics

Awesome Lists