Zilin Wang zerlinwang

Reinforcement Learning. CS PhD@Oxford

52 followers · 104 following

Oxford
zerlinwang.github.io

trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python 1 Apache License 2.0 Updated Aug 18, 2025
jaxpruner Public

Python 1 Apache License 2.0 Updated Jan 26, 2025
zerlinwang.github.io Public

HTML MIT License Updated Jan 14, 2025
stable-baselines3 Public
Forked from DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python MIT License Updated Dec 29, 2024
zerlinwang Public

Blog

Updated Dec 7, 2024
open_x_embodiment Public
Forked from google-deepmind/open_x_embodiment

Jupyter Notebook Apache License 2.0 Updated Nov 27, 2024
simba Public
Forked from SonyResearch/simba

Jupyter Notebook 1 Apache License 2.0 Updated Oct 15, 2024
synthetic-corpus-vocoder Public

Official repository for the paper "A SYNTHETIC CORPUS GENERATION METHOD FOR NEURAL VOCODER TRAINING"

Python 1 Updated Jun 29, 2024
SenseXAMP Public
Forked from William-Zhanng/SenseXAMP

Python Updated Aug 17, 2023
ModuMorph Public
Forked from MasterXiong/ModuMorph

Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023

Python Updated Jul 3, 2023
DI-engine Public
Forked from opendilab/DI-engine

OpenDILab Decision AI Engine

Python Apache License 2.0 Updated Jul 3, 2023
DI-adventure Public
Forked from opendilab/DI-adventure

Decision Intelligence Adventure for Beginners

Python Apache License 2.0 Updated Jun 16, 2023
OfflineRL-Kit Public
Forked from yihaosun1124/OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

Python MIT License Updated May 22, 2023
rl-papers Public
Forked from datawhalechina/rl-papers

rl-papers

Updated Mar 17, 2023
minRLHF Public
Forked from thomfoster/minRLHF

A (somewhat) minimal library for finetuning language models with PPO on human feedback.

Python 2 Updated Mar 11, 2023
RL4LMs Public
Forked from allenai/RL4LMs

A modular RL library to fine-tune language models to human preferences

Python 1 Apache License 2.0 Updated Feb 25, 2023
homework_fall2022 Public
Forked from berkeleydeeprlcourse/homework_fall2022

Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)

Jupyter Notebook Updated Feb 2, 2023
DI-engine-docs Public
Forked from opendilab/DI-engine-docs

DI-engine docs (Chinese and English)

Python Apache License 2.0 Updated Dec 27, 2022
rst_test Public

To test the display of ReStructuredText file

Updated Dec 22, 2022
MARLlib Public
Forked from Replicable-MARL/MARLlib

The MARL extension for RLlib. A benchmark for research and industry.

Python MIT License Updated Dec 10, 2022
drqv2 Public
Forked from facebookresearch/drqv2

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Python MIT License Updated Dec 5, 2022
DA-in-visualRL Public
Forked from Guozheng-Ma/DA-in-visualRL

Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).

Updated Nov 29, 2022
off-policy Public
Forked from marlbenchmark/off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Python MIT License Updated Nov 29, 2022
on-policy Public
Forked from marlbenchmark/on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python MIT License Updated Nov 22, 2022
cleanrl Public
Forked from vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python Other Updated Nov 19, 2022
s2client-proto Public
Forked from Blizzard/s2client-proto

StarCraft II Client - protocol definitions used to communicate with StarCraft II.

Python MIT License Updated Nov 16, 2022
Mask-based-Latent-Reconstruction Public
Forked from microsoft/Mask-based-Latent-Reconstruction

This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).

Python MIT License Updated Nov 1, 2022
adventure Public

Python Apache License 2.0 Updated Oct 25, 2022
pymarl2 Public
Forked from hijkzzz/pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python Apache License 2.0 Updated Oct 22, 2022
MoTIF Public
Forked from aburns4/MoTIF

Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments

Jupyter Notebook Updated Oct 20, 2022
