zerlinwang

Follow

😃

Say hello

Zilin Wang zerlinwang

😃

Say hello

Follow

Reinforcement Learning. CS PhD@Oxford

52 followers · 104 following

Oxford
zerlinwang.github.io

Achievements

Achievements

Lists (1)

Sort

MARL

Starred repositories

Thinklab-SJTU / Bench2Drive

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

Python 1,682 104 Updated Feb 18, 2025

MasterXiong / Hyper-VLA

Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"

Python 9 Updated Oct 8, 2025

romkatv / zsh-bin

Statically-linked, hermetic, relocatable Zsh

Shell 365 23 Updated Jul 27, 2023

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,940 11,503 Updated Nov 3, 2025

AlexGoldie / learn-rl-algorithms

Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"

Python 23 1 Updated Sep 7, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,106 1,902 Updated Nov 1, 2025

phlippe / uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Jupyter Notebook 3,006 650 Updated Oct 31, 2025

bsarkar321 / jaxrwkv

Python 7 1 Updated Aug 30, 2025

bsarkar321 / purejaxfsdp

Implementation of Fully Sharded Data Parallelism in Jax

Python 1 Updated Jun 12, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,743 291 Updated Nov 3, 2025

YunyiShen / ARM-FI

Active reward modeling with last layer Fisher Information (ICML'25)

Python 7 Updated Jul 9, 2025

lilucse / SparseNetwork4DRL

[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Python 39 1 Updated Jun 5, 2025

PRIME-RL / SimpleVLA-RL

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 949 45 Updated Oct 13, 2025

Letian-Wang / asaprl

RSS 2023: This repository contains code for the paper Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.

Python 103 10 Updated May 10, 2023

Paper2Poster / Paper2Poster

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,815 187 Updated Nov 3, 2025

eclipse-sumo / sumo

Eclipse SUMO is an open source, highly portable, microscopic and continuous traffic simulation package designed to handle large networks. It allows for intermodal simulation including pedestrians a…

C++ 3,756 1,644 Updated Nov 5, 2025

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

1,988 111 Updated Nov 5, 2025

xiaomi-research / r1-aqa

🤗 R1-AQA Model: mispeech/r1-aqa

Python 306 26 Updated Mar 28, 2025

Vicky-0256 / DEPfold

Jupyter Notebook 6 1 Updated Apr 4, 2025

NVlabs / catk

Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR Oral 2025.

Python 157 14 Updated Apr 4, 2025

Reytuag / transformerXL_PPO_JAX

Python 86 5 Updated Nov 3, 2024

MrYxJ / calculate-flops.pytorch

The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)

Python 891 36 Updated Jun 27, 2024

clemgris / IGDrivSim

Benchmark for studying the imitation gap when training autonomous driving policies from human demonstrations

Jupyter Notebook 20 Updated Apr 14, 2025

yifan123 / flow_grpo

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,544 83 Updated Nov 4, 2025

BytedanceSpeech / seed-tts-eval

Python 1,455 132 Updated Jun 14, 2024

ylacombe / finetune-hf-vits

Finetune VITS and MMS using HuggingFace's tools

Python 172 65 Updated Mar 31, 2024

SafeRoboticsLab / VBD

VBD: Versatile Behavior Diffusion for Generalized Traffic Agent Simulation

Jupyter Notebook 83 9 Updated Jan 2, 2025

XiaomiMiMo / MiMo

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,606 68 Updated Jun 5, 2025

ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,102 602 Updated Oct 27, 2023

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,258 1,757 Updated Oct 13, 2025

Starred topics

Awesome Lists