zerlinwang

😃

Say hello

Zilin Wang zerlinwang

😃

Say hello

Reinforcement Learning. CS PhD@Oxford

52 followers · 104 following

Oxford
zerlinwang.github.io

Achievements

Lists (1)

Sort

MARL

1 repository

Starred repositories

35 stars written in Jupyter Notebook

Clear filter

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 78,026 11,522 Updated Nov 6, 2025

aymericdamien / TensorFlow-Examples

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

Jupyter Notebook 43,754 14,843 Updated Jul 26, 2024

google-research / google-research

Google Research

Jupyter Notebook 36,671 8,232 Updated Oct 30, 2025

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,684 6,165 Updated Jul 13, 2023

NLP-LOVE / ML-NLP

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现，也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 17,279 4,643 Updated Jun 21, 2022

AI4Finance-Foundation / FinRL

FinRL®: Financial Reinforcement Learning. 🔥

Jupyter Notebook 13,066 3,004 Updated Oct 13, 2025

datawhalechina / easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 12,928 2,148 Updated Sep 6, 2025

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,666 1,711 Updated Apr 26, 2025

google / dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,817 1,390 Updated Nov 4, 2024

cloneofsimo / lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,466 493 Updated Mar 22, 2024

google / flax

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,897 753 Updated Nov 6, 2025

lixin4ever / Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Jupyter Notebook 4,655 312 Updated Sep 23, 2025

boyu-ai / Hands-on-RL

https://hrl.boyuai.com/

Jupyter Notebook 4,136 759 Updated Nov 22, 2022

phlippe / uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Jupyter Notebook 3,007 650 Updated Oct 31, 2025

google / brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,915 316 Updated Oct 30, 2025

TradeMaster-NTU / TradeMaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Jupyter Notebook 2,193 428 Updated Jun 4, 2025

Curt-Park / rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Jupyter Notebook 1,993 349 Updated Sep 26, 2025

datawhalechina / statistical-learning-method-solutions-manual

机器学习方法习题解答，在线阅读地址：https://datawhalechina.github.io/statistical-learning-method-solutions-manual

Jupyter Notebook 1,960 245 Updated Sep 9, 2025

datawhalechina / team-learning-data-mining

主要存储Datawhale组队学习中“数据挖掘/机器学习”方向的资料。

Jupyter Notebook 1,780 821 Updated Mar 16, 2022

google-deepmind / open_x_embodiment

Jupyter Notebook 1,486 98 Updated Nov 5, 2025

MrSyee / pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Jupyter Notebook 968 127 Updated May 30, 2025

ikostrikov / jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Jupyter Notebook 727 72 Updated Oct 26, 2022

Emerge-Lab / gpudrive

1 million FPS multi-agent driving simulator

Jupyter Notebook 539 71 Updated Oct 2, 2025

denisyarats / drq

DrQ: Data regularized Q

Jupyter Notebook 417 54 Updated Jan 13, 2023

r9y9 / tacotron_pytorch

PyTorch implementation of Tacotron speech synthesis model.

Jupyter Notebook 308 79 Updated Jul 12, 2019

decisionforce / CoPO

[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".

Jupyter Notebook 131 21 Updated Jan 29, 2024

SonyResearch / simba

Jupyter Notebook 109 6 Updated Feb 25, 2025

SafeRoboticsLab / VBD

VBD: Versatile Behavior Diffusion for Generalized Traffic Agent Simulation

Jupyter Notebook 83 9 Updated Jan 2, 2025

DAVIAN-Robotics / SimbaV2

Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"

Jupyter Notebook 75 2 Updated Nov 4, 2025

aburns4 / MoTIF

Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments

Jupyter Notebook 60 3 Updated Aug 19, 2024

Zilin Wang zerlinwang

Lists (1)

MARL

Starred repositories

Awesome Lists