Skip to content
View zerlinwang's full-sized avatar
😃
Say hello
😃
Say hello

Block or report zerlinwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

210 stars written in Python
Clear filter

Really Fast End-to-End Jax RL Implementations

Python 969 81 Updated Sep 9, 2024

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 957 48 Updated Oct 13, 2025

The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)

Python 893 36 Updated Jun 27, 2024
Python 888 59 Updated Nov 5, 2025

RL Environments in JAX 🌍

Python 826 88 Updated May 30, 2025

An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES

Python 808 162 Updated Aug 1, 2023

Honor of Kings AI Open Environment of Tencent

Python 773 95 Updated Jul 17, 2024

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 771 93 Updated Nov 6, 2025

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Python 752 99 Updated Oct 23, 2025

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 691 132 Updated May 18, 2024

How to use wandb?

Python 682 55 Updated Sep 5, 2023

Evolution Strategies in JAX 🦎

Python 680 58 Updated Sep 20, 2025

Multi-Agent Reinforcement Learning with JAX

Python 679 127 Updated Nov 5, 2025

hanabi_learning_environment is a research platform for Hanabi experiments.

Python 654 163 Updated Feb 14, 2023

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 647 169 Updated Sep 24, 2024

Author's PyTorch implementation of BCQ for continuous and discrete actions

Python 643 143 Updated Apr 6, 2021

Decision Intelligence Platform for Autonomous Driving simulation.

Python 618 58 Updated Mar 13, 2025
Python 605 39 Updated Nov 1, 2025

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 603 37 Updated Feb 10, 2024

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Python 595 91 Updated Oct 28, 2020

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 579 157 Updated Aug 19, 2023

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 565 54 Updated Aug 10, 2025

Dream to Control: Learning Behaviors by Latent Imagination

Python 562 113 Updated Sep 10, 2021

Effortlessly add AI-generated transcription subtitles to your videos

Python 546 65 Updated Nov 13, 2024

♟️ Vectorized RL game environments in JAX

Python 539 37 Updated Mar 6, 2025

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Python 518 68 Updated Oct 6, 2022

Matplotlib中文教程,在线阅读地址:https://datawhalechina.github.io/fantastic-matplotlib/

Python 515 110 Updated Jul 31, 2022

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Python 507 78 Updated Jul 21, 2023

Pytorch implementation of the CREPE pitch tracker

Python 488 73 Updated May 16, 2025

Implementation of benchmark RL algorithms

Python 470 82 Updated Jul 20, 2022