zerlinwang

Starred repositories

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

Python 1,682 104 Updated Feb 18, 2025

Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"

Python 9 Updated Oct 8, 2025

Statically-linked, hermetic, relocatable Zsh

Shell 365 23 Updated Jul 27, 2023

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,940 11,503 Updated Nov 3, 2025

Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"

Python 23 1 Updated Sep 7, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,106 1,902 Updated Nov 1, 2025

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Jupyter Notebook 3,006 650 Updated Oct 31, 2025

Python 7 1 Updated Aug 30, 2025

Implementation of Fully Sharded Data Parallelism in Jax

Python 1 Updated Jun 12, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,743 291 Updated Nov 3, 2025

Active reward modeling with last layer Fisher Information (ICML'25)

Python 7 Updated Jul 9, 2025

[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Python 39 1 Updated Jun 5, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 949 45 Updated Oct 13, 2025

RSS 2023: This repository contains code for the paper Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.

Python 103 10 Updated May 10, 2023

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,815 187 Updated Nov 3, 2025

Eclipse SUMO is an open source, highly portable, microscopic and continuous traffic simulation package designed to handle large networks. It allows for intermodal simulation including pedestrians a…

C++ 3,756 1,644 Updated Nov 5, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

1,988 111 Updated Nov 5, 2025

🤗 R1-AQA Model: mispeech/r1-aqa

Python 306 26 Updated Mar 28, 2025

Jupyter Notebook 6 1 Updated Apr 4, 2025

Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR Oral 2025.

Python 157 14 Updated Apr 4, 2025

calflops is designed to calculate FLOPs, MACs, and parameters in various neural networks, such as Linear, CNN, RNN, GCN, and Transformer (BERT, LLaMA, and other large language models)

Python 891 36 Updated Jun 27, 2024

Benchmark for studying the imitation gap when training autonomous driving policies from human demonstrations

Jupyter Notebook 20 Updated Apr 14, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,544 83 Updated Nov 4, 2025

Finetune VITS and MMS using HuggingFace's tools

Python 172 65 Updated Mar 31, 2024

VBD: Versatile Behavior Diffusion for Generalized Traffic Agent Simulation

Jupyter Notebook 83 9 Updated Jan 2, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,606 68 Updated Jun 5, 2025

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,102 602 Updated Oct 27, 2023

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,258 1,757 Updated Oct 13, 2025