zerlinwang

Starred repositories

Jax Codebase for Evolutionary Strategies at the Hyperscale

Python 198 15 Updated Nov 20, 2025

Official Repository of Absolute Zero Reasoner

Python 1,779 291 Updated Aug 24, 2025

Official implementation of "DiscoBench: An Open-Ended Benchmark For Algorithm Discovery"

Python 19 1 Updated Dec 10, 2025

Network Analysis in Python

Python 16,441 3,448 Updated Dec 20, 2025

Manipulation and analysis of geometric objects

Python 4,329 603 Updated Dec 16, 2025

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

Python 1,746 114 Updated Feb 18, 2025

Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"

Python 15 Updated Oct 8, 2025

Statically-linked, hermetic, relocatable Zsh

Shell 372 23 Updated Jul 27, 2023

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,390 12,166 Updated Dec 21, 2025

Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"

Python 23 1 Updated Sep 7, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,443 1,995 Updated Nov 1, 2025

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Jupyter Notebook 3,047 659 Updated Oct 31, 2025
Python 12 1 Updated Aug 30, 2025

Implementation of Fully Sharded Data Parallelism in Jax

Python 1 Updated Jun 12, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,089 333 Updated Dec 20, 2025

Active reward modeling with last layer Fisher Information (ICML'25)

Python 7 Updated Jul 9, 2025

[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Python 39 1 Updated Jun 5, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,126 62 Updated Oct 13, 2025

RSS 2023: This repository contains code for the paper Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.

Python 105 11 Updated May 10, 2023

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,980 204 Updated Dec 4, 2025

Eclipse SUMO is an open source, highly portable, microscopic and continuous traffic simulation package designed to handle large networks. It allows for intermodal simulation including pedestrians a…

C++ 3,819 1,677 Updated Dec 20, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,179 120 Updated Nov 9, 2025

🤗 R1-AQA Model: mispeech/r1-aqa

Python 309 27 Updated Mar 28, 2025
Jupyter Notebook 6 Updated Apr 4, 2025

Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR Oral 2025.

Python 166 15 Updated Apr 4, 2025

calflops is designed to calculate FLOPs, MACs, and parameters for a wide range of neural networks, such as Linear, CNN, RNN, GCN, and Transformer (BERT, LLaMA, and other large language models)

Python 905 38 Updated Jun 27, 2024

Benchmark for studying the imitation gap when training autonomous driving policies from human demonstrations

Jupyter Notebook 20 Updated Dec 8, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,761 104 Updated Nov 4, 2025