Lists (1)
Sort Name ascending (A-Z)
Starred repositories
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
Implementation of Fully Sharded Data Parallelism in Jax
🚀 Efficient implementations of state-of-the-art linear attention models
Active reward modeling with last layer Fisher Information (ICML'25)
[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
RSS 2023: This repository contains code for the paper Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
Eclipse SUMO is an open source, highly portable, microscopic and continuous traffic simulation package designed to handle large networks. It allows for intermodal simulation including pedestrians a…
A Survey of Reinforcement Learning for Large Reasoning Models
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR Oral 2025.
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
Benchmark for studying the imitation gap when training autonomous driving policies from human demonstrations
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
Finetune VITS and MMS using HuggingFace's tools
VBD: Versatile Behavior Diffusion for Generalized Traffic Agent Simulation
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.