Lists (1)
Sort Name ascending (A-Z)
Starred repositories
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Gin provides a lightweight configuration framework for Python
Python Multi-Agent Reinforcement Learning framework
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Optax is a gradient processing and optimization library for JAX.
Official Repo for Open-Reasoner-Zero
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
This is the official implementation of Multi-Agent PPO (MAPPO).
A Platform for Many-Agent Reinforcement Learning
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
A collection of reference environments for offline reinforcement learning
An offline deep reinforcement learning library
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
📈 Personae is a repo of implements and environment of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading.
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Latest Advances on System-2 Reasoning
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
MetaDrive: Lightweight driving simulator for everyone
A JAX-based simulator for autonomous driving research.