Lists (8)
Sort Name ascending (A-Z)
Stars
Numerical differential equation solvers in JAX. Autodifferentiable and GPU-capable. https://docs.kidger.site/diffrax/
This is the official implementation of Multi-Agent PPO (MAPPO).
Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
High-quality implementations of standard and SOTA methods on a variety of tasks.
An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Riemannian Adaptive Optimization Methods with pytorch optim
Pytorch implementation of MixNMatch
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other scientific publications.
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
A curated list of Monte Carlo tree search papers with implementations.
Dream to Control: Learning Behaviors by Latent Imagination
[COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning
The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.
[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)
Official PyTorch implementation of "Joint Object Detection and Multi-Object Tracking with Graph Neural Networks"
a Lightweight library for sequential learning agents, including reinforcement learning
An implementation of the Augmented Random Search algorithm
Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
A every-so-often-updated collection of every causality + machine learning paper submitted to arXiv in the recent past.