kinalmehta: 127 stars written in Python

Numerical differential equation solvers in JAX. Autodifferentiable and GPU-capable. https://docs.kidger.site/diffrax/

Python 1,815 161 Updated Oct 3, 2025

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,749 349 Updated Jul 18, 2024

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,625 80 Updated Oct 3, 2025

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,574 156 Updated Sep 3, 2025

High-quality implementations of standard and SOTA methods on a variety of tasks.

Python 1,545 215 Updated Nov 6, 2025
Python 1,451 120 Updated Feb 15, 2025

An open-source package that gives video game creators, AI researchers, and hobbyists the opportunity to learn complex behaviors for their non-player characters or agents.

Python 1,286 96 Updated Aug 27, 2025

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Python 1,197 224 Updated Aug 10, 2021

Library for Model Based RL

Python 1,027 170 Updated Jul 12, 2024

Riemannian Adaptive Optimization Methods with PyTorch optim

Python 999 91 Updated Aug 4, 2025

PyTorch implementation of MixNMatch

Python 974 190 Updated Jul 7, 2020

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Python 914 142 Updated Dec 20, 2023

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Python 802 69 Updated Jun 8, 2025

Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other scientific publications.

Python 714 30 Updated Jul 14, 2025

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 691 132 Updated May 18, 2024

A curated list of Monte Carlo tree search papers with implementations.

Python 685 76 Updated Mar 16, 2024

Dream to Control: Learning Behaviors by Latent Imagination

Python 679 79 Updated Jul 14, 2020

[COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning

Python 666 83 Updated Oct 12, 2025

The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.

Python 659 59 Updated Jun 10, 2025

[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Python 561 71 Updated Mar 15, 2024

Gaussian processes in JAX and Flax.

Python 549 69 Updated Oct 30, 2025

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Python 507 78 Updated Jul 21, 2023

Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)

Python 494 82 Updated May 1, 2025

Official PyTorch implementation of "Joint Object Detection and Multi-Object Tracking with Graph Neural Networks"

Python 466 74 Updated Mar 6, 2022

A lightweight library for sequential learning agents, including reinforcement learning

Python 431 41 Updated Mar 22, 2023

An implementation of the Augmented Random Search algorithm

Python 425 104 Updated Sep 29, 2021

Reinforcement learning library (framework) designed for PyTorch; implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Python 419 51 Updated Aug 8, 2021

Tonic RL library

Python 418 49 Updated Jul 24, 2024

An every-so-often-updated collection of every causality + machine learning paper submitted to arXiv in the recent past.

Python 415 50 Updated Sep 10, 2020