Skip to content
View kinalmehta's full-sized avatar

Block or report kinalmehta

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
127 stars written in Python
Clear filter

Experiment. Plot. Tabulate.

Python 72 7 Updated Aug 22, 2024

MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957

Python 65 10 Updated Apr 30, 2021

Official Pytorch Implementation of our paper: Video Person Re-ID : Fantastic Techniques and Where to Find Them

Python 62 10 Updated May 8, 2023

Fast reinforcement learning research

Python 61 16 Updated Dec 7, 2024

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Python 60 15 Updated Jan 3, 2023

Code to reproduce the results for Compositional Attention

Python 59 6 Updated Nov 16, 2022

Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"

Python 47 10 Updated Oct 3, 2023

Code for A General Recipe for Likelihood-free Bayesian Optimization, ICML 2022

Python 45 2 Updated Jun 30, 2022

Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M frames. 🌈

Python 44 4 Updated Dec 11, 2021

IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation

Python 40 5 Updated Jul 18, 2025

Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"

Python 38 9 Updated Oct 5, 2022

Model-based Policy Gradients

Python 32 4 Updated Mar 12, 2020
Python 31 4 Updated Apr 25, 2021

A modular implementation of PPO, and soon hopefully other algorithms.

Python 26 2 Updated Jan 16, 2024

Sandbox environment for generalizable agent research

Python 25 7 Updated Aug 19, 2022

Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)

Python 23 1 Updated Jul 16, 2022

Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".

Python 22 3 Updated Nov 8, 2024

PyTorch Implementation of COPA for coordinating teams that can dynamically change.

Python 22 8 Updated Apr 16, 2022
Python 18 1 Updated Mar 30, 2023

Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)

Python 17 1 Updated Oct 23, 2021

Minimal Decision Transformer Implementation written in Jax (Flax).

Python 17 2 Updated Aug 8, 2022

Model-based reinforcement learning using CEM, MPC and PETS

Python 16 Updated Nov 20, 2019

A library containing a collection of distance and similarity measures for data analysis

Python 16 Updated Jul 31, 2025

Source code for paper: Efficient deep reinforcement learning via adaptive policy transfer

Python 15 6 Updated Aug 15, 2022

V-MPO torch version with DMLab30 and GTrXL

Python 13 1 Updated Mar 1, 2021

Reward Propagation using Graph Convolutional Networks

Python 13 12 Updated Jun 19, 2021

A simple black-box optimization framework to train your pytorch models for optimizing non-differentiable objectives

Python 11 3 Updated Mar 5, 2023