Skip to content
View kinalmehta's full-sized avatar

Block or report kinalmehta

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
127 stars written in Python
Clear filter

A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL

Python 413 58 Updated Dec 27, 2022

Repo for reproduction of sequential social dilemmas

Python 406 137 Updated Mar 6, 2025

CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control

Python 397 29 Updated May 2, 2025

Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.

Python 362 36 Updated Mar 16, 2023

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Python 357 80 Updated Apr 1, 2019

Real-World RL Benchmark Suite

Python 356 28 Updated Aug 11, 2020
Python 355 57 Updated Oct 12, 2022

Agent Learning Framework https://alf.readthedocs.io

Python 352 58 Updated Nov 6, 2025

Lightweight Nearest Neighbors with Flexible Backends

Python 312 10 Updated Oct 5, 2025

Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"

Python 310 80 Updated Apr 13, 2023

A library for ready-made reinforcement learning agents and reusable components for neat prototyping

Python 299 63 Updated Feb 13, 2024

Multi Task RL Baselines

Python 255 28 Updated Dec 31, 2021

Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch

Python 252 10 Updated Sep 1, 2022

[ICLR-2025] POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can…

Python 250 29 Updated Aug 28, 2025

Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks

Python 219 52 Updated Oct 3, 2023
Python 218 57 Updated Jun 4, 2023

Load tensorboard event logs as pandas DataFrames for scientific plotting; Supports both PyTorch and TensorFlow

Python 205 3 Updated Aug 16, 2024

Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)

Python 197 38 Updated Mar 15, 2023

A library that makes Evolutionary Strategies (ES) simple to use.

Python 180 14 Updated Apr 14, 2021

Official code for ICML 2022: Mitigating Neural Network Overconfidence with Logit Normalization

Python 154 14 Updated Jul 5, 2022

My solution to the Unity Obstacle Tower Challenge

Python 136 8 Updated May 23, 2021

This project downloads and stores the daily SBI forex rates in a CSV file enabling you to access historical rates, easily.

Python 123 29 Updated Nov 7, 2025

JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"

Python 102 7 Updated May 17, 2022
Python 101 9 Updated Feb 14, 2024

Reinforcement learning library in JAX.

Python 100 3 Updated Oct 22, 2023

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

Python 90 11 Updated Nov 21, 2023

Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

Python 83 18 Updated Dec 8, 2022

Chat with PDF using Llama 3.3

Python 77 13 Updated Dec 8, 2024

Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)

Python 73 10 Updated Dec 14, 2024