[ICLR-2025] POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can…

Python 250 29 Updated Aug 28, 2025

IC3Net / IC3Net

Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks

Python 219 52 Updated Oct 3, 2023

cyanrain7 / TRPO-in-MARL

Python 218 57 Updated Jun 4, 2023

j3soon / tbparse

Load tensorboard event logs as pandas DataFrames for scientific plotting; Supports both PyTorch and TensorFlow

Python 205 3 Updated Aug 16, 2024

Farama-Foundation / D4RL-Evaluations

Python 203 29 Updated Mar 25, 2023

lmzintgraf / varibad

Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)

Python 197 38 Updated Mar 15, 2023

rasmusbergpalm / evostrat

A library that makes Evolutionary Strategies (ES) simple to use.

Python 180 14 Updated Apr 14, 2021

hongxin001 / logitnorm_ood

Official code for ICML 2022: Mitigating Neural Network Overconfidence with Logit Normalization

Python 154 14 Updated Jul 5, 2022

unixpickle / obs-tower2

My solution to the Unity Obstacle Tower Challenge

Python 136 8 Updated May 23, 2021

sahilgupta / sbi-fx-ratekeeper

This project downloads and stores the daily SBI forex rates in a CSV file enabling you to access historical rates, easily.

Python 123 29 Updated Nov 7, 2025

evgenii-nikishin / rl_with_resets

JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"

Python 102 7 Updated May 17, 2022

chandar-lab / RLHive

Python 101 9 Updated Feb 14, 2024

ethanluoyc / magi

Reinforcement learning library in JAX.

Python 100 3 Updated Oct 22, 2023

zhihanyang2022 / off-policy-continuous-control

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

Python 90 11 Updated Nov 21, 2023

TonghanWang / NDQ

Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

Python 83 18 Updated Dec 8, 2022

Sumanth077 / chat_with_pdf

Chat with PDF using Llama 3.3

Python 77 13 Updated Dec 8, 2024

ymd-h / cpprb

Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)

Python 73 10 Updated Dec 14, 2024

Previous Next

Kinal Mehta kinalmehta

Lists (8)

3dPose

Chess softwares

Diffusion Models

Interview Prep

LLM stuff

Productivity and opensource apps

Softwares

successor representations

Stars