Minesh A. Jethva minesh1291

🚖

On The Journey to Neverland

Kaggle 3x Expert, Data Scientist focusing on deep sequence modeling for TimeSeries, Computer Vision and NLP

236 followers · 3k following

🏋️‍♂️ Reinforcement Learning

54 repositories

archsyscall / DeepRL-TensorFlow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Python · 608 stars · 140 forks · Updated Jun 4, 2022

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python · 8,555 stars · 926 forks · Updated Jul 8, 2025

belzu / Evotorch

Evotorch is a neuro-evolution library written in Python that makes use of the Pytorch library formalism. It allows the evolution of multilayer and convolutional networks. This project was conceived…

Python · 2 stars · Updated Dec 11, 2020

DEAP / deap

Distributed Evolutionary Algorithms in Python

Python · 6,290 stars · 1,159 forks · Updated Nov 16, 2025

nnaisense / pgpelib

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer (https://arxiv.org/abs/2008.02387) from NNAISENSE.

Python · 73 stars · 5 forks · Updated Dec 10, 2020

nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python · 2,273 stars · 413 forks · Updated Jul 9, 2024

Pyomo / pyomo-gallery

A collection of Pyomo examples

Jupyter Notebook · 311 stars · 161 forks · Updated Mar 11, 2025

MarkoMlakar / sklearn-neuro-evolution

Python · 8 stars · Updated Nov 9, 2020

CMA-ES / pycma

Python implementation of CMA-ES

Jupyter Notebook · 1,264 stars · 191 forks · Updated Nov 30, 2025

google / evojax

Jupyter Notebook · 928 stars · 109 forks · Updated Jun 27, 2024

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python · 7,870 stars · 681 forks · Updated Oct 11, 2025

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python · 36,871 stars · 8,715 forks · Updated Oct 11, 2024

AminHP / gym-anytrading

The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)

Python · 2,358 stars · 489 forks · Updated Mar 14, 2024

hsperr / halite-3-reinforcment

Python · 1 star · 3 forks · Updated May 19, 2020

HaliteChallenge / Halite-III

Season 3 of @twosigma's artificial intelligence programming challenge

WebAssembly · 198 stars · 113 forks · Updated Dec 6, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python · 12,348 stars · 2,017 forks · Updated Dec 17, 2025

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python · 1,288 stars · 162 forks · Updated Aug 3, 2023

yihaosun1124 / OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

Python · 373 stars · 42 forks · Updated Jul 11, 2025

swap-10 / Dynamic-Drone-Swarm-DeepRL

Using Deep Reinforcement Learning (associated with Deep Learning) to control a swarm of drones for dynamic area maximization problem

Jupyter Notebook · 4 stars · Updated Apr 20, 2023

danijar / dreamerv3

Mastering Diverse Domains through World Models

Python · 2,541 stars · 426 forks · Updated Sep 23, 2025

google-research / batch-ppo

Efficient Batched Reinforcement Learning in TensorFlow

Python · 972 stars · 148 forks · Updated Jan 11, 2019

danijar / awesome-rl-envs

8 stars · 2 forks · Updated Aug 23, 2022

ryanxhr / POR

[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"

Python · 57 stars · 7 forks · Updated Apr 6, 2023

snu-mllab / EDAC

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

Python · 79 stars · 6 forks · Updated Aug 14, 2022

BY571 / CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Python · 143 stars · 23 forks · Updated May 6, 2024

ZhengyaoJiang / latentplan

Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.

Python · 110 stars · 12 forks · Updated May 12, 2023

polixir / NeoRL

Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets

Python · 130 stars · 13 forks · Updated Nov 21, 2024

polixir / OfflineRL

A collection of offline reinforcement learning algorithms.

Python · 207 stars · 27 forks · Updated Nov 26, 2024

nikhilbarhate99 / min-decision-transformer

Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

Python · 283 stars · 29 forks · Updated Jun 10, 2022

CPS-TUWien / racing_dreamer

Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing

Python · 86 stars · 17 forks · Updated May 10, 2023
