minesh1291
🚖 On The Journey to Neverland

Highlights

  • Pro

Organizations

@DiSCoBGU @Front-end-for-Data-Science @Deep-Learning-aided-Drug-Designing @datAIsmServices

Stars

🏋️‍♂️ Reinforcement Learning

54 repositories

Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Python 608 140 Updated Jun 4, 2022

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,555 926 Updated Jul 8, 2025

Evotorch is a neuro-evolution library written in Python that makes use of the PyTorch library formalism. It allows the evolution of multilayer and convolutional networks. This project was conceived…

Python 2 Updated Dec 11, 2020

Distributed Evolutionary Algorithms in Python

Python 6,290 1,159 Updated Nov 16, 2025

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer (https://arxiv.org/abs/2008.02387) from NNAISENSE.

Python 73 5 Updated Dec 10, 2020

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python 2,273 413 Updated Jul 9, 2024

A collection of Pyomo examples

Jupyter Notebook 311 161 Updated Mar 11, 2025

Python implementation of CMA-ES

Jupyter Notebook 1,264 191 Updated Nov 30, 2025
Jupyter Notebook 928 109 Updated Jun 27, 2024

Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,870 681 Updated Oct 11, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,871 8,715 Updated Oct 11, 2024

The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)

Python 2,358 489 Updated Mar 14, 2024
Python 1 3 Updated May 19, 2020

Season 3 of @twosigma's artificial intelligence programming challenge

WebAssembly 198 113 Updated Dec 6, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 12,348 2,017 Updated Dec 17, 2025

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,288 162 Updated Aug 3, 2023

An elegant PyTorch offline reinforcement learning library for researchers.

Python 373 42 Updated Jul 11, 2025

Using Deep Reinforcement Learning (associated with Deep Learning) to control a swarm of drones for a dynamic area maximization problem

Jupyter Notebook 4 Updated Apr 20, 2023

Mastering Diverse Domains through World Models

Python 2,541 426 Updated Sep 23, 2025

Efficient Batched Reinforcement Learning in TensorFlow

Python 972 148 Updated Jan 11, 2019

[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"

Python 57 7 Updated Apr 6, 2023

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

Python 79 6 Updated Aug 14, 2022

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Python 143 23 Updated May 6, 2024

Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.

Python 110 12 Updated May 12, 2023

Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets

Python 130 13 Updated Nov 21, 2024

A collection of offline reinforcement learning algorithms.

Python 207 27 Updated Nov 26, 2024

Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for MuJoCo control tasks in OpenAI Gym

Python 283 29 Updated Jun 10, 2022

Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing

Python 86 17 Updated May 10, 2023