exploration-exploitation

Here are 59 public repositories matching this topic...

haoyangzheng-ai / ts_ulmc

The GitHub repository for "Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo", AISTATS 2024.

monte-carlo thompson-sampling multi-armed-bandit langevin-dynamics exploration-exploitation

Updated Oct 19, 2024
Python

mohitpandey92 / k_arm_bandit

Star

A simple exercise in reinforcement learning

machine-learning reinforcement-learning exploration-exploitation

Updated Mar 24, 2022
Jupyter Notebook

ako-kamattechan / improved-robot

Star

Single-file demo of RELAX/CONSTRAIN control loop with frame shifting and Merkle-like lineage.

visualization javascript lineage single-file exploration-exploitation generative-systems

Updated Feb 26, 2026

siavashadpey / MultiArmedBandits

Star

reinforcement-learning active-learning bandit-algorithms exploration-exploitation

Updated Mar 27, 2022
Python

nithinrajkore / Reinforcement-Learning

Star

Upper Confidence Bound (UCB) multi-armed bandit algorithm for ad click-through rate optimization over 10,000 rounds

python reinforcement-learning jupyter-notebook ucb multi-armed-bandit exploration-exploitation

Updated Apr 2, 2026
Jupyter Notebook

JiahongXu123 / OSPO-algorithm

Star

OSPO is a novel metaheuristic algorithm which has the potential to solve different kinds of problems with promising performance.

global-optimization adaptive optimization-algorithms metaheuristics exploration-exploitation

Updated Aug 12, 2021

Alishafzd / NoisyQ

Star

NoisyQ is a noise-based exploration method for DDQN

reinforcement-learning ddqn exploration-exploitation

Updated Dec 16, 2023
Jupyter Notebook

SXV357 / Inspirit-AI-Deep-Dive-Designing-DL-Systems-FinalProject-RL-for-Autonomous-Vehicles

Star

This project uses Reinforcement Learning to teach an agent to drive by itself and learn from its observations so that it can maximize the reward(180+ lines)

reinforcement-learning q-learning epsilon-greedy loss-functions deep-q-learning exploration-exploitation

Updated Mar 21, 2026
Jupyter Notebook

Sagarnandeshwar / Bandit_Algorithms

Star

Reinforcement Learning (COMP 579) Project

reinforcement-learning thompson-sampling epsilon-greedy ucb bernoulli-distribution bandit-algorithms exploration-exploitation

Updated Aug 4, 2023
Jupyter Notebook

hashem20 / Active-Passive-Gap-in-Exploration

Star

Active versus Passive exploration

decision-making psychology active-learning exploration-exploitation

Updated Feb 3, 2019
MATLAB

rom1mouret / exploration

Star

over-parameterization = exploration ?

global-optimization gradient-descent hypernetworks exploration-exploitation over-parameterization

Updated Aug 23, 2020
Python

nagsujosh / contexto-solver-agent

Star

Uses GloVe embeddings + bandit-style exploration/exploitation with adaptive diversification, UCB-driven cluster search, and stagnation recovery.

ucb glove-embeddings bandit-algorithms exploration-exploitation

Updated Aug 29, 2025
Python

tomooda / HiDeHo

Star

the HiDeHo (HInts for Directing the Exploration from History of Operations) framework for Pharo

pharo creativity history-management exploration-exploitation

Updated Apr 30, 2025
Smalltalk

SaurabhJalendra / Treasure-Hunt-in-the-Frozen-Lake-and-Optimizing-Movie-Recommendations-Using-Multi-Armed-Bandits

Star

This repository contains two reinforcement learning projects: "Treasure Hunt in the Frozen Lake," which navigates a modified FrozenLake using dynamic programming, and "Optimizing Movie Recommendations," which employs Multi-Armed Bandits to enhance user satisfaction.

python data-science machine-learning reinforcement-learning jupyter-notebook openai-gym dynamic-programming multi-armed-bandits exploration-exploitation policy-improvement movie-recommedation

Updated Feb 27, 2025
Jupyter Notebook

pranav0904 / Reinforcement-Learning

Star

OpenAI, gym environment implementation

reinforcement-learning openai gym exploration-exploitation

Updated Nov 14, 2020
Jupyter Notebook

Amshra267 / Thompson-Greedy-Comparison-for-MultiArmed-Bandits

Star

Repository Containing Comparison of two methods for dealing with Exploration-Exploitation dilemma for MultiArmed Bandits

thompson-sampling epsilon-greedy exploration-exploitation optimistic-bayesian-sampling

Updated Jul 2, 2021
Python

ivotints / Learn2Slither

Star

A reinforcement learning project where a snake learns to navigate and survive in a dynamic environment through Q-learning.

reinforcement-learning neural-network tensorflow keras q-learning snake-game exploration-exploitation ai-agent

Updated Apr 16, 2025
Python

isabellahmann / active-inference-exploration

Star

A systematic parameter study of exploration–exploitation trade-offs in an Active Inference agent under varying precision and sensory noise.

uncertainty computational-neuroscience exploration-exploitation active-inference

Updated Jan 14, 2026
Python

03chrisk / RL-Bandits

Star

Investigating different exploration strategies and their hyperparameters on a 10-arm bandit testbed inspired by Reinforcement Learning: An Introduction (Sutton & Barto)

reinforcement-learning multi-armed-bandits exploration-exploitation

Updated Dec 13, 2024
Jupyter Notebook

baturaysaglam / DISCOVER

Star

Deep Intrinsically Motivated Exploration in Continuous Control

deep-reinforcement-learning actor-critic exploration-exploitation

Updated Mar 2, 2024
Python

Improve this page

Add a description, image, and links to the exploration-exploitation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the exploration-exploitation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

exploration-exploitation

Here are 59 public repositories matching this topic...

haoyangzheng-ai / ts_ulmc

mohitpandey92 / k_arm_bandit

ako-kamattechan / improved-robot

siavashadpey / MultiArmedBandits

nithinrajkore / Reinforcement-Learning

JiahongXu123 / OSPO-algorithm

Alishafzd / NoisyQ

SXV357 / Inspirit-AI-Deep-Dive-Designing-DL-Systems-FinalProject-RL-for-Autonomous-Vehicles

Sagarnandeshwar / Bandit_Algorithms

hashem20 / Active-Passive-Gap-in-Exploration

rom1mouret / exploration

nagsujosh / contexto-solver-agent

tomooda / HiDeHo

SaurabhJalendra / Treasure-Hunt-in-the-Frozen-Lake-and-Optimizing-Movie-Recommendations-Using-Multi-Armed-Bandits

pranav0904 / Reinforcement-Learning

Amshra267 / Thompson-Greedy-Comparison-for-MultiArmed-Bandits

ivotints / Learn2Slither

isabellahmann / active-inference-exploration

03chrisk / RL-Bandits

baturaysaglam / DISCOVER

Improve this page

Add this topic to your repo