ucb

Here are 27 public repositories matching this topic...

annieyan / Bandits-using-UCB-algorithm

Thompson Sampling for Bandits using UCB policy

reinforcement-learning thompson-sampling ucb bandits

Updated Jul 29, 2017
Python

akshaykhadse / reinforcement-learning

Implementations of basic concepts under the reinforcement learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

LittleWat / hyper-parameter-optimization-by-GMRF-GPUCB

R.I.T project

python3 ucb gaussian-processes gmrf markov-random-field gp

Updated Jul 29, 2019
Python

Suchetaaa / CS747-Assignments

Foundations Of Intelligent Learning Agents (FILA) Assignments

reinforcement-learning monte-carlo linear-programming thompson-sampling ucb bootstrapping multi-armed-bandits bellman-equation temporal-differencing-learning howards-pi sarsa-learning kl-ucb windy-gridworld intelligent-learning-agents

Updated Nov 8, 2019
Python

rudrajit1729 / Machine-Learning-Codes-And-Templates

Codes and templates for ML algorithms created, modified and optimized in Python and R.

feature-selection datascience feature-extraction thompson-sampling dimensionality-reduction ucb ann regression-models nlp-machine-learning kmeans-clustering apriori-algorithm hierarchical-clustering classification-algorithims parameter-tuning regression-algorithms xgboost-model kfold-cross-validation cnn-classification eclat-algorithm

Updated Mar 28, 2020
Python

salimandre / Monte-Carlo-Tree-Search

We implemented Monte Carlo Tree Search (MCTS) from scratch and successfully applied it to the game of Tic-Tac-Toe.

reinforcement-learning graphics mcts ucb monte-carlo-tree-search tic-tac-toe-game upper-confidence-bound

Updated Jul 9, 2020
Python

salimandre / Monte-Carlo-Tree-Search-for-checkers-game

We compare different policies for the checkers game using reinforcement learning algorithms.

python reinforcement-learning turtle-graphics ucb monte-carlo-tree-search checkers-game upper-confidence-bound mcts-algorithm

Updated Aug 24, 2020
Python

educup / ucb-python-api

Python package for the Unity Cloud Build API

python api unity poetry unity3d python3 ucb typer python-package unity-tool unity-cloud-build ucb-api

Updated Sep 12, 2020
Python

czahie / CS61A

Structure and Interpretation of Computer Programs

python scheme data-structure sqlite ucb

Updated Sep 12, 2020
Python

Murtazali05 / Multi-armed-bandit

Multi Armed Bandits implementation using the Jester Dataset

thompson-sampling ucb multi-armed-bandits e-greedy

Updated Apr 5, 2021
Python

ishank-juneja / Correlated-AoI-Bandits

Author's implementation of the paper Correlated Age-of-Information Bandits.

thompson-sampling ucb multi-armed-bandit aoi age-of-information correlated-multi-armed-bandits correlated-arms aoi-regret

Updated Jun 19, 2021
Python

paramrathour / Intelligent-and-Learning-Agents

My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22

linear-programming thompson-sampling epsilon-greedy mountain-car sarsa ucb markov-decision-processes multi-armed-bandit policy-iteration value-iteration tile-coding kl-ucb policy-control

Updated Apr 17, 2022
Python

alison-carrera / mabalgs

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

arm algorithm reinforcement-learning simulation monte-carlo rank thompson-sampling reinforcement-learning-algorithms ucb reward multi-armed-bandit montecarlo-simulation contextual-bandits ranking-algorithm mab ranked-mab

Updated Sep 6, 2022
Python

MaxenceGiraud / ucb-nonstationary

On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems

ucb multi-armed-bandits non-stationary-bandit discounted-ucb sliding-ucb

Updated Oct 7, 2022
Python

sarthakmittal92 / multi-armed-bandits

Repository for the course project for CS-747 (Foundations of Intelligent & Learning Agents) at IIT Bombay, Autumn 2022.

python thompson-sampling reinforcement-learning-algorithms ucb multi-armed-bandits bandits kl-ucb

Updated Oct 14, 2022
Python

idanmoradarthas / MutiArmedBandit-DeepLearning

Multi-armed bandit algorithm with TensorFlow and 11 policies

tensorflow deep-reinforcement-learning python3 ucb multi-armed-bandit epsilon softmax

Updated Dec 27, 2022
Python

amaitammar / Hex-Game

Python implementation of the Hex game with AI based on MC and MCTS methods. Interactive mode with pygame.

game python hex reinforcement-learning ai ucb

Updated Mar 11, 2023
Python

JoelJa835 / MAB_Algorithms

Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.

reinforcement-learning-algorithms ucb bandits mab e-greedy

Updated Mar 26, 2023
Python
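
The multi-armed bandit setting described in the MAB_Algorithms entry (an agent choosing among arms with unknown reward distributions) can be illustrated with a minimal sketch of the two policies it names. This is not code from that repository: the function names, the Bernoulli reward model, and the default `epsilon` and `seed` values are assumptions made for illustration, and the UCB rule shown is the common UCB1 variant.

```python
import math
import random


def ucb1_select(counts, sums, t):
    """Pick an arm by the UCB1 rule: empirical mean plus an exploration bonus."""
    # Play every arm once first, since the bonus is undefined for zero pulls.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    scores = [
        sums[a] / counts[a] + math.sqrt(2.0 * math.log(t) / counts[a])
        for a in range(len(counts))
    ]
    return max(range(len(counts)), key=scores.__getitem__)


def epsilon_greedy_select(counts, sums, epsilon, rng):
    """Explore uniformly with probability epsilon; otherwise exploit the best mean."""
    if rng.random() < epsilon:
        return rng.randrange(len(counts))
    means = [sums[a] / counts[a] if counts[a] else 0.0 for a in range(len(counts))]
    return max(range(len(counts)), key=means.__getitem__)


def run_bandit(true_means, horizon, policy, epsilon=0.1, seed=0):
    """Simulate Bernoulli arms (an assumed test bed) and return pulls per arm."""
    rng = random.Random(seed)
    counts = [0] * len(true_means)
    sums = [0.0] * len(true_means)
    for t in range(1, horizon + 1):
        if policy == "ucb1":
            arm = ucb1_select(counts, sums, t)
        else:
            arm = epsilon_greedy_select(counts, sums, epsilon, rng)
        # Reward is 1 with the arm's (hidden) success probability, else 0.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
    return counts
```

Calling `run_bandit([0.2, 0.8], 1000, "ucb1")` returns a list of pull counts per arm. UCB1 needs no tuning parameter because its bonus shrinks as an arm is pulled more often, whereas epsilon-greedy keeps exploring at a fixed rate set by `epsilon`.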

SarCode / ML-Code-Tutorials-Udemy

Complete Tutorial Guide with Code for learning ML

natural-language-processing random-forest svm scikit-learn artificial-neural-networks logistic-regression ucb polynomial-regression kmeans-clustering knearest-neighbor-algorithm apriori-algorithm classification-methods svr kernel-svm kernel-pca heirarchical-clustering decison-trees

Updated Apr 21, 2023
Python

JoelJa835 / Least-Loaded-Server

reinforcement-learning-algorithms ucb multiplicative-weights

Updated Apr 26, 2023
Python

