# ucb

Here are 27 public repositories matching this topic...

czahie / CS61A

Structure and Interpretation of Computer Programs

python scheme data-structure sqlite ucb

Updated Sep 12, 2020
Python

alison-carrera / mabalgs

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

arm algorithm reinforcement-learning simulation monte-carlo rank thompson-sampling reinforcement-learning-algorithms ucb reward multi-armed-bandit montecarlo-simulation contextual-bandits ranking-algorithm mab ranked-mab

Updated Sep 6, 2022
Python

xuyanshi / cs61a-2022

CS 61A: Structure and Interpretation of Computer Programs, Fall 2022, UC Berkeley

study cs61a ucb ucberkeley

Updated Aug 15, 2023
Python

akshaykhadse / reinforcement-learning

Implementations of basic concepts covered under the reinforcement learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

rudrajit1729 / Machine-Learning-Codes-And-Templates

Code and templates for ML algorithms, created, modified, and optimized in Python and R.

feature-selection datascience feature-extraction thompson-sampling dimensionality-reduction ucb ann regression-models nlp-machine-learning kmeans-clustering apriori-algorithm hierarchical-clustering classification-algorithims parameter-tuning regression-algorithms xgboost-model kfold-cross-validation cnn-classification eclat-algorithm

Updated Mar 28, 2020
Python

annieyan / Bandits-using-UCB-algorithm

Thompson Sampling for Bandits using UCB policy

reinforcement-learning thompson-sampling ucb bandits

Updated Jul 29, 2017
Python

erdogant / thompson

Thompson is a Python package for evaluating the multi-armed bandit problem. In addition to Thompson sampling, the Upper Confidence Bound (UCB) algorithm and randomized results are also implemented.

python machine-learning reinforcement-learning genetic-algorithm bayesian ucb multi-armed-bandit thompson thompson-algorithm

Updated Apr 25, 2025
Python

csfive / CS61A

🚧 UCB CS61A Solutions

python cs61a sicp cs ucb

Updated Feb 1, 2026
Python

educup / ucb-python-api

Python package for the Unity Cloud Build API

python api unity poetry unity3d python3 ucb typer python-package unity-tool unity-cloud-build ucb-api

Updated Sep 12, 2020
Python

MaxenceGiraud / ucb-nonstationary

On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems

ucb multi-armed-bandits non-stationary-bandit discounted-ucb sliding-ucb

Updated Oct 7, 2022
Python

idanmoradarthas / MutiArmedBandit-DeepLearning

Multi-armed bandit algorithm with TensorFlow and 11 policies

tensorflow deep-reinforcement-learning python3 ucb multi-armed-bandit epsilon softmax

Updated Dec 27, 2022
Python

woctezuma / puissance4

AI for the game "Connect Four". Available on PyPI.

Updated Nov 20, 2025
Python

ishank-juneja / Correlated-AoI-Bandits

Author's implementation of the paper Correlated Age-of-Information Bandits.

thompson-sampling ucb multi-armed-bandit aoi age-of-information correlated-multi-armed-bandits correlated-arms aoi-regret

Updated Jun 19, 2021
Python

Murtazali05 / Multi-armed-bandit

Multi Armed Bandits implementation using the Jester Dataset

thompson-sampling ucb multi-armed-bandits e-greedy

Updated Apr 5, 2021
Python

SarCode / ML-Code-Tutorials-Udemy

Complete Tutorial Guide with Code for learning ML

natural-language-processing random-forest svm scikit-learn artificial-neural-networks logistic-regression ucb polynomial-regression kmeans-clustering knearest-neighbor-algorithm apriori-algorithm classification-methods svr kernel-svm kernel-pca heirarchical-clustering decison-trees

Updated Apr 21, 2023
Python

salimandre / Monte-Carlo-Tree-Search

We implemented Monte Carlo Tree Search (MCTS) from scratch and successfully applied it to the game of Tic-Tac-Toe.

reinforcement-learning graphics mcts ucb monte-carlo-tree-search tic-tac-toe-game upper-confidence-bound

Updated Jul 9, 2020
Python

Vinit-4689 / Multi-Armed-Bandit

Efficient exploration and exploitation strategies using Epsilon-Greedy, UCB1, and Thompson Sampling — with code, math, and intuition.

python machine-learning reinforcement-learning thompson-sampling epsilon-greedy ucb multi-armed-bandits portfolio-project bandit-algorithms exploration-vs-exploitation

Updated Apr 13, 2025
Python

sarthakmittal92 / multi-armed-bandits

Repository for the course project done as part of the CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.

python thompson-sampling reinforcement-learning-algorithms ucb multi-armed-bandits bandits kl-ucb

Updated Oct 14, 2022
Python

LittleWat / hyper-parameter-optimization-by-GMRF-GPUCB

R.I.T project

python3 ucb gaussian-processes gmrf markov-random-field gp

Updated Jul 29, 2019
Python

Suchetaaa / CS747-Assignments

Foundations Of Intelligent Learning Agents (FILA) Assignments

reinforcement-learning monte-carlo linear-programming thompson-sampling ucb bootstrapping multi-armed-bandits bellman-equation temporal-differencing-learning howards-pi sarsa-learning kl-ucb windy-gridworld intelligent-learning-agents

Updated Nov 8, 2019
Python
