Quantum heavy-tailed bandits

Y Wu, C Guan, V Aggarwal, D Wang - arXiv preprint arXiv:2301.09680, 2023 - arxiv.org
… for heavy-tailed bandits, we first propose a new quantum … for heavy-tailed distributions,
which is based on the Quantum … quantum mean estimator, we focus on quantum heavy-tailed

Adaptive best-of-both-worlds algorithm for heavy-tailed multi-armed bandits

J Huang, Y Dai, L Huang - international conference on …, 2022 - proceedings.mlr.press
… In this paper, we generalize the concept of heavy-tailed multi-armed bandits … heavy-tailed
bandits in both stochastic and adversarial cases. In contrast to existing (stochastic) heavy-tailed

Bandits with heavy tail

S Bubeck, N Cesa-Bianchi… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
… the UCB algorithm to heavy-tailed stochastic multi-armed bandit problems in which the …
heavy-tailed bandits with dependent reward processes. While we focused our attention on bandit

No-regret reinforcement learning with heavy-tailed rewards

V Zhuang, Y Sui - International Conference on Artificial …, 2021 - proceedings.mlr.press
… The median-of-means estimator is a commonly-used strategy for performing robust mean
estimation in heavy-tailed bandit algorithms. In an orthogonal line of work, Pazis et al. [2016] …

Quantum bayesian optimization

Z Dai, GKR Lau, A Verma, Y Shu… - Advances in Neural …, 2023 - proceedings.neurips.cc
… approaches to introduce quantum bandit algorithms for, respectively, stochastic convex
bandits and bandits with heavy-tailed reward distributions. In addition to quantum bandits, some …

Quantum Lipschitz Bandits

B Yi, Y Kang, Y Li - arXiv preprint arXiv:2504.02251, 2025 - arxiv.org
quantum computing and the demonstrated success of quantum Monte Carlo in simpler bandit
… [43] extended this line of research to quantum bandits with heavy-tailed rewards, while [25] …

Quantum Best Arm Identification with Quantum Oracles

X Wang, YZJ Chen, MG de Andrade, J Allcock… - Proceedings of the …, 2025 - ojs.aaai.org
… the quantum information feedback from these quantum systems can be leveraged to improve
the learning efficiency. In this paper, we study the BAI problem in quantum … Quantum bandit

Quantum sub-Gaussian mean estimator

Y Hamoudi - arXiv preprint arXiv:2108.12172, 2021 - arxiv.org
quantum algorithm for estimating the mean of a real-valued random variable obtained as
the output of a quantum … to estimate the mean of a heavy-tailed distribution with a sub-Gaussian …

Corruption-tolerant bandit learning

S Kapoor, KK Patel, P Kar - Machine Learning, 2019 - Springer
… We will consider a much more powerful fully adaptive adversary in the next section on
linear-contextual bandits. We note that although algorithms for heavy-tailed bandits can handle …

On the sample complexity and metastability of heavy-tailed policy search in continuous control

AS Bedi, A Parayil, J Zhang, M Wang… - Journal of Machine …, 2024 - jmlr.org
… parameterize policies as heavy-tailed distributions, which induces heavy-tailed gradient
noise. … • We present a few heavy-tailed policy parameterizations that may be used in lieu of a …