Google Scholar

Quantum heavy-tailed bandits

Y Wu, C Guan, V Aggarwal, D Wang - arXiv preprint arXiv:2301.09680, 2023 - arxiv.org

… for heavy-tailed bandits, we first propose a new quantum … for heavy-tailed distributions,
which is based on the Quantum … quantum mean estimator, we focus on quantum heavy-tailed …

Save Cite Cited by 8 Related articles All 2 versions View as HTML

[PDF] mlr.press

Adaptive best-of-both-worlds algorithm for heavy-tailed multi-armed bandits

J Huang, Y Dai, L Huang - international conference on …, 2022 - proceedings.mlr.press

… In this paper, we generalize the concept of heavytailed multi-armed bandits … heavy-tailed
bandits in both stochastic and adversarial cases. In contrast to existing (stochastic) heavy-tailed …

Save Cite Cited by 26 Related articles All 3 versions View as HTML

[PDF] arxiv.org

Bandits with heavy tail

S Bubeck, N Cesa-Bianchi… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org

… the UCB algorithm to heavytailed stochastic multi-armed bandit problems in which the …
heavytailed bandits with dependent reward processes. While we focused our attention on bandit …

Save Cite Cited by 361 Related articles All 13 versions

[PDF] mlr.press

No-regret reinforcement learning with heavy-tailed rewards

V Zhuang, Y Sui - International Conference on Artificial …, 2021 - proceedings.mlr.press

… The median-of-means estimator is a commonly-used strategy for performing robust mean
estimation in heavy-tailed bandit algorithms. In an orthogonal line of work, Pazis et al. [2016] …

Save Cite Cited by 19 Related articles All 6 versions View as HTML

[PDF] neurips.cc

Quantum bayesian optimization

Z Dai, GKR Lau, A Verma, Y Shu… - Advances in Neural …, 2023 - proceedings.neurips.cc

… approaches to introduce quantum bandit algorithms for, respectively, stochastic convex
bandits and bandits with heavy-tailed reward distributions. In addition to quantum bandits, some …

Save Cite Cited by 23 Related articles All 10 versions View as HTML

[PDF] arxiv.org

Quantum Lipschitz Bandits

B Yi, Y Kang, Y Li - arXiv preprint arXiv:2504.02251, 2025 - arxiv.org

… quantum computing and the demonstrated success of quantum Monte Carlo in simpler bandit
… [43] extended this line of research to quantum bandits with heavy-tailed rewards, while [25] …

Save Cite Cited by 2 Related articles All 2 versions View as HTML

[PDF] aaai.org

Quantum Best Arm Identification with Quantum Oracles

X Wang, YZJ Chen, MG de Andrade, J Allcock… - Proceedings of the …, 2025 - ojs.aaai.org

… the quantum information feedback from these quantum systems can be leveraged to improve
the learning efficiency. In this paper, we study the BAI problem in quantum … Quantum bandit …

Save Cite Related articles View as HTML

[PDF] arxiv.org

Quantum sub-Gaussian mean estimator

Y Hamoudi - arXiv preprint arXiv:2108.12172, 2021 - arxiv.org

… quantum algorithm for estimating the mean of a real-valued random variable obtained as
the output of a quantum … to estimate the mean of a heavy-tailed distribution with a sub-Gaussian …

Save Cite Cited by 27 Related articles All 12 versions View as HTML

[PDF] springer.com

Corruption-tolerant bandit learning

S Kapoor, KK Patel, P Kar - Machine Learning, 2019 - Springer

… We will consider a much more powerful fully adaptive adversary in the next section on
linear-contextual bandits. We note that although algorithms for heavy-tailed bandits can handle …

Save Cite Cited by 70 Related articles All 5 versions

[PDF] jmlr.org

On the sample complexity and metastability of heavy-tailed policy search in continuous control

AS Bedi, A Parayil, J Zhang, M Wang… - Journal of Machine …, 2024 - jmlr.org

… parameterize policies as heavy-tailed distributions, which induces heavy-tailed gradient
noise. … • We present a few heavy-tailed policy parameterizations that may be used in lieu of a …

Save Cite Cited by 16 Related articles All 2 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library

Quantum heavy-tailed bandits

Adaptive best-of-both-worlds algorithm for heavy-tailed multi-armed bandits

Bandits with heavy tail

No-regret reinforcement learning with heavy-tailed rewards

Quantum bayesian optimization

Quantum Lipschitz Bandits

Quantum Best Arm Identification with Quantum Oracles

Quantum sub-Gaussian mean estimator

Corruption-tolerant bandit learning

On the sample complexity and metastability of heavy-tailed policy search in continuous control

Related searches