Semiparametric contextual bandits

A Krishnamurthy, ZS Wu… - … Conference on Machine …, 2018 - proceedings.mlr.press
… Working towards an affirmative answer to this question, we consider a semiparametric
contextual bandit setup where the reward is modeled as a linear function of the decision …

A practical semi-parametric contextual bandit.

Y Peng, M Xie, J Liu, X Meng, N Li, C Yang, T Yao… - IJCAI, 2019 - ijcai.org
… tures in semi-parametric bandits. It is the first time to propose a novel Semi-Parametric
Contextual Bandits Problem, as far as we know, for cases where contextual features are evolved …

Semi-parametric dynamic contextual pricing

V Shah, R Johari, J Blanchet - Advances in Neural …, 2019 - proceedings.neurips.cc
… the seller can leverage contextual information describing the … Unlike in usual contextual
bandit settings, the optimal price/… We develop a semi-parametric model in which the residual …

Semi-parametric contextual bandits with graph-Laplacian regularization

YG Choi, GS Kim, S Paik, MC Paik - Information Sciences, 2023 - Elsevier
… We study the semi-parametric contextual bandit problem for multiple users equipped with a
user network. Suppose that there are n users, say j ∈ V = {1, …, n}. For each time step t = …

Semi-parametric contextual bandits with graph-Laplacian regularization

YG Choi, GS Kim, S Paik, MC Paik - arXiv preprint arXiv:2205.08295, 2022 - arxiv.org
… We study the semi-parametric contextual bandit problem for multiple users equipped with
a user network. Suppose that there are n users, say j ∈ V = {1,...,n}. For each time step t = 1,...,T…

Semi-parametric sampling for stochastic bandits with many arms

M Ou, N Li, C Yang, S Zhu, R Jin - Proceedings of the AAAI Conference on …, 2019 - aaai.org
Contextual bandit algorithms were then proposed to improve the efficiency when the arm …
contextual bandit algorithms may suffer linear regret bound in more general semi-parametric

Contextual multi-armed bandit algorithm for semiparametric reward model

GS Kim, MC Paik - International Conference on Machine …, 2019 - proceedings.mlr.press
… the context of the action. This paper proposes a new contextual MAB algorithm for a relaxed,
semiparametric … We verify that the contextual bandit algorithms achieve substantially higher …

Contextual multi-armed bandit algorithm for semiparametric reward model

김지수 - 2019 - s-space.snu.ac.kr
… review existing contextual bandit algorithms and their theoretical properties. In Chapter 3,
we present a new contextual MAB algorithm which works well under a semiparametric reward …

Estimation considerations in contextual bandits

M Dimakopoulou, Z Zhou, S Athey… - arXiv preprint arXiv …, 2017 - arxiv.org
… strong practical advantage of balanced contextual bandits on a large … Additionally, we develop
contextual bandits with simpler … Adjusting for nonignorable drop-out using semiparametric

Empirical likelihood for contextual bandits

N Karampatziakis, J Langford… - Advances in Neural …, 2020 - proceedings.neurips.cc
… Learning algorithms for contextual bandits include theoretical … A recent paper about empirical
contextual bandit learning [4] … Our combination of CIs with learning is a contextual bandit