Google Scholar

Semiparametric contextual bandits

A Krishnamurthy, ZS Wu… - … Conference on Machine …, 2018 - proceedings.mlr.press

… Working towards an affirmative answer to this question, we consider a semiparametric
contextual bandit setup where the reward is modeled as a linear function of the decision …

Cited by: 58; versions: 5

[PDF] ijcai.org

[PDF] A practical semi-parametric contextual bandit.

Y Peng, M Xie, J Liu, X Meng, N Li, C Yang, T Yao… - IJCAI, 2019 - ijcai.org

… tures in semi-parametric bandits. It is the first time to propose a novel Semi-Parametric
Contextual Bandits Problem, as far as we known, for cases where contextual features are evolved …

Cited by: 12; versions: 2

[PDF] neurips.cc

Semi-parametric dynamic contextual pricing

V Shah, R Johari, J Blanchet - Advances in Neural …, 2019 - proceedings.neurips.cc

… the seller can leverage contextual information describing the … Unlike in usual contextual
bandit settings, the optimal price/… We develop a semi-parametric model in which the residual …

Cited by: 44; versions: 9

Semi-parametric contextual bandits with graph-Laplacian regularization

YG Choi, GS Kim, S Paik, MC Paik - Information Sciences, 2023 - Elsevier

… We study the semi-parametric contextual bandit problem for multiple users equipped with a
user network. Suppose that there are n users, say j ∈ V = {1, …, n}. For each time step t = …

Cited by: 3; versions: 5

[PDF] arxiv.org

Semi-parametric contextual bandits with graph-laplacian regularization

YG Choi, GS Kim, S Paik, MC Paik - arXiv preprint arXiv:2205.08295, 2022 - arxiv.org

… We study the semi-parametric contextual bandit problem for multiple users equipped with
a user network. Suppose that there are n users, say j ∈ V = {1,...,n}. For each time step t = 1,...,T…

Cited by: 4; versions: 2

[PDF] aaai.org

Semi-parametric sampling for stochastic bandits with many arms

M Ou, N Li, C Yang, S Zhu, R Jin - Proceedings of the AAAI Conference on …, 2019 - aaai.org

… Contextual bandit algorithms were then proposed to improve the efficiency when the arm …
contextual bandit algorithms may suffer linear regret bound in more general semi-parametric …

Cited by: 6; versions: 6

[PDF] mlr.press

Contextual multi-armed bandit algorithm for semiparametric reward model

GS Kim, MC Paik - International Conference on Machine …, 2019 - proceedings.mlr.press

… the context of the action. This paper proposes a new contextual MAB algorithm for a relaxed,
semiparametric … We verify that the contextual bandit algorithms achieve substantially higher …

Cited by: 18; versions: 9

[PDF] snu.ac.kr

Contextual multi-armed bandit algorithm for semiparametric reward model

김지수 - 2019 - s-space.snu.ac.kr

… review existing contextual bandit algorithms and their theoretical properties. In Chapter 3,
we present a new contextual MAB algorithm which works well under a semiparametric reward …


[PDF] arxiv.org

Estimation considerations in contextual bandits

M Dimakopoulou, Z Zhou, S Athey… - arXiv preprint arXiv …, 2017 - arxiv.org

… strong practical advantage of balanced contextual bandits on a large … Additionally, we develop
contextual bandits with simpler … Adjusting for nonignorable drop-out using semiparametric …

Cited by: 241; versions: 6

[PDF] neurips.cc

Empirical likelihood for contextual bandits

N Karampatziakis, J Langford… - Advances in Neural …, 2020 - proceedings.neurips.cc

… Learning algorithms for contextual bandits include theoretical … A recent paper about empirical
contextual bandit learning [4] … Our combination of CIs with learning is a contextual bandit …

Cited by: 12; versions: 7
