Contextual bandits with cross-learning

© 2019 Neural Information Processing Systems Foundation. All rights reserved. In the classical contextual bandits problem, in each round t, a learner observes some context c, chooses some action a to perform, and receives some reward r_{a,t}(c). We consider the variant of this problem where in addition...

Bibliographic Details
Format:	Article
Language:	English
Published:	2021
Online Access:	https://hdl.handle.net/1721.1/137415

Similar Items

Contextual bandits with cross-learning
by: Balseiro, Santiago, et al.
Published: (2021)

Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
by: Foster, Dylan J, et al.
Published: (2021)

Top-k eXtreme Contextual Bandits with Arm Hierarchy
by: Sen, Rajat, et al.
Published: (2023)

Online and Distribution-Free Robustness: Regression and Contextual Bandits with Huber Contamination
by: Chen, Sitan, et al.
Published: (2022)

Undiscounted bandit games
by: Keller, G, et al.
Published: (2020)

Linearly parameterized bandits
by: Tsitsiklis, John N., et al.
Published: (2012)

Batched Bandit Problems
by: Perchet, Vianney, et al.
Published: (2015)

Multi-arm bandit-led clustering in federated learning
by: Zhao, Joe Chen Xuan
Published: (2024)

Dealers, Insiders and Bandits: Learning and its Effects on Market Outcomes
by: Das, Sanmay
Published: (2006)

Dealers, insiders and bandits : learning and its effects on market outcomes
by: Das, Sanmay, 1979-
Published: (2007)

Nonstochastic Bandits with Infinitely Many Experts
by: Meng, X Flora, et al.
Published: (2022)

Bandit Problems under Censored Feedback
by: Guinet, Gauthier Marc Benoit
Published: (2023)

Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
by: Jin, Chi, et al.
Published: (2022)

Aging Wireless Bandits: Regret Analysis and Order-Optimal Learning Algorithm
by: Atay, Eray Unsal, et al.
Published: (2022)

A hybrid bandit framework for diversified recommendation
by: Ding, Qinxu, et al.
Published: (2021)

Thompson Sampling on Symmetric Alpha-Stable Bandits
by: Dubey, Abhimanyu, et al.
Published: (2021)

Multi-armed linear bandits with latent biases
by: Kang, Qiyu, et al.
Published: (2024)

Optimization as estimation with Gaussian processes in bandit settings
by: Wang, Zi, Ph.D. Massachusetts Institute of Technology
Published: (2016)

Performance of bandit methods in acoustic relay positioning
by: Cheung, Mei Yi, et al.
Published: (2015)

A Structured Multiarmed Bandit Problem and the Greedy Policy
by: Rusmevichientong, Paat, et al.
Published: (2010)

Hyperparameter Tuning in Bandit-Based Adaptive Operator Selection
by: Pacula, Maciej, et al.
Published: (2012)

Reproduksi kultural Duta (Bandit-Sosial) Kayuagung :: Studi kasus Bandit-Sosial Transnasional di Kabupaten Ogan Komering Ilir Sumatera Selatan
by: MULYADI, et al.
Published: (2006)

Prior convictions: Black-box adversarial attacks with bandits and priors
by: Ilyas, Andrew, et al.
Published: (2021)

Output-weighted sampling for multi-armed bandits with extreme payoffs
by: Yang, Yibo, et al.
Published: (2024)

A Bayesian bandit approach to personalized online coupon recommendations
by: Song, Xiang, Ph. D. Massachusetts Institute of Technology
Published: (2016)

Branching bandits and Klimov's problem : achievable region and side constraints
Published: (2003)

Bandit strategies in social search: the case of the DARPA red balloon challenge
by: Chen, Haohui, et al.
Published: (2021)

Restless bandit, linear programming relaxations and a primal-dual heuristic
Published: (2003)

Bayesian tuning and bandits : an extensible, open source library for AutoML
by: Gustafson, Laura (Laura N.)
Published: (2018)

Restless Bandits, Linear Programming Relaxations and a Primal-Dual Heuristic
by: Bertsimas, Dimitris J., et al.
Published: (2004)

Efficient crowdsourcing of unknown experts using bounded multi-armed bandits
by: Tran-Thanh, L, et al.
Published: (2014)

Histogram contextualization
by: Feng, Jiashi, et al.
Published: (2013)

The essential contextual
by: Stalnaker, Robert
Published: (2020)

Sarcasm detection using deep learning with contextual features
by: Razali, Md Saifullah, et al.
Published: (2021)

Distributed bandit online convex optimization with time-varying coupled inequality constraints
by: Yi, Xinlei, et al.
Published: (2022)

Evolusi jawara di Banten :: Studi evolusi dari bandit menjadi pejabat
by: BANDIYAH, et al.
Published: (2008)

Regulating exploration in multi-armed bandit problems with time patterns and dying arms
by: Tracà, Stefano
Published: (2018)

Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing
by: Schuster, Tal, et al.
Published: (2020)

Contextualizing Human Psychology
by: Pentland, Alex
Published: (2021)

Contextual leadership practices
by: Noman, Mohammad, et al.
Published: (2016)