Multi-armed linear bandits with latent biases
In a linear stochastic bandit model, each arm corresponds to a vector in Euclidean space, and the expected return observed at each time step is determined by an unknown linear function of the selected arm. This paper addresses the challenge of identifying the optimal arm in a linear stochastic bandi...
Main Authors: | Kang, Qiyu, Tay, Wee Peng, She, Rui, Wang, Sijie, Liu, Xiaoqian, Yang, Yuan-Rui |
---|---|
Other Authors: | School of Electrical and Electronic Engineering |
Format: | Journal Article |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/175416 |
Similar Items
-
Multi-arm bandit-led clustering in federated learning
by: Zhao, Joe Chen Xuan
Published: (2024) -
Design of Multi-Armed Bandit-Based Routing for in-Network Caching
by: Gen Tabei, et al.
Published: (2023-01-01) -
LEO-Assisted Aerial Deployment in Post-Disaster Scenarios Using a Combinatorial Bandit and Genetic Algorithmic Approach
by: Ehab Mahmoud Mohamed, et al.
Published: (2023-12-01) -
Signal detection models as contextual bandits
by: Thomas N. Sherratt, et al.
Published: (2023-06-01) -
Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm
by: Emanuele Cavenaghi, et al.
Published: (2021-03-01)