Laplacian-regularized graph bandits: algorithms and theoretical analysis

We consider a stochastic linear bandit problem with multiple users, where the relationship between users is captured by an underlying graph and user preferences are represented as smooth signals on the graph. We introduce a novel bandit algorithm where the smoothness prior is imposed via the random-...

Bibliographic Details
Main Authors: Yang, K; Dong, X; Toni, L
Format: Conference item
Language: English
Published: Proceedings of Machine Learning Research 2020