Collaboratively Learning the Best Option on Graphs, Using Bounded Local Memory

We consider multi-armed bandit problems in social groups wherein each individual has bounded memory and shares the common goal of learning the best arm/option. We say an individual learns the best option if eventually (as $t \to \infty$) it pulls only the arm with the highest expected rew...

Bibliographic Details
Main Authors:	Su, Lili; Zubeldia, Martin; Lynch, Nancy
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	English
Published:	Association for Computing Machinery (ACM) 2021
Online Access:	https://hdl.handle.net/1721.1/135402
