Collaboratively Learning the Best Option on Graphs, Using Bounded Local Memory
<jats:p>We consider multi-armed bandit problems in social groups wherein each individual has bounded memory and shares the common goal of learning the best arm/option. We say an individual learns the best option if eventually (as $t\diverge$) it pulls only the arm with the highest expected rew...
Main Authors: | Su, Lili, Zubeldia, Martin, Lynch, Nancy |
---|---|
Other Authors: | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science |
Format: | Article |
Language: | English |
Published: |
Association for Computing Machinery (ACM)
2021
|
Online Access: | https://hdl.handle.net/1721.1/135402 |
Similar Items
-
Reliably Detecting Connectivity using Local Graph Traits
by: Cornejo, Alejandro, et al.
Published: (2010) -
Reliably Detecting Connectivity Using Local Graph Traits
by: Cornejo Collado, Alex, et al.
Published: (2011) -
Information-Theoretic Lower Bounds on the Storage Cost of Shared Memory Emulation
by: Cadambe, Viveck R., et al.
Published: (2021) -
A lower bound on the queueing delay in resource constrained load balancing
by: Gamarnik, David, et al.
Published: (2021) -
Delay, Memory, and Messaging Tradeoffs in Distributed Service Systems
by: Gamarnik, David, et al.
Published: (2019)