Undiscounted bandit games
We analyze undiscounted continuous-time games of strategic experimentation with two-armed bandits. The risky arm generates payoffs according to a Lévy process with an unknown average payoff per unit of time which nature draws from an arbitrary finite set. Observing all actions and realized payoffs,...
Κύριοι συγγραφείς: | , |
---|---|
Μορφή: | Journal article |
Γλώσσα: | English |
Έκδοση: |
Elsevier
2020
|