Achieving Pareto optimality through distributed learning
We propose a simple payoff-based learning rule that is completely decentralized, and that leads to an efficient configuration of actions in any n-person finite strategic-form game with generic payoffs. The algorithm follows the theme of exploration versus exploitation and is hence stochastic in na...
Main authors: | |
---|---|
Format: | Working paper |
Published: | University of Oxford, 2011 |