On the convergence of reinforcement learning
This paper examines the convergence of payoffs and strategies in Erev and Roth's model of reinforcement learning. When all players use this rule it eliminates iteratively dominated strategies and in two-person constant-sum games average payoffs converge to the value of the game. Strategies conv...
Hlavní autor: | |
---|---|
Médium: | Working paper |
Vydáno: |
University of Oxford
2002
|