Do learning rates adapt to the distribution of rewards?
Studies of reinforcement learning have shown that humans learn differently in response to positive and negative reward prediction errors, a phenomenon that can be captured computationally by positing asymmetric learning rates. This asymmetry, motivated by neurobiological and cognitive considerations...
主要作者: | |
---|---|
其他作者: | |
格式: | Article |
語言: | English |
出版: |
Springer US
2016
|
在線閱讀: | http://hdl.handle.net/1721.1/103813 |