Do learning rates adapt to the distribution of rewards?
Studies of reinforcement learning have shown that humans learn differently in response to positive and negative reward prediction errors, a phenomenon that can be captured computationally by positing asymmetric learning rates. This asymmetry, motivated by neurobiological and cognitive considerations...
Main Author: | Gershman, Samuel J. |
---|---|
Other Authors: | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences |
Format: | Article |
Language: | English |
Published: |
Springer US
2016
|
Online Access: | http://hdl.handle.net/1721.1/103813 |
Similar Items
-
Dopamine Ramps Are a Consequence of Reward Prediction Errors
by: Gershman, Samuel J.
Published: (2014) -
Where do hypotheses come from?
by: Dasgupta, Ishita, et al.
Published: (2016) -
A functional MRI study of the distributed neural circuitry of learning and reward
by: Awai, Alexandra F
Published: (2006) -
Scalable reward learning from demonstration
by: Michini, Bernard J., et al.
Published: (2015) -
Rank2Reward: Learning Robot Reward Functions
from Passive Video
by: Yang, Daniel Xin
Published: (2023)