Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time
Abstract: The dominant theoretical framework to account for reinforcement learning in the brain is temporal difference (TD) learning, whereby certain units signal reward prediction errors (RPE). The TD algorithm has traditionally been mapped onto the dopaminergic system, as firing properties...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: | Nature Portfolio, 2024-07-01 |
Series: | Nature Communications |
Online Access: | https://doi.org/10.1038/s41467-024-50205-3 |