Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time

Abstract The dominant theoretical framework to account for reinforcement learning in the brain is temporal difference learning (TD) learning, whereby certain units signal reward prediction errors (RPE). The TD algorithm has been traditionally mapped onto the dopaminergic system, as firing properties...

Full description

Bibliographic Details
Main Authors: Ian Cone, Claudia Clopath, Harel Z. Shouval
Format: Article
Language:English
Published: Nature Portfolio 2024-07-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-024-50205-3