A hypothesis for basal ganglia-dependent reinforcement learning in the songbird

Most of our motor skills are not innately programmed, but are learned by a combination of motor exploration and performance evaluation, suggesting that they proceed through a reinforcement learning (RL) mechanism. Songbirds have emerged as a model system to study how a complex behavioral sequence ca...

Full description

Bibliographic Details
Main Authors: Fee, Michale S., Goldberg, Jesse H.
Other Authors: McGovern Institute for Brain Research at MIT
Format: Article
Language:English
Published: Elsevier BV 2020
Online Access:https://hdl.handle.net/1721.1/126651
_version_ 1826195942267682816
author Fee, Michale S.
Goldberg, Jesse H.
author2 McGovern Institute for Brain Research at MIT
author_facet McGovern Institute for Brain Research at MIT
Fee, Michale S.
Goldberg, Jesse H.
author_sort Fee, Michale S.
collection MIT
description Most of our motor skills are not innately programmed, but are learned by a combination of motor exploration and performance evaluation, suggesting that they proceed through a reinforcement learning (RL) mechanism. Songbirds have emerged as a model system to study how a complex behavioral sequence can be learned through an RL-like strategy. Interestingly, like motor sequence learning in mammals, song learning in birds requires a basal ganglia (BG)-thalamocortical loop, suggesting common neural mechanisms. Here, we outline a specific working hypothesis for how BG-forebrain circuits could utilize an internally computed reinforcement signal to direct song learning. Our model includes a number of general concepts borrowed from the mammalian BG literature, including a dopaminergic reward prediction error and dopamine-mediated plasticity at corticostriatal synapses. We also invoke a number of conceptual advances arising from recent observations in the songbird. Specifically, there is evidence for a specialized cortical circuit that adds trial-to-trial variability to stereotyped cortical motor programs, and a role for the BG in ''biasing'' this variability to improve behavioral performance. This BG-dependent ''premotor bias'' may in turn guide plasticity in downstream cortical synapses to consolidate recently learned song changes. Given the similarity between mammalian and songbird BG-thalamocortical circuits, our model for the role of the BG in this process may have broader relevance to mammalian BG function.This article is part of a Special Issue entitled: Function and Dysfunction of the Basal Ganglia.
first_indexed 2024-09-23T10:18:16Z
format Article
id mit-1721.1/126651
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T10:18:16Z
publishDate 2020
publisher Elsevier BV
record_format dspace
spelling mit-1721.1/1266512022-09-26T17:07:00Z A hypothesis for basal ganglia-dependent reinforcement learning in the songbird Fee, Michale S. Goldberg, Jesse H. McGovern Institute for Brain Research at MIT Most of our motor skills are not innately programmed, but are learned by a combination of motor exploration and performance evaluation, suggesting that they proceed through a reinforcement learning (RL) mechanism. Songbirds have emerged as a model system to study how a complex behavioral sequence can be learned through an RL-like strategy. Interestingly, like motor sequence learning in mammals, song learning in birds requires a basal ganglia (BG)-thalamocortical loop, suggesting common neural mechanisms. Here, we outline a specific working hypothesis for how BG-forebrain circuits could utilize an internally computed reinforcement signal to direct song learning. Our model includes a number of general concepts borrowed from the mammalian BG literature, including a dopaminergic reward prediction error and dopamine-mediated plasticity at corticostriatal synapses. We also invoke a number of conceptual advances arising from recent observations in the songbird. Specifically, there is evidence for a specialized cortical circuit that adds trial-to-trial variability to stereotyped cortical motor programs, and a role for the BG in ''biasing'' this variability to improve behavioral performance. This BG-dependent ''premotor bias'' may in turn guide plasticity in downstream cortical synapses to consolidate recently learned song changes. Given the similarity between mammalian and songbird BG-thalamocortical circuits, our model for the role of the BG in this process may have broader relevance to mammalian BG function.This article is part of a Special Issue entitled: Function and Dysfunction of the Basal Ganglia. NIH (Grant R01DC009183, R01MH067105 and K99NS067062) 2020-08-18T17:52:51Z 2020-08-18T17:52:51Z 2011-12 2019-09-30T18:09:22Z Article http://purl.org/eprint/type/JournalArticle 0306-4522 https://hdl.handle.net/1721.1/126651 Fee, M.S. and J. H. Goldberg. "A hypothesis for basal ganglia-dependent reinforcement learning in the songbird." Neuroscience 198 (December 2011): 152-170 © 2011 IBRO en http://dx.doi.org/10.1016/j.neuroscience.2011.09.069 Neuroscience Creative Commons Attribution-NonCommercial-NoDerivs License http://creativecommons.org/licenses/by-nc-nd/4.0/ application/pdf Elsevier BV PMC
spellingShingle Fee, Michale S.
Goldberg, Jesse H.
A hypothesis for basal ganglia-dependent reinforcement learning in the songbird
title A hypothesis for basal ganglia-dependent reinforcement learning in the songbird
title_full A hypothesis for basal ganglia-dependent reinforcement learning in the songbird
title_fullStr A hypothesis for basal ganglia-dependent reinforcement learning in the songbird
title_full_unstemmed A hypothesis for basal ganglia-dependent reinforcement learning in the songbird
title_short A hypothesis for basal ganglia-dependent reinforcement learning in the songbird
title_sort hypothesis for basal ganglia dependent reinforcement learning in the songbird
url https://hdl.handle.net/1721.1/126651
work_keys_str_mv AT feemichales ahypothesisforbasalgangliadependentreinforcementlearninginthesongbird
AT goldbergjesseh ahypothesisforbasalgangliadependentreinforcementlearninginthesongbird
AT feemichales hypothesisforbasalgangliadependentreinforcementlearninginthesongbird
AT goldbergjesseh hypothesisforbasalgangliadependentreinforcementlearninginthesongbird