Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action Selection
Cortico-basal ganglia circuits are critical regulators of reward-based decision making. Reinforcement learning models posit that action reward value is encoded by the firing activity of striatal medium spiny neurons (MSNs) and updated upon changing reinforcement contingencies by dopamine (DA) signal...
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2017-08-01
|
Series: | Frontiers in Neuroanatomy |
Subjects: | |
Online Access: | http://journal.frontiersin.org/article/10.3389/fnana.2017.00066/full |
_version_ | 1818515579608236032 |
---|---|
author | Yasumasa Ueda Yasumasa Ueda Ko Yamanaka Ko Yamanaka Atsushi Noritake Kazuki Enomoto Kazuki Enomoto Naoyuki Matsumoto Naoyuki Matsumoto Hiroshi Yamada Hiroshi Yamada Hiroshi Yamada Kazuyuki Samejima Hitoshi Inokawa Yukiko Hori Yukiko Hori Kae Nakamura Minoru Kimura Minoru Kimura |
author_facet | Yasumasa Ueda Yasumasa Ueda Ko Yamanaka Ko Yamanaka Atsushi Noritake Kazuki Enomoto Kazuki Enomoto Naoyuki Matsumoto Naoyuki Matsumoto Hiroshi Yamada Hiroshi Yamada Hiroshi Yamada Kazuyuki Samejima Hitoshi Inokawa Yukiko Hori Yukiko Hori Kae Nakamura Minoru Kimura Minoru Kimura |
author_sort | Yasumasa Ueda |
collection | DOAJ |
description | Cortico-basal ganglia circuits are critical regulators of reward-based decision making. Reinforcement learning models posit that action reward value is encoded by the firing activity of striatal medium spiny neurons (MSNs) and updated upon changing reinforcement contingencies by dopamine (DA) signaling to these neurons. However, it remains unclear how the anatomically distinct direct and indirect pathways through the basal ganglia are involved in updating action reward value under changing contingencies. MSNs of the direct pathway predominantly express DA D1 receptors and those of the indirect pathway predominantly D2 receptors, so we tested for distinct functions in behavioral adaptation by injecting D1 and D2 receptor antagonists into the putamen of two macaque monkeys performing a free choice task for probabilistic reward. In this task, monkeys turned a handle toward either a left or right target depending on an asymmetrically assigned probability of large reward. Reward probabilities of left and right targets changed after 30–150 trials, so the monkeys were required to learn the higher-value target choice based on action–outcome history. In the control condition, the monkeys showed stable selection of the higher-value target (that more likely to yield large reward) and kept choosing the higher-value target regardless of less frequent small reward outcomes. The monkeys also made flexible changes of selection away from the high-value target when two or three small reward outcomes occurred randomly in succession. DA D1 antagonist injection significantly increased the probability of the monkey switching to the alternate target in response to successive small reward outcomes. Conversely, D2 antagonist injection significantly decreased the switching probability. These results suggest distinct functions of D1 and D2 receptor-mediated signaling processes in action selection based on action–outcome history, with D1 receptor-mediated signaling promoting the stable choice of higher-value targets and D2 receptor-mediated signaling promoting a switch in action away from small reward outcomes. Therefore, direct and indirect pathways appear to have complementary functions in maintaining optimal goal-directed action selection and updating action value, which are dependent on D1 and D2 DA receptor signaling. |
first_indexed | 2024-12-11T00:30:42Z |
format | Article |
id | doaj.art-b69900a034e94c7ca14f19ccc60389ca |
institution | Directory Open Access Journal |
issn | 1662-5129 |
language | English |
last_indexed | 2024-12-11T00:30:42Z |
publishDate | 2017-08-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Neuroanatomy |
spelling | doaj.art-b69900a034e94c7ca14f19ccc60389ca2022-12-22T01:27:21ZengFrontiers Media S.A.Frontiers in Neuroanatomy1662-51292017-08-011110.3389/fnana.2017.00066270734Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action SelectionYasumasa Ueda0Yasumasa Ueda1Ko Yamanaka2Ko Yamanaka3Atsushi Noritake4Kazuki Enomoto5Kazuki Enomoto6Naoyuki Matsumoto7Naoyuki Matsumoto8Hiroshi Yamada9Hiroshi Yamada10Hiroshi Yamada11Kazuyuki Samejima12Hitoshi Inokawa13Yukiko Hori14Yukiko Hori15Kae Nakamura16Minoru Kimura17Minoru Kimura18Department of Physiology, Kyoto Prefectural University of MedicineKyoto, JapanDepartment of Physiology, Kansai Medical UniversityHirakata, JapanDepartment of Physiology, Kyoto Prefectural University of MedicineKyoto, JapanDepartment of Physiology, Faculty of Health and Sports Science, Juntendo UniversityChiba, JapanDepartment of Physiology, Kansai Medical UniversityHirakata, JapanDepartment of Physiology, Kyoto Prefectural University of MedicineKyoto, JapanTamagawa University Brain Science InstituteMachida, JapanDepartment of Physiology, Kyoto Prefectural University of MedicineKyoto, JapanDepartment of Food and Health Sciences, Faculty of Environmental and Symbiotic Sciences, Prefectural University of KumamotoKumamoto, JapanDepartment of Physiology, Kyoto Prefectural University of MedicineKyoto, JapanDivision of Biomedical Science, Faculty of Medicine, University of TsukubaTsukuba, JapanGraduate School of Comprehensive Human Sciences, University of TsukubaTsukuba, JapanTamagawa University Brain Science InstituteMachida, JapanDepartment of Physiology, Kyoto Prefectural University of MedicineKyoto, JapanDepartment of Physiology, Kyoto Prefectural University of MedicineKyoto, JapanDepartment of Functional Brain Imaging, National Institute of Radiological Sciences, National Institutes for Quantum and Radiological Science and TechnologyChiba, JapanDepartment of Physiology, Kansai Medical UniversityHirakata, JapanDepartment of Physiology, Kyoto Prefectural University of MedicineKyoto, JapanTamagawa University Brain Science InstituteMachida, JapanCortico-basal ganglia circuits are critical regulators of reward-based decision making. Reinforcement learning models posit that action reward value is encoded by the firing activity of striatal medium spiny neurons (MSNs) and updated upon changing reinforcement contingencies by dopamine (DA) signaling to these neurons. However, it remains unclear how the anatomically distinct direct and indirect pathways through the basal ganglia are involved in updating action reward value under changing contingencies. MSNs of the direct pathway predominantly express DA D1 receptors and those of the indirect pathway predominantly D2 receptors, so we tested for distinct functions in behavioral adaptation by injecting D1 and D2 receptor antagonists into the putamen of two macaque monkeys performing a free choice task for probabilistic reward. In this task, monkeys turned a handle toward either a left or right target depending on an asymmetrically assigned probability of large reward. Reward probabilities of left and right targets changed after 30–150 trials, so the monkeys were required to learn the higher-value target choice based on action–outcome history. In the control condition, the monkeys showed stable selection of the higher-value target (that more likely to yield large reward) and kept choosing the higher-value target regardless of less frequent small reward outcomes. The monkeys also made flexible changes of selection away from the high-value target when two or three small reward outcomes occurred randomly in succession. DA D1 antagonist injection significantly increased the probability of the monkey switching to the alternate target in response to successive small reward outcomes. Conversely, D2 antagonist injection significantly decreased the switching probability. These results suggest distinct functions of D1 and D2 receptor-mediated signaling processes in action selection based on action–outcome history, with D1 receptor-mediated signaling promoting the stable choice of higher-value targets and D2 receptor-mediated signaling promoting a switch in action away from small reward outcomes. Therefore, direct and indirect pathways appear to have complementary functions in maintaining optimal goal-directed action selection and updating action value, which are dependent on D1 and D2 DA receptor signaling.http://journal.frontiersin.org/article/10.3389/fnana.2017.00066/fullmonkeybasal gangliadopaminestriatumrewarddirect pathway |
spellingShingle | Yasumasa Ueda Yasumasa Ueda Ko Yamanaka Ko Yamanaka Atsushi Noritake Kazuki Enomoto Kazuki Enomoto Naoyuki Matsumoto Naoyuki Matsumoto Hiroshi Yamada Hiroshi Yamada Hiroshi Yamada Kazuyuki Samejima Hitoshi Inokawa Yukiko Hori Yukiko Hori Kae Nakamura Minoru Kimura Minoru Kimura Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action Selection Frontiers in Neuroanatomy monkey basal ganglia dopamine striatum reward direct pathway |
title | Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action Selection |
title_full | Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action Selection |
title_fullStr | Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action Selection |
title_full_unstemmed | Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action Selection |
title_short | Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action Selection |
title_sort | distinct functions of the primate putamen direct and indirect pathways in adaptive outcome based action selection |
topic | monkey basal ganglia dopamine striatum reward direct pathway |
url | http://journal.frontiersin.org/article/10.3389/fnana.2017.00066/full |
work_keys_str_mv | AT yasumasaueda distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT yasumasaueda distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT koyamanaka distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT koyamanaka distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT atsushinoritake distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT kazukienomoto distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT kazukienomoto distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT naoyukimatsumoto distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT naoyukimatsumoto distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT hiroshiyamada distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT hiroshiyamada distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT hiroshiyamada distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT kazuyukisamejima distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT hitoshiinokawa distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT yukikohori distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT yukikohori distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT kaenakamura distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT minorukimura distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection AT minorukimura distinctfunctionsoftheprimateputamendirectandindirectpathwaysinadaptiveoutcomebasedactionselection |