Learning the payoffs and costs of actions

A set of sub-cortical nuclei called basal ganglia is critical for learning the values of actions. The basal ganglia include two pathways, which have been associated with approach and avoid behavior respectively and are differentially modulated by dopamine projections from the midbrain. Inspired by t...

Full description

Bibliographic Details
Main Authors: Möller, M, Bogacz, R
Format: Journal article
Language:English
Published: Public Library of Science 2019
_version_ 1797065561443663872
author Möller, M
Bogacz, R
author_facet Möller, M
Bogacz, R
author_sort Möller, M
collection OXFORD
description A set of sub-cortical nuclei called basal ganglia is critical for learning the values of actions. The basal ganglia include two pathways, which have been associated with approach and avoid behavior respectively and are differentially modulated by dopamine projections from the midbrain. Inspired by the influential opponent actor learning model, we demonstrate that, under certain circumstances, these pathways may represent learned estimates of the positive and negative consequences (payoffs and costs) of individual actions. In the model, the level of dopamine activity encodes the motivational state and controls to what extent payoffs and costs enter the overall evaluation of actions. We show that a set of previously proposed plasticity rules is suitable to extract payoffs and costs from a prediction error signal if they occur at different moments in time. For those plasticity rules, successful learning requires differential effects of positive and negative outcome prediction errors on the two pathways and a weak decay of synaptic weights over trials. We also confirm through simulations that the model reproduces drug-induced changes of willingness to work, as observed in classical experiments with the D2-antagonist haloperidol.
first_indexed 2024-03-06T21:30:23Z
format Journal article
id oxford-uuid:4480d75f-d328-4621-9174-e58daba3ef08
institution University of Oxford
language English
last_indexed 2024-03-06T21:30:23Z
publishDate 2019
publisher Public Library of Science
record_format dspace
spelling oxford-uuid:4480d75f-d328-4621-9174-e58daba3ef082022-03-26T15:01:52ZLearning the payoffs and costs of actionsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:4480d75f-d328-4621-9174-e58daba3ef08EnglishSymplectic Elements at OxfordPublic Library of Science2019Möller, MBogacz, RA set of sub-cortical nuclei called basal ganglia is critical for learning the values of actions. The basal ganglia include two pathways, which have been associated with approach and avoid behavior respectively and are differentially modulated by dopamine projections from the midbrain. Inspired by the influential opponent actor learning model, we demonstrate that, under certain circumstances, these pathways may represent learned estimates of the positive and negative consequences (payoffs and costs) of individual actions. In the model, the level of dopamine activity encodes the motivational state and controls to what extent payoffs and costs enter the overall evaluation of actions. We show that a set of previously proposed plasticity rules is suitable to extract payoffs and costs from a prediction error signal if they occur at different moments in time. For those plasticity rules, successful learning requires differential effects of positive and negative outcome prediction errors on the two pathways and a weak decay of synaptic weights over trials. We also confirm through simulations that the model reproduces drug-induced changes of willingness to work, as observed in classical experiments with the D2-antagonist haloperidol.
spellingShingle Möller, M
Bogacz, R
Learning the payoffs and costs of actions
title Learning the payoffs and costs of actions
title_full Learning the payoffs and costs of actions
title_fullStr Learning the payoffs and costs of actions
title_full_unstemmed Learning the payoffs and costs of actions
title_short Learning the payoffs and costs of actions
title_sort learning the payoffs and costs of actions
work_keys_str_mv AT mollerm learningthepayoffsandcostsofactions
AT bogaczr learningthepayoffsandcostsofactions