Integrating reward information for prospective behaviour

Value-based decision-making is often studied in a static context, where participants decide which option to select from those currently available. However, everyday life often involves an additional dimension: deciding when to select to maximise reward. Recent evidence suggests that agents track the...

Full description

Bibliographic Details
Main Authors: Hall-McMaster, S, Stokes, MG, Myers, NE
Format: Journal article
Language:English
Published: Society for Neuroscience 2022
_version_ 1797101938429394944
author Hall-McMaster, S
Stokes, MG
Myers, NE
author_facet Hall-McMaster, S
Stokes, MG
Myers, NE
author_sort Hall-McMaster, S
collection OXFORD
description Value-based decision-making is often studied in a static context, where participants decide which option to select from those currently available. However, everyday life often involves an additional dimension: deciding when to select to maximise reward. Recent evidence suggests that agents track the latent reward of an option, updating changes in their latent reward estimate, to achieve appropriate selection timing (latent reward tracking). However, this strategy can be difficult to distinguish from one in which the optimal selection time is estimated in advance, allowing an agent to wait a pre-determined amount of time before selecting, without needing to monitor an option’s latent reward (distance-to-goal tracking). Here we show that these strategies can in principle be dissociated. Human brain activity was recorded using electroencephalography (EEG) while female and male participants performed a novel decision task. Participants were shown an option and decided when to select it, as its latent reward changed from trial-to-trial. While the latent reward was uncued, it could be estimated using cued information about the option’s starting value and value growth rate. We then used representational similarity analysis to assess whether EEG signals more closely resembled latent reward tracking or distance-to-goal tracking. This approach successfully dissociated the strategies in this task. Starting value and growth rate were translated into a distance-to-goal signal, far in advance of selecting the option. Latent reward could not be independently decoded. These results demonstrate the feasibility of using high temporal resolution neural recordings to identify internally computed decision variables in the human brain.
first_indexed 2024-03-07T05:58:56Z
format Journal article
id oxford-uuid:eb7f646a-530e-42cb-83ec-ced5aeba023d
institution University of Oxford
language English
last_indexed 2024-03-07T05:58:56Z
publishDate 2022
publisher Society for Neuroscience
record_format dspace
spelling oxford-uuid:eb7f646a-530e-42cb-83ec-ced5aeba023d2022-03-27T11:10:11ZIntegrating reward information for prospective behaviourJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:eb7f646a-530e-42cb-83ec-ced5aeba023dEnglishSymplectic ElementsSociety for Neuroscience2022Hall-McMaster, SStokes, MGMyers, NEValue-based decision-making is often studied in a static context, where participants decide which option to select from those currently available. However, everyday life often involves an additional dimension: deciding when to select to maximise reward. Recent evidence suggests that agents track the latent reward of an option, updating changes in their latent reward estimate, to achieve appropriate selection timing (latent reward tracking). However, this strategy can be difficult to distinguish from one in which the optimal selection time is estimated in advance, allowing an agent to wait a pre-determined amount of time before selecting, without needing to monitor an option’s latent reward (distance-to-goal tracking). Here we show that these strategies can in principle be dissociated. Human brain activity was recorded using electroencephalography (EEG) while female and male participants performed a novel decision task. Participants were shown an option and decided when to select it, as its latent reward changed from trial-to-trial. While the latent reward was uncued, it could be estimated using cued information about the option’s starting value and value growth rate. We then used representational similarity analysis to assess whether EEG signals more closely resembled latent reward tracking or distance-to-goal tracking. This approach successfully dissociated the strategies in this task. Starting value and growth rate were translated into a distance-to-goal signal, far in advance of selecting the option. Latent reward could not be independently decoded. These results demonstrate the feasibility of using high temporal resolution neural recordings to identify internally computed decision variables in the human brain.
spellingShingle Hall-McMaster, S
Stokes, MG
Myers, NE
Integrating reward information for prospective behaviour
title Integrating reward information for prospective behaviour
title_full Integrating reward information for prospective behaviour
title_fullStr Integrating reward information for prospective behaviour
title_full_unstemmed Integrating reward information for prospective behaviour
title_short Integrating reward information for prospective behaviour
title_sort integrating reward information for prospective behaviour
work_keys_str_mv AT hallmcmasters integratingrewardinformationforprospectivebehaviour
AT stokesmg integratingrewardinformationforprospectivebehaviour
AT myersne integratingrewardinformationforprospectivebehaviour