How pupil responses track value-based decision-making during and after reinforcement learning.

Cognition can reveal itself in the pupil, as latent cognitive processes map onto specific pupil responses. For instance, the pupil dilates when we make decisions and these pupil size fluctuations reflect decision-making computations during and after a choice. Surprisingly little is known, however, a...

Full description

Bibliographic Details
Main Authors: Joanne C Van Slooten, Sara Jahfari, Tomas Knapen, Jan Theeuwes
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-11-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1006632
_version_ 1818716798852268032
author Joanne C Van Slooten
Sara Jahfari
Tomas Knapen
Jan Theeuwes
author_facet Joanne C Van Slooten
Sara Jahfari
Tomas Knapen
Jan Theeuwes
author_sort Joanne C Van Slooten
collection DOAJ
description Cognition can reveal itself in the pupil, as latent cognitive processes map onto specific pupil responses. For instance, the pupil dilates when we make decisions and these pupil size fluctuations reflect decision-making computations during and after a choice. Surprisingly little is known, however, about how pupil responses relate to decisions driven by the learned value of stimuli. This understanding is important, as most real-life decisions are guided by the outcomes of earlier choices. The goal of this study was to investigate which cognitive processes the pupil reflects during value-based decision-making. We used a reinforcement learning task to study pupil responses during value-based decisions and subsequent decision evaluations, employing computational modeling to quantitatively describe the underlying cognitive processes. We found that the pupil closely tracks reinforcement learning processes independently across participants and across trials. Prior to choice, the pupil dilated as a function of trial-by-trial fluctuations in value beliefs about the to-be chosen option and predicted an individual's tendency to exploit high value options. After feedback a biphasic pupil response was observed, the amplitude of which correlated with participants' learning rates. Furthermore, across trials, early feedback-related dilation scaled with value uncertainty, whereas later constriction scaled with signed reward prediction errors. These findings show that pupil size fluctuations can provide detailed information about the computations underlying value-based decisions and the subsequent updating of value beliefs. As these processes are affected in a host of psychiatric disorders, our results indicate that pupillometry can be used as an accessible tool to non-invasively study the processes underlying ongoing reinforcement learning in the clinic.
first_indexed 2024-12-17T19:24:59Z
format Article
id doaj.art-bb2c0cff7eb94d36aac557548f4931a0
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-17T19:24:59Z
publishDate 2018-11-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-bb2c0cff7eb94d36aac557548f4931a02022-12-21T21:35:24ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582018-11-011411e100663210.1371/journal.pcbi.1006632How pupil responses track value-based decision-making during and after reinforcement learning.Joanne C Van SlootenSara JahfariTomas KnapenJan TheeuwesCognition can reveal itself in the pupil, as latent cognitive processes map onto specific pupil responses. For instance, the pupil dilates when we make decisions and these pupil size fluctuations reflect decision-making computations during and after a choice. Surprisingly little is known, however, about how pupil responses relate to decisions driven by the learned value of stimuli. This understanding is important, as most real-life decisions are guided by the outcomes of earlier choices. The goal of this study was to investigate which cognitive processes the pupil reflects during value-based decision-making. We used a reinforcement learning task to study pupil responses during value-based decisions and subsequent decision evaluations, employing computational modeling to quantitatively describe the underlying cognitive processes. We found that the pupil closely tracks reinforcement learning processes independently across participants and across trials. Prior to choice, the pupil dilated as a function of trial-by-trial fluctuations in value beliefs about the to-be chosen option and predicted an individual's tendency to exploit high value options. After feedback a biphasic pupil response was observed, the amplitude of which correlated with participants' learning rates. Furthermore, across trials, early feedback-related dilation scaled with value uncertainty, whereas later constriction scaled with signed reward prediction errors. These findings show that pupil size fluctuations can provide detailed information about the computations underlying value-based decisions and the subsequent updating of value beliefs. As these processes are affected in a host of psychiatric disorders, our results indicate that pupillometry can be used as an accessible tool to non-invasively study the processes underlying ongoing reinforcement learning in the clinic.https://doi.org/10.1371/journal.pcbi.1006632
spellingShingle Joanne C Van Slooten
Sara Jahfari
Tomas Knapen
Jan Theeuwes
How pupil responses track value-based decision-making during and after reinforcement learning.
PLoS Computational Biology
title How pupil responses track value-based decision-making during and after reinforcement learning.
title_full How pupil responses track value-based decision-making during and after reinforcement learning.
title_fullStr How pupil responses track value-based decision-making during and after reinforcement learning.
title_full_unstemmed How pupil responses track value-based decision-making during and after reinforcement learning.
title_short How pupil responses track value-based decision-making during and after reinforcement learning.
title_sort how pupil responses track value based decision making during and after reinforcement learning
url https://doi.org/10.1371/journal.pcbi.1006632
work_keys_str_mv AT joannecvanslooten howpupilresponsestrackvaluebaseddecisionmakingduringandafterreinforcementlearning
AT sarajahfari howpupilresponsestrackvaluebaseddecisionmakingduringandafterreinforcementlearning
AT tomasknapen howpupilresponsestrackvaluebaseddecisionmakingduringandafterreinforcementlearning
AT jantheeuwes howpupilresponsestrackvaluebaseddecisionmakingduringandafterreinforcementlearning