A Recurrent Neural Network for Attenuating Non-cognitive Components of Pupil Dynamics

There is increasing interest in how the pupil dynamics of the eye reflect underlying cognitive processes and brain states. Problematic, however, is that pupil changes can be due to non-cognitive factors, for example luminance changes in the environment, accommodation and movement. In this paper we c...

Full description

Bibliographic Details
Main Authors: Sharath Koorathota, Kaveri Thakoor, Linbi Hong, Yaoli Mao, Patrick Adelman, Paul Sajda
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-02-01
Series:Frontiers in Psychology
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpsyg.2021.604522/full
_version_ 1818329491060031488
author Sharath Koorathota
Sharath Koorathota
Kaveri Thakoor
Linbi Hong
Yaoli Mao
Patrick Adelman
Paul Sajda
author_facet Sharath Koorathota
Sharath Koorathota
Kaveri Thakoor
Linbi Hong
Yaoli Mao
Patrick Adelman
Paul Sajda
author_sort Sharath Koorathota
collection DOAJ
description There is increasing interest in how the pupil dynamics of the eye reflect underlying cognitive processes and brain states. Problematic, however, is that pupil changes can be due to non-cognitive factors, for example luminance changes in the environment, accommodation and movement. In this paper we consider how by modeling the response of the pupil in real-world environments we can capture the non-cognitive related changes and remove these to extract a residual signal which is a better index of cognition and performance. Specifically, we utilize sequence measures such as fixation position, duration, saccades, and blink-related information as inputs to a deep recurrent neural network (RNN) model for predicting subsequent pupil diameter. We build and evaluate the model for a task where subjects are watching educational videos and subsequently asked questions based on the content. Compared to commonly-used models for this task, the RNN had the lowest errors rates in predicting subsequent pupil dilation given sequence data. Most importantly was how the model output related to subjects' cognitive performance as assessed by a post-viewing test. Consistent with our hypothesis that the model captures non-cognitive pupil dynamics, we found (1) the model's root-mean square error was less for lower performing subjects than for those having better performance on the post-viewing test, (2) the residuals of the RNN (LSTM) model had the highest correlation with subject post-viewing test scores and (3) the residuals had the highest discriminability (assessed via area under the ROC curve, AUC) for classifying high and low test performers, compared to the true pupil size or the RNN model predictions. This suggests that deep learning sequence models may be good for separating components of pupil responses that are linked to luminance and accommodation from those that are linked to cognition and arousal.
first_indexed 2024-12-13T12:48:54Z
format Article
id doaj.art-74d80979c73740c58e5193cf631460b3
institution Directory Open Access Journal
issn 1664-1078
language English
last_indexed 2024-12-13T12:48:54Z
publishDate 2021-02-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Psychology
spelling doaj.art-74d80979c73740c58e5193cf631460b32022-12-21T23:45:24ZengFrontiers Media S.A.Frontiers in Psychology1664-10782021-02-011210.3389/fpsyg.2021.604522604522A Recurrent Neural Network for Attenuating Non-cognitive Components of Pupil DynamicsSharath Koorathota0Sharath Koorathota1Kaveri Thakoor2Linbi Hong3Yaoli Mao4Patrick Adelman5Paul Sajda6Department of Biomedical Engineering, Columbia University, New York, NY, United StatesFovea Inc., New York, NY, United StatesDepartment of Biomedical Engineering, Columbia University, New York, NY, United StatesDepartment of Biomedical Engineering, Columbia University, New York, NY, United StatesDepartment of Cognitive Science, Columbia University, New York, NY, United StatesFovea Inc., New York, NY, United StatesDepartment of Biomedical Engineering, Columbia University, New York, NY, United StatesThere is increasing interest in how the pupil dynamics of the eye reflect underlying cognitive processes and brain states. Problematic, however, is that pupil changes can be due to non-cognitive factors, for example luminance changes in the environment, accommodation and movement. In this paper we consider how by modeling the response of the pupil in real-world environments we can capture the non-cognitive related changes and remove these to extract a residual signal which is a better index of cognition and performance. Specifically, we utilize sequence measures such as fixation position, duration, saccades, and blink-related information as inputs to a deep recurrent neural network (RNN) model for predicting subsequent pupil diameter. We build and evaluate the model for a task where subjects are watching educational videos and subsequently asked questions based on the content. Compared to commonly-used models for this task, the RNN had the lowest errors rates in predicting subsequent pupil dilation given sequence data. Most importantly was how the model output related to subjects' cognitive performance as assessed by a post-viewing test. Consistent with our hypothesis that the model captures non-cognitive pupil dynamics, we found (1) the model's root-mean square error was less for lower performing subjects than for those having better performance on the post-viewing test, (2) the residuals of the RNN (LSTM) model had the highest correlation with subject post-viewing test scores and (3) the residuals had the highest discriminability (assessed via area under the ROC curve, AUC) for classifying high and low test performers, compared to the true pupil size or the RNN model predictions. This suggests that deep learning sequence models may be good for separating components of pupil responses that are linked to luminance and accommodation from those that are linked to cognition and arousal.https://www.frontiersin.org/articles/10.3389/fpsyg.2021.604522/fullrecurrent neural networkpupil diametereye trackingvideo viewingpupil response
spellingShingle Sharath Koorathota
Sharath Koorathota
Kaveri Thakoor
Linbi Hong
Yaoli Mao
Patrick Adelman
Paul Sajda
A Recurrent Neural Network for Attenuating Non-cognitive Components of Pupil Dynamics
Frontiers in Psychology
recurrent neural network
pupil diameter
eye tracking
video viewing
pupil response
title A Recurrent Neural Network for Attenuating Non-cognitive Components of Pupil Dynamics
title_full A Recurrent Neural Network for Attenuating Non-cognitive Components of Pupil Dynamics
title_fullStr A Recurrent Neural Network for Attenuating Non-cognitive Components of Pupil Dynamics
title_full_unstemmed A Recurrent Neural Network for Attenuating Non-cognitive Components of Pupil Dynamics
title_short A Recurrent Neural Network for Attenuating Non-cognitive Components of Pupil Dynamics
title_sort recurrent neural network for attenuating non cognitive components of pupil dynamics
topic recurrent neural network
pupil diameter
eye tracking
video viewing
pupil response
url https://www.frontiersin.org/articles/10.3389/fpsyg.2021.604522/full
work_keys_str_mv AT sharathkoorathota arecurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT sharathkoorathota arecurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT kaverithakoor arecurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT linbihong arecurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT yaolimao arecurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT patrickadelman arecurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT paulsajda arecurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT sharathkoorathota recurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT sharathkoorathota recurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT kaverithakoor recurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT linbihong recurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT yaolimao recurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT patrickadelman recurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics
AT paulsajda recurrentneuralnetworkforattenuatingnoncognitivecomponentsofpupildynamics