Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices

Eye tracking is becoming a very popular, useful, and important technology. Many eye tracking technologies are currently expensive and only available to large corporations. Some of them necessitate explicit personal calibration, which makes them unsuitable for use in real-world or uncontrolled enviro...

Full description

Bibliographic Details
Main Authors: Andronicus A. Akinyelu, Pieter Blignaut
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-01-01
Series:Frontiers in Artificial Intelligence
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frai.2021.796825/full
_version_ 1818954510113964032
author Andronicus A. Akinyelu
Pieter Blignaut
author_facet Andronicus A. Akinyelu
Pieter Blignaut
author_sort Andronicus A. Akinyelu
collection DOAJ
description Eye tracking is becoming a very popular, useful, and important technology. Many eye tracking technologies are currently expensive and only available to large corporations. Some of them necessitate explicit personal calibration, which makes them unsuitable for use in real-world or uncontrolled environments. Explicit personal calibration can also be cumbersome and degrades the user experience. To address these issues, this study proposes a Convolutional Neural Network (CNN) based calibration-free technique for improved gaze estimation in unconstrained environments. The proposed technique consists of two components, namely a face component and a 39-point facial landmark component. The face component is used to extract the gaze estimation features from the eyes, while the 39-point facial landmark component is used to encode the shape and location of the eyes (within the face) into the network. Adding this information can make the network learn free-head and eye movements. Another CNN model was designed in this study primarily for the sake of comparison. The CNN model accepts only the face images as input. Different experiments were performed, and the experimental result reveals that the proposed technique outperforms the second model. Fine-tuning was also performed using the VGG16 pre-trained model. Experimental results show that the fine-tuned results of the proposed technique perform better than the fine-tuned results of the second model. Overall, the results show that 39-point facial landmarks can be used to improve the performance of CNN-based gaze estimation models.
first_indexed 2024-12-20T10:23:18Z
format Article
id doaj.art-523fa753a3994e06ba950cdee04207f2
institution Directory Open Access Journal
issn 2624-8212
language English
last_indexed 2024-12-20T10:23:18Z
publishDate 2022-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Artificial Intelligence
spelling doaj.art-523fa753a3994e06ba950cdee04207f22022-12-21T19:43:53ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122022-01-01410.3389/frai.2021.796825796825Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile DevicesAndronicus A. AkinyeluPieter BlignautEye tracking is becoming a very popular, useful, and important technology. Many eye tracking technologies are currently expensive and only available to large corporations. Some of them necessitate explicit personal calibration, which makes them unsuitable for use in real-world or uncontrolled environments. Explicit personal calibration can also be cumbersome and degrades the user experience. To address these issues, this study proposes a Convolutional Neural Network (CNN) based calibration-free technique for improved gaze estimation in unconstrained environments. The proposed technique consists of two components, namely a face component and a 39-point facial landmark component. The face component is used to extract the gaze estimation features from the eyes, while the 39-point facial landmark component is used to encode the shape and location of the eyes (within the face) into the network. Adding this information can make the network learn free-head and eye movements. Another CNN model was designed in this study primarily for the sake of comparison. The CNN model accepts only the face images as input. Different experiments were performed, and the experimental result reveals that the proposed technique outperforms the second model. Fine-tuning was also performed using the VGG16 pre-trained model. Experimental results show that the fine-tuned results of the proposed technique perform better than the fine-tuned results of the second model. Overall, the results show that 39-point facial landmarks can be used to improve the performance of CNN-based gaze estimation models.https://www.frontiersin.org/articles/10.3389/frai.2021.796825/fullConvolutional Neural Networkcomputer visiongaze estimationeye trackingmobile device
spellingShingle Andronicus A. Akinyelu
Pieter Blignaut
Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
Frontiers in Artificial Intelligence
Convolutional Neural Network
computer vision
gaze estimation
eye tracking
mobile device
title Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_full Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_fullStr Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_full_unstemmed Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_short Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_sort convolutional neural network based technique for gaze estimation on mobile devices
topic Convolutional Neural Network
computer vision
gaze estimation
eye tracking
mobile device
url https://www.frontiersin.org/articles/10.3389/frai.2021.796825/full
work_keys_str_mv AT andronicusaakinyelu convolutionalneuralnetworkbasedtechniqueforgazeestimationonmobiledevices
AT pieterblignaut convolutionalneuralnetworkbasedtechniqueforgazeestimationonmobiledevices