Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices

Eye tracking is becoming a very popular, useful, and important technology. Many eye tracking technologies are currently expensive and only available to large corporations. Some of them necessitate explicit personal calibration, which makes them unsuitable for use in real-world or uncontrolled enviro...

Full description

Bibliographic Details
Main Authors:	Andronicus A. Akinyelu, Pieter Blignaut
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2022-01-01
Series:	Frontiers in Artificial Intelligence
Subjects:	Convolutional Neural Network computer vision gaze estimation eye tracking mobile device
Online Access:	https://www.frontiersin.org/articles/10.3389/frai.2021.796825/full

_version_	1818954510113964032
author	Andronicus A. Akinyelu Pieter Blignaut
author_facet	Andronicus A. Akinyelu Pieter Blignaut
author_sort	Andronicus A. Akinyelu
collection	DOAJ
description	Eye tracking is becoming a very popular, useful, and important technology. Many eye tracking technologies are currently expensive and only available to large corporations. Some of them necessitate explicit personal calibration, which makes them unsuitable for use in real-world or uncontrolled environments. Explicit personal calibration can also be cumbersome and degrades the user experience. To address these issues, this study proposes a Convolutional Neural Network (CNN) based calibration-free technique for improved gaze estimation in unconstrained environments. The proposed technique consists of two components, namely a face component and a 39-point facial landmark component. The face component is used to extract the gaze estimation features from the eyes, while the 39-point facial landmark component is used to encode the shape and location of the eyes (within the face) into the network. Adding this information can make the network learn free-head and eye movements. Another CNN model was designed in this study primarily for the sake of comparison. The CNN model accepts only the face images as input. Different experiments were performed, and the experimental result reveals that the proposed technique outperforms the second model. Fine-tuning was also performed using the VGG16 pre-trained model. Experimental results show that the fine-tuned results of the proposed technique perform better than the fine-tuned results of the second model. Overall, the results show that 39-point facial landmarks can be used to improve the performance of CNN-based gaze estimation models.
first_indexed	2024-12-20T10:23:18Z
format	Article
id	doaj.art-523fa753a3994e06ba950cdee04207f2
institution	Directory Open Access Journal
issn	2624-8212
language	English
last_indexed	2024-12-20T10:23:18Z
publishDate	2022-01-01
publisher	Frontiers Media S.A.
record_format	Article
series	Frontiers in Artificial Intelligence
spelling	doaj.art-523fa753a3994e06ba950cdee04207f22022-12-21T19:43:53ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122022-01-01410.3389/frai.2021.796825796825Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile DevicesAndronicus A. AkinyeluPieter BlignautEye tracking is becoming a very popular, useful, and important technology. Many eye tracking technologies are currently expensive and only available to large corporations. Some of them necessitate explicit personal calibration, which makes them unsuitable for use in real-world or uncontrolled environments. Explicit personal calibration can also be cumbersome and degrades the user experience. To address these issues, this study proposes a Convolutional Neural Network (CNN) based calibration-free technique for improved gaze estimation in unconstrained environments. The proposed technique consists of two components, namely a face component and a 39-point facial landmark component. The face component is used to extract the gaze estimation features from the eyes, while the 39-point facial landmark component is used to encode the shape and location of the eyes (within the face) into the network. Adding this information can make the network learn free-head and eye movements. Another CNN model was designed in this study primarily for the sake of comparison. The CNN model accepts only the face images as input. Different experiments were performed, and the experimental result reveals that the proposed technique outperforms the second model. Fine-tuning was also performed using the VGG16 pre-trained model. Experimental results show that the fine-tuned results of the proposed technique perform better than the fine-tuned results of the second model. Overall, the results show that 39-point facial landmarks can be used to improve the performance of CNN-based gaze estimation models.https://www.frontiersin.org/articles/10.3389/frai.2021.796825/fullConvolutional Neural Networkcomputer visiongaze estimationeye trackingmobile device
spellingShingle	Andronicus A. Akinyelu Pieter Blignaut Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices Frontiers in Artificial Intelligence Convolutional Neural Network computer vision gaze estimation eye tracking mobile device
title	Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_full	Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_fullStr	Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_full_unstemmed	Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_short	Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
title_sort	convolutional neural network based technique for gaze estimation on mobile devices
topic	Convolutional Neural Network computer vision gaze estimation eye tracking mobile device
url	https://www.frontiersin.org/articles/10.3389/frai.2021.796825/full
work_keys_str_mv	AT andronicusaakinyelu convolutionalneuralnetworkbasedtechniqueforgazeestimationonmobiledevices AT pieterblignaut convolutionalneuralnetworkbasedtechniqueforgazeestimationonmobiledevices

Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices

Similar Items