Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices
Eye tracking is becoming a very popular, useful, and important technology. Many eye tracking technologies are currently expensive and only available to large corporations. Some of them necessitate explicit personal calibration, which makes them unsuitable for use in real-world or uncontrolled enviro...
Main Authors: | Andronicus A. Akinyelu, Pieter Blignaut |
---|---|
Format: | Article |
Language: | English |
Published: | Frontiers Media S.A. 2022-01-01 |
Series: | Frontiers in Artificial Intelligence |
Subjects: | Convolutional Neural Network, computer vision, gaze estimation, eye tracking, mobile device |
Online Access: | https://www.frontiersin.org/articles/10.3389/frai.2021.796825/full |
_version_ | 1818954510113964032 |
---|---|
author | Andronicus A. Akinyelu, Pieter Blignaut |
author_facet | Andronicus A. Akinyelu, Pieter Blignaut |
author_sort | Andronicus A. Akinyelu |
collection | DOAJ |
description | Eye tracking is becoming a very popular, useful, and important technology. Many eye tracking technologies are currently expensive and only available to large corporations. Some of them necessitate explicit personal calibration, which makes them unsuitable for use in real-world or uncontrolled environments. Explicit personal calibration can also be cumbersome and degrade the user experience. To address these issues, this study proposes a calibration-free technique based on a Convolutional Neural Network (CNN) for improved gaze estimation in unconstrained environments. The proposed technique consists of two components: a face component and a 39-point facial landmark component. The face component extracts the gaze estimation features from the eyes, while the 39-point facial landmark component encodes the shape and location of the eyes (within the face) into the network. Adding this information can help the network learn free-head and eye movements. A second CNN model, which accepts only face images as input, was designed primarily for comparison. Different experiments were performed, and the experimental results reveal that the proposed technique outperforms the second model. Fine-tuning was also performed using the VGG16 pre-trained model. The fine-tuned version of the proposed technique likewise performs better than the fine-tuned version of the second model. Overall, the results show that 39-point facial landmarks can be used to improve the performance of CNN-based gaze estimation models. |
first_indexed | 2024-12-20T10:23:18Z |
format | Article |
id | doaj.art-523fa753a3994e06ba950cdee04207f2 |
institution | Directory Open Access Journal |
issn | 2624-8212 |
language | English |
last_indexed | 2024-12-20T10:23:18Z |
publishDate | 2022-01-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Artificial Intelligence |
title | Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices |
topic | Convolutional Neural Network, computer vision, gaze estimation, eye tracking, mobile device |
url | https://www.frontiersin.org/articles/10.3389/frai.2021.796825/full |