Machine learning for image and video summarization

With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the...

Full description

Bibliographic Details
Main Author: Liu, Liuziyi
Other Authors: Tan Yap Peng
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/136788
_version_ 1811685997863239680
author Liu, Liuziyi
author2 Tan Yap Peng
author_facet Tan Yap Peng
Liu, Liuziyi
author_sort Liu, Liuziyi
collection NTU
description With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the interaction with the digital display as the gaze is the fastest way of showing interest on a subject. Current gazing tracking systems implement various machine learning methods such as Neural Networks, Gaussian process regression, Ensemble of Regression Trees for landmark detection and head pose estimation. However, there is no robust solution as most of the systems are still subject to limitations, including unsatisfied accuracy, significant head movement, expensive geometric setups, inconsistent lighting conditions and cumbersome calibrations. In this way, there is not enough robustness for real-world applications. Besides, while most existing gaze tracking system focuses only on estimating the gaze direction, more efforts are needed for studying the gaze tracking on a digital display. This project studies gaze tracking on a digital display with a webcam camera through a machine learning approach. Different functions, including facial landmark detection, head pose estimation, gaze projection and image processing, are studied and integrated to realize the purpose of tracking gaze on the digital display. The project was to design a gaze tracking system that provides accurate performance on a digital display that is applicable for analysis of students’ behaviors during the E-learning process.
first_indexed 2024-10-01T04:53:25Z
format Final Year Project (FYP)
id ntu-10356/136788
institution Nanyang Technological University
language English
last_indexed 2024-10-01T04:53:25Z
publishDate 2020
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1367882023-07-07T16:58:07Z Machine learning for image and video summarization Liu, Liuziyi Tan Yap Peng School of Electrical and Electronic Engineering EYPTan@ntu.edu.sg Engineering::Electrical and electronic engineering With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the interaction with the digital display as the gaze is the fastest way of showing interest on a subject. Current gazing tracking systems implement various machine learning methods such as Neural Networks, Gaussian process regression, Ensemble of Regression Trees for landmark detection and head pose estimation. However, there is no robust solution as most of the systems are still subject to limitations, including unsatisfied accuracy, significant head movement, expensive geometric setups, inconsistent lighting conditions and cumbersome calibrations. In this way, there is not enough robustness for real-world applications. Besides, while most existing gaze tracking system focuses only on estimating the gaze direction, more efforts are needed for studying the gaze tracking on a digital display. This project studies gaze tracking on a digital display with a webcam camera through a machine learning approach. Different functions, including facial landmark detection, head pose estimation, gaze projection and image processing, are studied and integrated to realize the purpose of tracking gaze on the digital display. The project was to design a gaze tracking system that provides accurate performance on a digital display that is applicable for analysis of students’ behaviors during the E-learning process. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-01-28T04:36:04Z 2020-01-28T04:36:04Z 2019 Final Year Project (FYP) https://hdl.handle.net/10356/136788 en 3242-182 application/pdf Nanyang Technological University
spellingShingle Engineering::Electrical and electronic engineering
Liu, Liuziyi
Machine learning for image and video summarization
title Machine learning for image and video summarization
title_full Machine learning for image and video summarization
title_fullStr Machine learning for image and video summarization
title_full_unstemmed Machine learning for image and video summarization
title_short Machine learning for image and video summarization
title_sort machine learning for image and video summarization
topic Engineering::Electrical and electronic engineering
url https://hdl.handle.net/10356/136788
work_keys_str_mv AT liuliuziyi machinelearningforimageandvideosummarization