Improved CNN-based mouth position and status detection

Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with r...

Full description

Bibliographic Details
Main Author:	Chok, Yong Sheng
Format:	Thesis
Language:	English
Published:	2022
Subjects:	TK Electrical engineering. Electronics Nuclear engineering
Online Access:	http://eprints.utm.my/99383/1/ChokYongShengMSKE2022.pdf

_version_	1796866746480590848
author	Chok, Yong Sheng
author_facet	Chok, Yong Sheng
author_sort	Chok, Yong Sheng
collection	ePrints
description	Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with robotic arms. There are two major problems that urge the proposal of this project. First, the existing mouth status recognition networks are built and executed on high-end and costly hardware. Second, the existing CNN mouth status related detection systems are less accurate, the highest accuracy in the researched work is only 86.8% for 3 states mouth status detection. Based on the problems, there are two research objectives that are strived to be achieved. First, to develop a high-accuracy and light CNN-based model for mouth status detection on Python platform. Second, to shorten the inference time of the CNN-based model by resizing some of the convolution layers. For methodology, the primary task is to train a mouth status detection CNN model with high accuracy. The face picture datasets fed to the model during CNN model training are diverse, covering different human races and shooting angles. YOLOv5 is chosen to be the pre-trained network due to its outstanding performance. The YOLOv5 backbone convolution layers are resized to shorten the inference time and reduce the model size. The developed CNN-based model achieved the targeted performance which is 96.8%, successfully improved inference time by 21.90% and model size by 13.20% as compared to the original model before enhancement.
first_indexed	2024-03-05T21:16:48Z
format	Thesis
id	utm.eprints-99383
institution	Universiti Teknologi Malaysia - ePrints
language	English
last_indexed	2024-03-05T21:16:48Z
publishDate	2022
record_format	dspace
spelling	utm.eprints-993832023-02-27T03:01:27Z http://eprints.utm.my/99383/ Improved CNN-based mouth position and status detection Chok, Yong Sheng TK Electrical engineering. Electronics Nuclear engineering Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with robotic arms. There are two major problems that urge the proposal of this project. First, the existing mouth status recognition networks are built and executed on high-end and costly hardware. Second, the existing CNN mouth status related detection systems are less accurate, the highest accuracy in the researched work is only 86.8% for 3 states mouth status detection. Based on the problems, there are two research objectives that are strived to be achieved. First, to develop a high-accuracy and light CNN-based model for mouth status detection on Python platform. Second, to shorten the inference time of the CNN-based model by resizing some of the convolution layers. For methodology, the primary task is to train a mouth status detection CNN model with high accuracy. The face picture datasets fed to the model during CNN model training are diverse, covering different human races and shooting angles. YOLOv5 is chosen to be the pre-trained network due to its outstanding performance. The YOLOv5 backbone convolution layers are resized to shorten the inference time and reduce the model size. The developed CNN-based model achieved the targeted performance which is 96.8%, successfully improved inference time by 21.90% and model size by 13.20% as compared to the original model before enhancement. 2022 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/99383/1/ChokYongShengMSKE2022.pdf Chok, Yong Sheng (2022) Improved CNN-based mouth position and status detection. Masters thesis, Universiti Teknologi Malaysia, Faculty of Engineering - School of Electrical Engineering. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:149993
spellingShingle	TK Electrical engineering. Electronics Nuclear engineering Chok, Yong Sheng Improved CNN-based mouth position and status detection
title	Improved CNN-based mouth position and status detection
title_full	Improved CNN-based mouth position and status detection
title_fullStr	Improved CNN-based mouth position and status detection
title_full_unstemmed	Improved CNN-based mouth position and status detection
title_short	Improved CNN-based mouth position and status detection
title_sort	improved cnn based mouth position and status detection
topic	TK Electrical engineering. Electronics Nuclear engineering
url	http://eprints.utm.my/99383/1/ChokYongShengMSKE2022.pdf
work_keys_str_mv	AT chokyongsheng improvedcnnbasedmouthpositionandstatusdetection

Improved CNN-based mouth position and status detection

Similar Items