Improved CNN-based mouth position and status detection

Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with r...

Full description

Bibliographic Details
Main Author: Chok, Yong Sheng
Format: Thesis
Language:English
Published: 2022
Subjects:
Online Access:http://eprints.utm.my/99383/1/ChokYongShengMSKE2022.pdf
_version_ 1796866746480590848
author Chok, Yong Sheng
author_facet Chok, Yong Sheng
author_sort Chok, Yong Sheng
collection ePrints
description Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with robotic arms. There are two major problems that urge the proposal of this project. First, the existing mouth status recognition networks are built and executed on high-end and costly hardware. Second, the existing CNN mouth status related detection systems are less accurate, the highest accuracy in the researched work is only 86.8% for 3 states mouth status detection. Based on the problems, there are two research objectives that are strived to be achieved. First, to develop a high-accuracy and light CNN-based model for mouth status detection on Python platform. Second, to shorten the inference time of the CNN-based model by resizing some of the convolution layers. For methodology, the primary task is to train a mouth status detection CNN model with high accuracy. The face picture datasets fed to the model during CNN model training are diverse, covering different human races and shooting angles. YOLOv5 is chosen to be the pre-trained network due to its outstanding performance. The YOLOv5 backbone convolution layers are resized to shorten the inference time and reduce the model size. The developed CNN-based model achieved the targeted performance which is 96.8%, successfully improved inference time by 21.90% and model size by 13.20% as compared to the original model before enhancement.
first_indexed 2024-03-05T21:16:48Z
format Thesis
id utm.eprints-99383
institution Universiti Teknologi Malaysia - ePrints
language English
last_indexed 2024-03-05T21:16:48Z
publishDate 2022
record_format dspace
spelling utm.eprints-993832023-02-27T03:01:27Z http://eprints.utm.my/99383/ Improved CNN-based mouth position and status detection Chok, Yong Sheng TK Electrical engineering. Electronics Nuclear engineering Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with robotic arms. There are two major problems that urge the proposal of this project. First, the existing mouth status recognition networks are built and executed on high-end and costly hardware. Second, the existing CNN mouth status related detection systems are less accurate, the highest accuracy in the researched work is only 86.8% for 3 states mouth status detection. Based on the problems, there are two research objectives that are strived to be achieved. First, to develop a high-accuracy and light CNN-based model for mouth status detection on Python platform. Second, to shorten the inference time of the CNN-based model by resizing some of the convolution layers. For methodology, the primary task is to train a mouth status detection CNN model with high accuracy. The face picture datasets fed to the model during CNN model training are diverse, covering different human races and shooting angles. YOLOv5 is chosen to be the pre-trained network due to its outstanding performance. The YOLOv5 backbone convolution layers are resized to shorten the inference time and reduce the model size. The developed CNN-based model achieved the targeted performance which is 96.8%, successfully improved inference time by 21.90% and model size by 13.20% as compared to the original model before enhancement. 2022 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/99383/1/ChokYongShengMSKE2022.pdf Chok, Yong Sheng (2022) Improved CNN-based mouth position and status detection. Masters thesis, Universiti Teknologi Malaysia, Faculty of Engineering - School of Electrical Engineering. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:149993
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Chok, Yong Sheng
Improved CNN-based mouth position and status detection
title Improved CNN-based mouth position and status detection
title_full Improved CNN-based mouth position and status detection
title_fullStr Improved CNN-based mouth position and status detection
title_full_unstemmed Improved CNN-based mouth position and status detection
title_short Improved CNN-based mouth position and status detection
title_sort improved cnn based mouth position and status detection
topic TK Electrical engineering. Electronics Nuclear engineering
url http://eprints.utm.my/99383/1/ChokYongShengMSKE2022.pdf
work_keys_str_mv AT chokyongsheng improvedcnnbasedmouthpositionandstatusdetection