A lightweight method for face expression recognition based on improved MobileNetV3

Abstract Facial expression recognition plays a significant role in the application of man–machine interaction. However, existing models typically have shortcomings with numerous parameters, large model sizes, and high computational costs, which are difficult to deploy in resource‐constrained devices...

Full description

Bibliographic Details
Main Authors:	Xunru Liang, Jianfeng Liang, Tao Yin, Xiaoyu Tang
Format:	Article
Language:	English
Published:	Wiley 2023-06-01
Series:	IET Image Processing
Subjects:	computer vision emotion recognition image classification image recognition
Online Access:	https://doi.org/10.1049/ipr2.12798

_version_	1797813784327225344
author	Xunru Liang Jianfeng Liang Tao Yin Xiaoyu Tang
author_facet	Xunru Liang Jianfeng Liang Tao Yin Xiaoyu Tang
author_sort	Xunru Liang
collection	DOAJ
description	Abstract Facial expression recognition plays a significant role in the application of man–machine interaction. However, existing models typically have shortcomings with numerous parameters, large model sizes, and high computational costs, which are difficult to deploy in resource‐constrained devices. This paper proposes a lightweight network based on improved MobileNetV3 to mitigate these disadvantages. Firstly, we adjust the channels in the high‐level network to reduce the number of parameters and model size, and then, the coordinate attention mechanism is introduced to the network, which enhances the attention of the network with few parameters and low computing cost. Furthermore, a complementary pooling structure is designed to improve the coordinate attention mechanism, which enables it to assist the network in extracting salient features sufficiently. In addition, the network with the joint loss consisting of the softmax loss and centre loss is trained, which can minimize the intra‐class gap and improve the classification performance. Finally, the network is trained and tested on public datasets FERPlus and RAF‐DB, with the best accuracy of 87.5% and 86.6%, respectively. The FLOPs, parameters, and the memory storage size are only 0.19GMac, 1.3 M, and 15.9 MB, respectively, which is lighter than most state‐of‐the‐art networks. Code is available at https://github.com/RIS‐LAB1/FER‐mobilenet.
first_indexed	2024-03-13T07:57:54Z
format	Article
id	doaj.art-38df37a8176e47e294d79776f3506e19
institution	Directory Open Access Journal
issn	1751-9659 1751-9667
language	English
last_indexed	2024-03-13T07:57:54Z
publishDate	2023-06-01
publisher	Wiley
record_format	Article
series	IET Image Processing
spelling	doaj.art-38df37a8176e47e294d79776f3506e192023-06-02T03:06:38ZengWileyIET Image Processing1751-96591751-96672023-06-011782375238410.1049/ipr2.12798A lightweight method for face expression recognition based on improved MobileNetV3Xunru Liang0Jianfeng Liang1Tao Yin2Xiaoyu Tang3School of Physics and Telecommunication Engineering South China Normal University Guangzhou ChinaSchool of Physics and Telecommunication Engineering South China Normal University Guangzhou ChinaSchool of Physics and Telecommunication Engineering South China Normal University Guangzhou ChinaSchool of Physics and Telecommunication Engineering South China Normal University Guangzhou ChinaAbstract Facial expression recognition plays a significant role in the application of man–machine interaction. However, existing models typically have shortcomings with numerous parameters, large model sizes, and high computational costs, which are difficult to deploy in resource‐constrained devices. This paper proposes a lightweight network based on improved MobileNetV3 to mitigate these disadvantages. Firstly, we adjust the channels in the high‐level network to reduce the number of parameters and model size, and then, the coordinate attention mechanism is introduced to the network, which enhances the attention of the network with few parameters and low computing cost. Furthermore, a complementary pooling structure is designed to improve the coordinate attention mechanism, which enables it to assist the network in extracting salient features sufficiently. In addition, the network with the joint loss consisting of the softmax loss and centre loss is trained, which can minimize the intra‐class gap and improve the classification performance. Finally, the network is trained and tested on public datasets FERPlus and RAF‐DB, with the best accuracy of 87.5% and 86.6%, respectively. The FLOPs, parameters, and the memory storage size are only 0.19GMac, 1.3 M, and 15.9 MB, respectively, which is lighter than most state‐of‐the‐art networks. Code is available at https://github.com/RIS‐LAB1/FER‐mobilenet.https://doi.org/10.1049/ipr2.12798computer visionemotion recognitionimage classificationimage recognition
spellingShingle	Xunru Liang Jianfeng Liang Tao Yin Xiaoyu Tang A lightweight method for face expression recognition based on improved MobileNetV3 IET Image Processing computer vision emotion recognition image classification image recognition
title	A lightweight method for face expression recognition based on improved MobileNetV3
title_full	A lightweight method for face expression recognition based on improved MobileNetV3
title_fullStr	A lightweight method for face expression recognition based on improved MobileNetV3
title_full_unstemmed	A lightweight method for face expression recognition based on improved MobileNetV3
title_short	A lightweight method for face expression recognition based on improved MobileNetV3
title_sort	lightweight method for face expression recognition based on improved mobilenetv3
topic	computer vision emotion recognition image classification image recognition
url	https://doi.org/10.1049/ipr2.12798
work_keys_str_mv	AT xunruliang alightweightmethodforfaceexpressionrecognitionbasedonimprovedmobilenetv3 AT jianfengliang alightweightmethodforfaceexpressionrecognitionbasedonimprovedmobilenetv3 AT taoyin alightweightmethodforfaceexpressionrecognitionbasedonimprovedmobilenetv3 AT xiaoyutang alightweightmethodforfaceexpressionrecognitionbasedonimprovedmobilenetv3 AT xunruliang lightweightmethodforfaceexpressionrecognitionbasedonimprovedmobilenetv3 AT jianfengliang lightweightmethodforfaceexpressionrecognitionbasedonimprovedmobilenetv3 AT taoyin lightweightmethodforfaceexpressionrecognitionbasedonimprovedmobilenetv3 AT xiaoyutang lightweightmethodforfaceexpressionrecognitionbasedonimprovedmobilenetv3

A lightweight method for face expression recognition based on improved MobileNetV3

Similar Items