Object Detection Network Based on Module Stack and Attention Mechanism

Currently, visual computer applications based on convolutional neural networks are rapidly developing. However, several problems remain: (1) high-quality graphics processing equipment is needed, and (2) the trained network model has several unnecessary convolution operations. These problems result i...

Full description

Bibliographic Details
Main Authors:	Xinke Dou, Ting Wang, Shiliang Shao, Xianqing Cao
Format:	Article
Language:	English
Published:	MDPI AG 2023-08-01
Series:	Electronics
Subjects:	lightweight YOLOv5 PP-LCNet object detection attention mechanism
Online Access:	https://www.mdpi.com/2079-9292/12/17/3542

_version_	1797582683744763904
author	Xinke Dou Ting Wang Shiliang Shao Xianqing Cao
author_facet	Xinke Dou Ting Wang Shiliang Shao Xianqing Cao
author_sort	Xinke Dou
collection	DOAJ
description	Currently, visual computer applications based on convolutional neural networks are rapidly developing. However, several problems remain: (1) high-quality graphics processing equipment is needed, and (2) the trained network model has several unnecessary convolution operations. These problems result in a single-stage target detection network that often requires unnecessary computing power and is difficult to apply to equipment with insufficient computing resources. To solve these problems, based on YOLOv5, a YOLOv5-L (YOLOv5 Lightweight) network structure is proposed. This network is improved using YOLOv5. First, to enhance the inference speed of the detector on the CPU, the PP-LCNet (PaddlePaddle-Lightweight CPU Net) is employed as the backbone network. Second, the focus module is removed, and the end convolution module in the head network is replaced by a deep separable convolution module, which eliminates redundant operations and reduces the amount of computation. The experimental results show that YOLOv5-L enables a 48% reduction in model parameters and computation compared to YOLOv5, a 35% increase in operation speed, and a less than 2% reduction in accuracy, which is significant in the environment of low-performance computing equipment.
first_indexed	2024-03-10T23:25:59Z
format	Article
id	doaj.art-a272371634334f8ebd49694c400399ac
institution	Directory Open Access Journal
issn	2079-9292
language	English
last_indexed	2024-03-10T23:25:59Z
publishDate	2023-08-01
publisher	MDPI AG
record_format	Article
series	Electronics
spelling	doaj.art-a272371634334f8ebd49694c400399ac2023-11-19T08:00:40ZengMDPI AGElectronics2079-92922023-08-011217354210.3390/electronics12173542Object Detection Network Based on Module Stack and Attention MechanismXinke Dou0Ting Wang1Shiliang Shao2Xianqing Cao3School of Information Engineering, Shenyang University of Chemical Technology, No. 11, 11th St., Tiexi District, Shenyang 110142, ChinaState Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, No. 114, Nanta St., Shenhe District, Shenyang 110016, ChinaState Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, No. 114, Nanta St., Shenhe District, Shenyang 110016, ChinaSchool of Information Engineering, Shenyang University of Chemical Technology, No. 11, 11th St., Tiexi District, Shenyang 110142, ChinaCurrently, visual computer applications based on convolutional neural networks are rapidly developing. However, several problems remain: (1) high-quality graphics processing equipment is needed, and (2) the trained network model has several unnecessary convolution operations. These problems result in a single-stage target detection network that often requires unnecessary computing power and is difficult to apply to equipment with insufficient computing resources. To solve these problems, based on YOLOv5, a YOLOv5-L (YOLOv5 Lightweight) network structure is proposed. This network is improved using YOLOv5. First, to enhance the inference speed of the detector on the CPU, the PP-LCNet (PaddlePaddle-Lightweight CPU Net) is employed as the backbone network. Second, the focus module is removed, and the end convolution module in the head network is replaced by a deep separable convolution module, which eliminates redundant operations and reduces the amount of computation. The experimental results show that YOLOv5-L enables a 48% reduction in model parameters and computation compared to YOLOv5, a 35% increase in operation speed, and a less than 2% reduction in accuracy, which is significant in the environment of low-performance computing equipment.https://www.mdpi.com/2079-9292/12/17/3542lightweightYOLOv5PP-LCNetobject detectionattention mechanism
spellingShingle	Xinke Dou Ting Wang Shiliang Shao Xianqing Cao Object Detection Network Based on Module Stack and Attention Mechanism Electronics lightweight YOLOv5 PP-LCNet object detection attention mechanism
title	Object Detection Network Based on Module Stack and Attention Mechanism
title_full	Object Detection Network Based on Module Stack and Attention Mechanism
title_fullStr	Object Detection Network Based on Module Stack and Attention Mechanism
title_full_unstemmed	Object Detection Network Based on Module Stack and Attention Mechanism
title_short	Object Detection Network Based on Module Stack and Attention Mechanism
title_sort	object detection network based on module stack and attention mechanism
topic	lightweight YOLOv5 PP-LCNet object detection attention mechanism
url	https://www.mdpi.com/2079-9292/12/17/3542
work_keys_str_mv	AT xinkedou objectdetectionnetworkbasedonmodulestackandattentionmechanism AT tingwang objectdetectionnetworkbasedonmodulestackandattentionmechanism AT shiliangshao objectdetectionnetworkbasedonmodulestackandattentionmechanism AT xianqingcao objectdetectionnetworkbasedonmodulestackandattentionmechanism

Object Detection Network Based on Module Stack and Attention Mechanism

Similar Items