Object Detection Network Based on Module Stack and Attention Mechanism

Currently, visual computer applications based on convolutional neural networks are rapidly developing. However, several problems remain: (1) high-quality graphics processing equipment is needed, and (2) the trained network model has several unnecessary convolution operations. These problems result i...

Full description

Bibliographic Details
Main Authors: Xinke Dou, Ting Wang, Shiliang Shao, Xianqing Cao
Format: Article
Language:English
Published: MDPI AG 2023-08-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/12/17/3542
_version_ 1797582683744763904
author Xinke Dou
Ting Wang
Shiliang Shao
Xianqing Cao
author_facet Xinke Dou
Ting Wang
Shiliang Shao
Xianqing Cao
author_sort Xinke Dou
collection DOAJ
description Currently, visual computer applications based on convolutional neural networks are rapidly developing. However, several problems remain: (1) high-quality graphics processing equipment is needed, and (2) the trained network model has several unnecessary convolution operations. These problems result in a single-stage target detection network that often requires unnecessary computing power and is difficult to apply to equipment with insufficient computing resources. To solve these problems, based on YOLOv5, a YOLOv5-L (YOLOv5 Lightweight) network structure is proposed. This network is improved using YOLOv5. First, to enhance the inference speed of the detector on the CPU, the PP-LCNet (PaddlePaddle-Lightweight CPU Net) is employed as the backbone network. Second, the focus module is removed, and the end convolution module in the head network is replaced by a deep separable convolution module, which eliminates redundant operations and reduces the amount of computation. The experimental results show that YOLOv5-L enables a 48% reduction in model parameters and computation compared to YOLOv5, a 35% increase in operation speed, and a less than 2% reduction in accuracy, which is significant in the environment of low-performance computing equipment.
first_indexed 2024-03-10T23:25:59Z
format Article
id doaj.art-a272371634334f8ebd49694c400399ac
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-10T23:25:59Z
publishDate 2023-08-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-a272371634334f8ebd49694c400399ac2023-11-19T08:00:40ZengMDPI AGElectronics2079-92922023-08-011217354210.3390/electronics12173542Object Detection Network Based on Module Stack and Attention MechanismXinke Dou0Ting Wang1Shiliang Shao2Xianqing Cao3School of Information Engineering, Shenyang University of Chemical Technology, No. 11, 11th St., Tiexi District, Shenyang 110142, ChinaState Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, No. 114, Nanta St., Shenhe District, Shenyang 110016, ChinaState Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, No. 114, Nanta St., Shenhe District, Shenyang 110016, ChinaSchool of Information Engineering, Shenyang University of Chemical Technology, No. 11, 11th St., Tiexi District, Shenyang 110142, ChinaCurrently, visual computer applications based on convolutional neural networks are rapidly developing. However, several problems remain: (1) high-quality graphics processing equipment is needed, and (2) the trained network model has several unnecessary convolution operations. These problems result in a single-stage target detection network that often requires unnecessary computing power and is difficult to apply to equipment with insufficient computing resources. To solve these problems, based on YOLOv5, a YOLOv5-L (YOLOv5 Lightweight) network structure is proposed. This network is improved using YOLOv5. First, to enhance the inference speed of the detector on the CPU, the PP-LCNet (PaddlePaddle-Lightweight CPU Net) is employed as the backbone network. Second, the focus module is removed, and the end convolution module in the head network is replaced by a deep separable convolution module, which eliminates redundant operations and reduces the amount of computation. The experimental results show that YOLOv5-L enables a 48% reduction in model parameters and computation compared to YOLOv5, a 35% increase in operation speed, and a less than 2% reduction in accuracy, which is significant in the environment of low-performance computing equipment.https://www.mdpi.com/2079-9292/12/17/3542lightweightYOLOv5PP-LCNetobject detectionattention mechanism
spellingShingle Xinke Dou
Ting Wang
Shiliang Shao
Xianqing Cao
Object Detection Network Based on Module Stack and Attention Mechanism
Electronics
lightweight
YOLOv5
PP-LCNet
object detection
attention mechanism
title Object Detection Network Based on Module Stack and Attention Mechanism
title_full Object Detection Network Based on Module Stack and Attention Mechanism
title_fullStr Object Detection Network Based on Module Stack and Attention Mechanism
title_full_unstemmed Object Detection Network Based on Module Stack and Attention Mechanism
title_short Object Detection Network Based on Module Stack and Attention Mechanism
title_sort object detection network based on module stack and attention mechanism
topic lightweight
YOLOv5
PP-LCNet
object detection
attention mechanism
url https://www.mdpi.com/2079-9292/12/17/3542
work_keys_str_mv AT xinkedou objectdetectionnetworkbasedonmodulestackandattentionmechanism
AT tingwang objectdetectionnetworkbasedonmodulestackandattentionmechanism
AT shiliangshao objectdetectionnetworkbasedonmodulestackandattentionmechanism
AT xianqingcao objectdetectionnetworkbasedonmodulestackandattentionmechanism