Pedestrian Detection Based on Feature Enhancement in Complex Scenes

Pedestrian detection has always been a difficult and hot spot in computer vision research. At the same time, pedestrian detection technology plays an important role in many applications, such as intelligent transportation and security monitoring. In complex scenes, pedestrian detection often faces s...

Full description

Bibliographic Details
Main Authors: Jiao Su, Yi An, Jialin Wu, Kai Zhang
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/17/1/39
_version_ 1797340214918643712
author Jiao Su
Yi An
Jialin Wu
Kai Zhang
author_facet Jiao Su
Yi An
Jialin Wu
Kai Zhang
author_sort Jiao Su
collection DOAJ
description Pedestrian detection has always been a difficult and hot spot in computer vision research. At the same time, pedestrian detection technology plays an important role in many applications, such as intelligent transportation and security monitoring. In complex scenes, pedestrian detection often faces some challenges, such as low detection accuracy and misdetection due to small target sizes and scale variations. To solve these problems, this paper proposes a pedestrian detection network PT-YOLO based on the YOLOv5. The pedestrian detection network PT-YOLO consists of the YOLOv5 network, the squeeze-and-excitation module (SE), the weighted bi-directional feature pyramid module (BiFPN), the coordinate convolution (coordconv) module and the wise intersection over union loss function (WIoU). The SE module in the backbone allows it to focus on the important features of pedestrians and improves accuracy. The weighted BiFPN module enhances the fusion of multi-scale pedestrian features and information transfer, which can improve fusion efficiency. The prediction head design uses the WIoU loss function to reduce the regression error. The coordconv module allows the network to better perceive the location information in the feature map. The experimental results show that the pedestrian detection network PT-YOLO is more accurate compared with other target detection methods in pedestrian detection and can effectively accomplish the task of pedestrian detection in complex scenes.
first_indexed 2024-03-08T09:59:47Z
format Article
id doaj.art-5b7c3efa34f6453bb810e7c7dcebbb2e
institution Directory Open Access Journal
issn 1999-4893
language English
last_indexed 2024-03-08T09:59:47Z
publishDate 2024-01-01
publisher MDPI AG
record_format Article
series Algorithms
spelling doaj.art-5b7c3efa34f6453bb810e7c7dcebbb2e2024-01-29T13:41:35ZengMDPI AGAlgorithms1999-48932024-01-011713910.3390/a17010039Pedestrian Detection Based on Feature Enhancement in Complex ScenesJiao Su0Yi An1Jialin Wu2Kai Zhang3School of Electrical Engineering, Xinjiang University, Urumqi 830000, ChinaSchool of Electrical Engineering, Xinjiang University, Urumqi 830000, ChinaSchool of Control Science and Engineering, Dalian University of Technology, Dalian 116023, ChinaSchool of Electrical Engineering, Xinjiang University, Urumqi 830000, ChinaPedestrian detection has always been a difficult and hot spot in computer vision research. At the same time, pedestrian detection technology plays an important role in many applications, such as intelligent transportation and security monitoring. In complex scenes, pedestrian detection often faces some challenges, such as low detection accuracy and misdetection due to small target sizes and scale variations. To solve these problems, this paper proposes a pedestrian detection network PT-YOLO based on the YOLOv5. The pedestrian detection network PT-YOLO consists of the YOLOv5 network, the squeeze-and-excitation module (SE), the weighted bi-directional feature pyramid module (BiFPN), the coordinate convolution (coordconv) module and the wise intersection over union loss function (WIoU). The SE module in the backbone allows it to focus on the important features of pedestrians and improves accuracy. The weighted BiFPN module enhances the fusion of multi-scale pedestrian features and information transfer, which can improve fusion efficiency. The prediction head design uses the WIoU loss function to reduce the regression error. The coordconv module allows the network to better perceive the location information in the feature map. The experimental results show that the pedestrian detection network PT-YOLO is more accurate compared with other target detection methods in pedestrian detection and can effectively accomplish the task of pedestrian detection in complex scenes.https://www.mdpi.com/1999-4893/17/1/39pedestrian detectioncoordinate convolutionfeature pyramidSE attention mechanismfeature enhancement
spellingShingle Jiao Su
Yi An
Jialin Wu
Kai Zhang
Pedestrian Detection Based on Feature Enhancement in Complex Scenes
Algorithms
pedestrian detection
coordinate convolution
feature pyramid
SE attention mechanism
feature enhancement
title Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_full Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_fullStr Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_full_unstemmed Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_short Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_sort pedestrian detection based on feature enhancement in complex scenes
topic pedestrian detection
coordinate convolution
feature pyramid
SE attention mechanism
feature enhancement
url https://www.mdpi.com/1999-4893/17/1/39
work_keys_str_mv AT jiaosu pedestriandetectionbasedonfeatureenhancementincomplexscenes
AT yian pedestriandetectionbasedonfeatureenhancementincomplexscenes
AT jialinwu pedestriandetectionbasedonfeatureenhancementincomplexscenes
AT kaizhang pedestriandetectionbasedonfeatureenhancementincomplexscenes