Pedestrian Detection Based on Feature Enhancement in Complex Scenes

Pedestrian detection has always been a difficult and hot spot in computer vision research. At the same time, pedestrian detection technology plays an important role in many applications, such as intelligent transportation and security monitoring. In complex scenes, pedestrian detection often faces s...

Full description

Bibliographic Details
Main Authors:	Jiao Su, Yi An, Jialin Wu, Kai Zhang
Format:	Article
Language:	English
Published:	MDPI AG 2024-01-01
Series:	Algorithms
Subjects:	pedestrian detection coordinate convolution feature pyramid SE attention mechanism feature enhancement
Online Access:	https://www.mdpi.com/1999-4893/17/1/39

_version_	1797340214918643712
author	Jiao Su Yi An Jialin Wu Kai Zhang
author_facet	Jiao Su Yi An Jialin Wu Kai Zhang
author_sort	Jiao Su
collection	DOAJ
description	Pedestrian detection has always been a difficult and hot spot in computer vision research. At the same time, pedestrian detection technology plays an important role in many applications, such as intelligent transportation and security monitoring. In complex scenes, pedestrian detection often faces some challenges, such as low detection accuracy and misdetection due to small target sizes and scale variations. To solve these problems, this paper proposes a pedestrian detection network PT-YOLO based on the YOLOv5. The pedestrian detection network PT-YOLO consists of the YOLOv5 network, the squeeze-and-excitation module (SE), the weighted bi-directional feature pyramid module (BiFPN), the coordinate convolution (coordconv) module and the wise intersection over union loss function (WIoU). The SE module in the backbone allows it to focus on the important features of pedestrians and improves accuracy. The weighted BiFPN module enhances the fusion of multi-scale pedestrian features and information transfer, which can improve fusion efficiency. The prediction head design uses the WIoU loss function to reduce the regression error. The coordconv module allows the network to better perceive the location information in the feature map. The experimental results show that the pedestrian detection network PT-YOLO is more accurate compared with other target detection methods in pedestrian detection and can effectively accomplish the task of pedestrian detection in complex scenes.
first_indexed	2024-03-08T09:59:47Z
format	Article
id	doaj.art-5b7c3efa34f6453bb810e7c7dcebbb2e
institution	Directory Open Access Journal
issn	1999-4893
language	English
last_indexed	2024-03-08T09:59:47Z
publishDate	2024-01-01
publisher	MDPI AG
record_format	Article
series	Algorithms
spelling	doaj.art-5b7c3efa34f6453bb810e7c7dcebbb2e2024-01-29T13:41:35ZengMDPI AGAlgorithms1999-48932024-01-011713910.3390/a17010039Pedestrian Detection Based on Feature Enhancement in Complex ScenesJiao Su0Yi An1Jialin Wu2Kai Zhang3School of Electrical Engineering, Xinjiang University, Urumqi 830000, ChinaSchool of Electrical Engineering, Xinjiang University, Urumqi 830000, ChinaSchool of Control Science and Engineering, Dalian University of Technology, Dalian 116023, ChinaSchool of Electrical Engineering, Xinjiang University, Urumqi 830000, ChinaPedestrian detection has always been a difficult and hot spot in computer vision research. At the same time, pedestrian detection technology plays an important role in many applications, such as intelligent transportation and security monitoring. In complex scenes, pedestrian detection often faces some challenges, such as low detection accuracy and misdetection due to small target sizes and scale variations. To solve these problems, this paper proposes a pedestrian detection network PT-YOLO based on the YOLOv5. The pedestrian detection network PT-YOLO consists of the YOLOv5 network, the squeeze-and-excitation module (SE), the weighted bi-directional feature pyramid module (BiFPN), the coordinate convolution (coordconv) module and the wise intersection over union loss function (WIoU). The SE module in the backbone allows it to focus on the important features of pedestrians and improves accuracy. The weighted BiFPN module enhances the fusion of multi-scale pedestrian features and information transfer, which can improve fusion efficiency. The prediction head design uses the WIoU loss function to reduce the regression error. The coordconv module allows the network to better perceive the location information in the feature map. The experimental results show that the pedestrian detection network PT-YOLO is more accurate compared with other target detection methods in pedestrian detection and can effectively accomplish the task of pedestrian detection in complex scenes.https://www.mdpi.com/1999-4893/17/1/39pedestrian detectioncoordinate convolutionfeature pyramidSE attention mechanismfeature enhancement
spellingShingle	Jiao Su Yi An Jialin Wu Kai Zhang Pedestrian Detection Based on Feature Enhancement in Complex Scenes Algorithms pedestrian detection coordinate convolution feature pyramid SE attention mechanism feature enhancement
title	Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_full	Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_fullStr	Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_full_unstemmed	Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_short	Pedestrian Detection Based on Feature Enhancement in Complex Scenes
title_sort	pedestrian detection based on feature enhancement in complex scenes
topic	pedestrian detection coordinate convolution feature pyramid SE attention mechanism feature enhancement
url	https://www.mdpi.com/1999-4893/17/1/39
work_keys_str_mv	AT jiaosu pedestriandetectionbasedonfeatureenhancementincomplexscenes AT yian pedestriandetectionbasedonfeatureenhancementincomplexscenes AT jialinwu pedestriandetectionbasedonfeatureenhancementincomplexscenes AT kaizhang pedestriandetectionbasedonfeatureenhancementincomplexscenes

Pedestrian Detection Based on Feature Enhancement in Complex Scenes

Similar Items