ENHANCING PEDESTRIAN TARGET RECOGNITION IN OPEN COMMUNITY MULTI-SCENE SPACES USING THE YOLO-STP NETWORK

Addressing the challenge of quantitatively analyzing and presenting pedestrian elements within open community spaces is of significant importance. Focusing on the indoor scene spaces of open communities, this study introduces the TJ-Person pedestrian target recognition image dataset. Furthermore, we...

Full description

Bibliographic Details
Main Authors: C. Liu, Y. Li, J. Gu, Y. Lou, T. Shen
Format: Article
Language:English
Published: Copernicus Publications 2023-12-01
Series:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:https://isprs-annals.copernicus.org/articles/X-1-W1-2023/319/2023/isprs-annals-X-1-W1-2023-319-2023.pdf
_version_ 1797403708152086528
author C. Liu
Y. Li
J. Gu
Y. Lou
T. Shen
author_facet C. Liu
Y. Li
J. Gu
Y. Lou
T. Shen
author_sort C. Liu
collection DOAJ
description Addressing the challenge of quantitatively analyzing and presenting pedestrian elements within open community spaces is of significant importance. Focusing on the indoor scene spaces of open communities, this study introduces the TJ-Person pedestrian target recognition image dataset. Furthermore, we design a deep learning-based community pedestrian activity analysis network model and incorporate various attention mechanisms, such as SA, CA, CBAM, SE, and SK, into the YOLO v5s deep learning target recognition network framework for comparative evaluation of pedestrian target recognition in open communities. Utilizing the optimized YOLO Swin Transformer Person (YOLO-STP) network, precise identification of pedestrian targets across multiple scenarios was achieved. We conducted experimental verification using four typical scenarios within Shanghai's NICE2035 open community as case studies. The results demonstrated that the proposed YOLO-STP community pedestrian activity analysis network model achieved an optimal detection accuracy of up to 98.47%. In all four tested scenarios, the YOLO-STP method consistently exhibited competitive performance. Moreover, in the COCO-2017 open-source dataset testing, the YOLO-STP method outperformed other networks of the same type, showcasing its significant advantages. Overall, the research presented in this study provides a crucial technical foundation for the analysis and recognition of pedestrian targets in future community scenarios.
first_indexed 2024-03-09T02:43:19Z
format Article
id doaj.art-6e5f970fe2e5480198f4989d28b9b1d4
institution Directory Open Access Journal
issn 2194-9042
2194-9050
language English
last_indexed 2024-03-09T02:43:19Z
publishDate 2023-12-01
publisher Copernicus Publications
record_format Article
series ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
spelling doaj.art-6e5f970fe2e5480198f4989d28b9b1d42023-12-06T00:19:51ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502023-12-01X-1-W1-202331932810.5194/isprs-annals-X-1-W1-2023-319-2023ENHANCING PEDESTRIAN TARGET RECOGNITION IN OPEN COMMUNITY MULTI-SCENE SPACES USING THE YOLO-STP NETWORKC. Liu0Y. Li1J. Gu2Y. Lou3T. Shen4College of Surveying and Geo-informatics, Tongji University, Shanghai, ChinaCollege of Surveying and Geo-informatics, Tongji University, Shanghai, ChinaCollege of Surveying and Geo-informatics, Tongji University, Shanghai, ChinaCollege of Design and Innovation, Tongji University, Shanghai, ChinaCollege of Design and Innovation, Tongji University, Shanghai, ChinaAddressing the challenge of quantitatively analyzing and presenting pedestrian elements within open community spaces is of significant importance. Focusing on the indoor scene spaces of open communities, this study introduces the TJ-Person pedestrian target recognition image dataset. Furthermore, we design a deep learning-based community pedestrian activity analysis network model and incorporate various attention mechanisms, such as SA, CA, CBAM, SE, and SK, into the YOLO v5s deep learning target recognition network framework for comparative evaluation of pedestrian target recognition in open communities. Utilizing the optimized YOLO Swin Transformer Person (YOLO-STP) network, precise identification of pedestrian targets across multiple scenarios was achieved. We conducted experimental verification using four typical scenarios within Shanghai's NICE2035 open community as case studies. The results demonstrated that the proposed YOLO-STP community pedestrian activity analysis network model achieved an optimal detection accuracy of up to 98.47%. In all four tested scenarios, the YOLO-STP method consistently exhibited competitive performance. Moreover, in the COCO-2017 open-source dataset testing, the YOLO-STP method outperformed other networks of the same type, showcasing its significant advantages. Overall, the research presented in this study provides a crucial technical foundation for the analysis and recognition of pedestrian targets in future community scenarios.https://isprs-annals.copernicus.org/articles/X-1-W1-2023/319/2023/isprs-annals-X-1-W1-2023-319-2023.pdf
spellingShingle C. Liu
Y. Li
J. Gu
Y. Lou
T. Shen
ENHANCING PEDESTRIAN TARGET RECOGNITION IN OPEN COMMUNITY MULTI-SCENE SPACES USING THE YOLO-STP NETWORK
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
title ENHANCING PEDESTRIAN TARGET RECOGNITION IN OPEN COMMUNITY MULTI-SCENE SPACES USING THE YOLO-STP NETWORK
title_full ENHANCING PEDESTRIAN TARGET RECOGNITION IN OPEN COMMUNITY MULTI-SCENE SPACES USING THE YOLO-STP NETWORK
title_fullStr ENHANCING PEDESTRIAN TARGET RECOGNITION IN OPEN COMMUNITY MULTI-SCENE SPACES USING THE YOLO-STP NETWORK
title_full_unstemmed ENHANCING PEDESTRIAN TARGET RECOGNITION IN OPEN COMMUNITY MULTI-SCENE SPACES USING THE YOLO-STP NETWORK
title_short ENHANCING PEDESTRIAN TARGET RECOGNITION IN OPEN COMMUNITY MULTI-SCENE SPACES USING THE YOLO-STP NETWORK
title_sort enhancing pedestrian target recognition in open community multi scene spaces using the yolo stp network
url https://isprs-annals.copernicus.org/articles/X-1-W1-2023/319/2023/isprs-annals-X-1-W1-2023-319-2023.pdf
work_keys_str_mv AT cliu enhancingpedestriantargetrecognitioninopencommunitymultiscenespacesusingtheyolostpnetwork
AT yli enhancingpedestriantargetrecognitioninopencommunitymultiscenespacesusingtheyolostpnetwork
AT jgu enhancingpedestriantargetrecognitioninopencommunitymultiscenespacesusingtheyolostpnetwork
AT ylou enhancingpedestriantargetrecognitioninopencommunitymultiscenespacesusingtheyolostpnetwork
AT tshen enhancingpedestriantargetrecognitioninopencommunitymultiscenespacesusingtheyolostpnetwork