SDMSEAF-YOLOv8: a framework to significantly improve the detection performance of unmanned aerial vehicle images

The detailed, high-resolution images captured by drones pose challenges to target detection algorithms with complex scenes and small-sized targets. Moreover, targets in unmanned aerial vehicle images are usually affected by factors such as viewing perspective, occlusion, and light, which increase th...

Full description

Bibliographic Details
Main Authors: Linxuan Li, Xiaoyu Liu, Xuan Chen, Fengjuan Yin, Bin Chen, Yufeng Wang, Fanbin Meng
Format: Article
Language:English
Published: Taylor & Francis Group 2024-01-01
Series:Geocarto International
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/10106049.2024.2339294
_version_ 1826938997023178752
author Linxuan Li
Xiaoyu Liu
Xuan Chen
Fengjuan Yin
Bin Chen
Yufeng Wang
Fanbin Meng
author_facet Linxuan Li
Xiaoyu Liu
Xuan Chen
Fengjuan Yin
Bin Chen
Yufeng Wang
Fanbin Meng
author_sort Linxuan Li
collection DOAJ
description The detailed, high-resolution images captured by drones pose challenges to target detection algorithms with complex scenes and small-sized targets. Moreover, targets in unmanned aerial vehicle images are usually affected by factors such as viewing perspective, occlusion, and light, which increase the difficulty of target detection. In response to the above issues, we propose an improved SDMSEAF-YOLOv8 for target detection based on YOLOv8, combined with a Bi-directional Feature Pyramid Network, to improve the sensing ability of the model for multiscale targets. A Space-to-depth layer replaces the traditional strided convolution layer to enhance the extraction of fine-grained information and small-sized target features. A Multi-Separated and Enhancement Attention module enhances the feature learning ability of the occluded target region, thus reducing missed and false detections. Four detection heads are employed for tiny target detection, each responsible for different size ranges, so as to improve the accuracy and robustness of small target detection. The conventional non-maximum suppression algorithm is improved, so as to reduce the problem of missed detections under a densely occluded scene by setting the attenuation function to adjust the confidence of the treated box based on the overlap between it and the highest-scoring box. Experiments demonstrate that the accuracy of SDMSEAF-YOLOv8 exceeds that of state-of-the-art models on the VisDrone2019-DET-val dataset, with a mAP of 42.9% at 640-pixel resolution, 14.8% over the baseline YOLOv8-x algorithm model, and 6.0% over the known state-of-the-art Fine-Grained Target Focusing Network model and with twice as fast detection.
first_indexed 2024-04-24T11:24:37Z
format Article
id doaj.art-7cca6b427c4e44e8aed0ff7ab7a2cf0e
institution Directory Open Access Journal
issn 1010-6049
1752-0762
language English
last_indexed 2025-02-17T19:07:21Z
publishDate 2024-01-01
publisher Taylor & Francis Group
record_format Article
series Geocarto International
spelling doaj.art-7cca6b427c4e44e8aed0ff7ab7a2cf0e2024-12-10T08:23:09ZengTaylor & Francis GroupGeocarto International1010-60491752-07622024-01-0139110.1080/10106049.2024.2339294SDMSEAF-YOLOv8: a framework to significantly improve the detection performance of unmanned aerial vehicle imagesLinxuan Li0Xiaoyu Liu1Xuan Chen2Fengjuan Yin3Bin Chen4Yufeng Wang5Fanbin Meng6School of Medical Information Engineering, Jining Medical University, Rizhao, ChinaSchool of Medical Information Engineering, Jining Medical University, Rizhao, ChinaSchool of Medical Information Engineering, Jining Medical University, Rizhao, ChinaSchool of Medical Information Engineering, Jining Medical University, Rizhao, ChinaSchool of Mechanical and Electronic Engineering, Shandong Agriculture and Engineering University, Zibo, ChinaSchool of Medical Information Engineering, Jining Medical University, Rizhao, ChinaSchool of Medical Information Engineering, Jining Medical University, Rizhao, ChinaThe detailed, high-resolution images captured by drones pose challenges to target detection algorithms with complex scenes and small-sized targets. Moreover, targets in unmanned aerial vehicle images are usually affected by factors such as viewing perspective, occlusion, and light, which increase the difficulty of target detection. In response to the above issues, we propose an improved SDMSEAF-YOLOv8 for target detection based on YOLOv8, combined with a Bi-directional Feature Pyramid Network, to improve the sensing ability of the model for multiscale targets. A Space-to-depth layer replaces the traditional strided convolution layer to enhance the extraction of fine-grained information and small-sized target features. A Multi-Separated and Enhancement Attention module enhances the feature learning ability of the occluded target region, thus reducing missed and false detections. Four detection heads are employed for tiny target detection, each responsible for different size ranges, so as to improve the accuracy and robustness of small target detection. The conventional non-maximum suppression algorithm is improved, so as to reduce the problem of missed detections under a densely occluded scene by setting the attenuation function to adjust the confidence of the treated box based on the overlap between it and the highest-scoring box. Experiments demonstrate that the accuracy of SDMSEAF-YOLOv8 exceeds that of state-of-the-art models on the VisDrone2019-DET-val dataset, with a mAP of 42.9% at 640-pixel resolution, 14.8% over the baseline YOLOv8-x algorithm model, and 6.0% over the known state-of-the-art Fine-Grained Target Focusing Network model and with twice as fast detection.https://www.tandfonline.com/doi/10.1080/10106049.2024.2339294SDMSEAF-YOLOv8VisDrone2019SPDBiFPNMultiSEAM
spellingShingle Linxuan Li
Xiaoyu Liu
Xuan Chen
Fengjuan Yin
Bin Chen
Yufeng Wang
Fanbin Meng
SDMSEAF-YOLOv8: a framework to significantly improve the detection performance of unmanned aerial vehicle images
Geocarto International
SDMSEAF-YOLOv8
VisDrone2019
SPD
BiFPN
MultiSEAM
title SDMSEAF-YOLOv8: a framework to significantly improve the detection performance of unmanned aerial vehicle images
title_full SDMSEAF-YOLOv8: a framework to significantly improve the detection performance of unmanned aerial vehicle images
title_fullStr SDMSEAF-YOLOv8: a framework to significantly improve the detection performance of unmanned aerial vehicle images
title_full_unstemmed SDMSEAF-YOLOv8: a framework to significantly improve the detection performance of unmanned aerial vehicle images
title_short SDMSEAF-YOLOv8: a framework to significantly improve the detection performance of unmanned aerial vehicle images
title_sort sdmseaf yolov8 a framework to significantly improve the detection performance of unmanned aerial vehicle images
topic SDMSEAF-YOLOv8
VisDrone2019
SPD
BiFPN
MultiSEAM
url https://www.tandfonline.com/doi/10.1080/10106049.2024.2339294
work_keys_str_mv AT linxuanli sdmseafyolov8aframeworktosignificantlyimprovethedetectionperformanceofunmannedaerialvehicleimages
AT xiaoyuliu sdmseafyolov8aframeworktosignificantlyimprovethedetectionperformanceofunmannedaerialvehicleimages
AT xuanchen sdmseafyolov8aframeworktosignificantlyimprovethedetectionperformanceofunmannedaerialvehicleimages
AT fengjuanyin sdmseafyolov8aframeworktosignificantlyimprovethedetectionperformanceofunmannedaerialvehicleimages
AT binchen sdmseafyolov8aframeworktosignificantlyimprovethedetectionperformanceofunmannedaerialvehicleimages
AT yufengwang sdmseafyolov8aframeworktosignificantlyimprovethedetectionperformanceofunmannedaerialvehicleimages
AT fanbinmeng sdmseafyolov8aframeworktosignificantlyimprovethedetectionperformanceofunmannedaerialvehicleimages