SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes

Object detection in images captured by unmanned aerial vehicles (UAVs) holds great potential in various domains, including civilian applications, urban planning, and disaster response. However, it faces several challenges, such as multi-scale variations, dense scenes, complex backgrounds, and tiny-s...

Full description

Bibliographic Details
Main Authors:	Yuming Wang, Hua Zou, Ming Yin, Xining Zhang
Format:	Article
Language:	English
Published:	MDPI AG 2023-09-01
Series:	Remote Sensing
Subjects:	object detection unmanned aerial vehicles tiny objects complex scenarios multi-level feature information fusion
Online Access:	https://www.mdpi.com/2072-4292/15/18/4580

_version_	1797577198016659456
author	Yuming Wang Hua Zou Ming Yin Xining Zhang
author_facet	Yuming Wang Hua Zou Ming Yin Xining Zhang
author_sort	Yuming Wang
collection	DOAJ
description	Object detection in images captured by unmanned aerial vehicles (UAVs) holds great potential in various domains, including civilian applications, urban planning, and disaster response. However, it faces several challenges, such as multi-scale variations, dense scenes, complex backgrounds, and tiny-sized objects. In this paper, we present a novel scale-adaptive YOLO framework called SMFF-YOLO, which addresses these challenges through a multi-level feature fusion approach. To improve the detection accuracy of small objects, our framework incorporates the ELAN-SW object detection prediction head. This newly designed head effectively utilizes both global contextual information and local features, enhancing the detection accuracy of tiny objects. Additionally, the proposed bidirectional feature fusion pyramid (BFFP) module tackles the issue of scale variations in object sizes by aggregating multi-scale features. To handle complex backgrounds, we introduce the adaptive atrous spatial pyramid pooling (AASPP) module, which enables adaptive feature fusion and alleviates the negative impact of cluttered scenes. Moreover, we adopt the Wise-IoU(WIoU) bounding box regression loss to enhance the competitiveness of different quality anchor boxes, which offers the framework a more informed gradient allocation strategy. We validate the effectiveness of SMFF-YOLO using the VisDrone and UAVDT datasets. Experimental results demonstrate that our model achieves higher detection accuracy, with AP50 reaching 54.3% for VisDrone and 42.4% for UAVDT datasets. Visual comparative experiments with other YOLO-based methods further illustrate the robustness and adaptability of our approach.
first_indexed	2024-03-10T22:05:42Z
format	Article
id	doaj.art-b7dc26eaef584136bfc341b23816d031
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-10T22:05:42Z
publishDate	2023-09-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-b7dc26eaef584136bfc341b23816d0312023-11-19T12:49:44ZengMDPI AGRemote Sensing2072-42922023-09-011518458010.3390/rs15184580SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV ScenesYuming Wang0Hua Zou1Ming Yin2Xining Zhang3School of Computer Science, Wuhan University, Wuhan 430072, ChinaSchool of Computer Science, Wuhan University, Wuhan 430072, ChinaSchool of Electronic and Electrical Engineering, Wuhan Textile University, Wuhan 430077, ChinaSchool of Computer Science, Wuhan University, Wuhan 430072, ChinaObject detection in images captured by unmanned aerial vehicles (UAVs) holds great potential in various domains, including civilian applications, urban planning, and disaster response. However, it faces several challenges, such as multi-scale variations, dense scenes, complex backgrounds, and tiny-sized objects. In this paper, we present a novel scale-adaptive YOLO framework called SMFF-YOLO, which addresses these challenges through a multi-level feature fusion approach. To improve the detection accuracy of small objects, our framework incorporates the ELAN-SW object detection prediction head. This newly designed head effectively utilizes both global contextual information and local features, enhancing the detection accuracy of tiny objects. Additionally, the proposed bidirectional feature fusion pyramid (BFFP) module tackles the issue of scale variations in object sizes by aggregating multi-scale features. To handle complex backgrounds, we introduce the adaptive atrous spatial pyramid pooling (AASPP) module, which enables adaptive feature fusion and alleviates the negative impact of cluttered scenes. Moreover, we adopt the Wise-IoU(WIoU) bounding box regression loss to enhance the competitiveness of different quality anchor boxes, which offers the framework a more informed gradient allocation strategy. We validate the effectiveness of SMFF-YOLO using the VisDrone and UAVDT datasets. Experimental results demonstrate that our model achieves higher detection accuracy, with AP50 reaching 54.3% for VisDrone and 42.4% for UAVDT datasets. Visual comparative experiments with other YOLO-based methods further illustrate the robustness and adaptability of our approach.https://www.mdpi.com/2072-4292/15/18/4580object detectionunmanned aerial vehiclestiny objectscomplex scenariosmulti-level feature information fusion
spellingShingle	Yuming Wang Hua Zou Ming Yin Xining Zhang SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes Remote Sensing object detection unmanned aerial vehicles tiny objects complex scenarios multi-level feature information fusion
title	SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes
title_full	SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes
title_fullStr	SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes
title_full_unstemmed	SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes
title_short	SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes
title_sort	smff yolo a scale adaptive yolo algorithm with multi level feature fusion for object detection in uav scenes
topic	object detection unmanned aerial vehicles tiny objects complex scenarios multi-level feature information fusion
url	https://www.mdpi.com/2072-4292/15/18/4580
work_keys_str_mv	AT yumingwang smffyoloascaleadaptiveyoloalgorithmwithmultilevelfeaturefusionforobjectdetectioninuavscenes AT huazou smffyoloascaleadaptiveyoloalgorithmwithmultilevelfeaturefusionforobjectdetectioninuavscenes AT mingyin smffyoloascaleadaptiveyoloalgorithmwithmultilevelfeaturefusionforobjectdetectioninuavscenes AT xiningzhang smffyoloascaleadaptiveyoloalgorithmwithmultilevelfeaturefusionforobjectdetectioninuavscenes

SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes

Similar Items