YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images

In recent years, object detection based on deep learning has been widely applied and developed. When using object detection methods to process remote sensing images, the trade-off between the speed and accuracy of models is necessary, because remote sensing images pose additional difficulties such a...

Full description

Bibliographic Details
Main Authors: Tianyi Xie, Wen Han, Sheng Xu
Format: Article
Language:English
Published: MDPI AG 2023-08-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/15/15/3863
_version_ 1827730901354676224
author Tianyi Xie
Wen Han
Sheng Xu
author_facet Tianyi Xie
Wen Han
Sheng Xu
author_sort Tianyi Xie
collection DOAJ
description In recent years, object detection based on deep learning has been widely applied and developed. When using object detection methods to process remote sensing images, the trade-off between the speed and accuracy of models is necessary, because remote sensing images pose additional difficulties such as complex backgrounds, small objects, and dense distribution to the detection task. This paper proposes YOLO-RS, an optimized object detection algorithm based on YOLOv4 to address the challenges. The Adaptively Spatial Feature Fusion (ASFF) structure is introduced after the feature enhancement network of YOLOv4. It assigns adaptive weight parameters to fuse multi-scale feature information, improving detection accuracy. Furthermore, optimizations are applied to the Spatial Pyramid Pooling (SPP) structure in YOLOv4. By incorporating residual connections and employing 1 × 1 convolutions after maximum pooling, both computation complexity and detection accuracy are improved. To enhance detection speed, Lightnet is introduced, inspired by Depthwise Separable Convolution for reducing model complexity. Additionally, the loss function in YOLOv4 is optimized by introducing the Intersection over Union loss function. This change replaces the aspect ratio loss term with the edge length loss, enhancing sensitivity to width and height, accelerating model convergence, and improving regression accuracy for detected frames. The mean Average Precision (mAP) values of the YOLO-RS model are 87.73% and 92.81% under the TGRS-HRRSD dataset and RSOD dataset, respectively, which are experimentally verified to be 2.15% and 1.66% higher compared to the original YOLOv4 algorithm. The detection speed reached 43.45 FPS and 43.68 FPS, respectively, with 5.29 Frames Per Second (FPS) and 5.30 FPS improvement.
first_indexed 2024-03-11T00:17:07Z
format Article
id doaj.art-0de24b73258a461db3f0d023e06e1dc8
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-11T00:17:07Z
publishDate 2023-08-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-0de24b73258a461db3f0d023e06e1dc82023-11-18T23:31:52ZengMDPI AGRemote Sensing2072-42922023-08-011515386310.3390/rs15153863YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing ImagesTianyi Xie0Wen Han1Sheng Xu2College of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, ChinaCollege of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, ChinaCollege of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, ChinaIn recent years, object detection based on deep learning has been widely applied and developed. When using object detection methods to process remote sensing images, the trade-off between the speed and accuracy of models is necessary, because remote sensing images pose additional difficulties such as complex backgrounds, small objects, and dense distribution to the detection task. This paper proposes YOLO-RS, an optimized object detection algorithm based on YOLOv4 to address the challenges. The Adaptively Spatial Feature Fusion (ASFF) structure is introduced after the feature enhancement network of YOLOv4. It assigns adaptive weight parameters to fuse multi-scale feature information, improving detection accuracy. Furthermore, optimizations are applied to the Spatial Pyramid Pooling (SPP) structure in YOLOv4. By incorporating residual connections and employing 1 × 1 convolutions after maximum pooling, both computation complexity and detection accuracy are improved. To enhance detection speed, Lightnet is introduced, inspired by Depthwise Separable Convolution for reducing model complexity. Additionally, the loss function in YOLOv4 is optimized by introducing the Intersection over Union loss function. This change replaces the aspect ratio loss term with the edge length loss, enhancing sensitivity to width and height, accelerating model convergence, and improving regression accuracy for detected frames. The mean Average Precision (mAP) values of the YOLO-RS model are 87.73% and 92.81% under the TGRS-HRRSD dataset and RSOD dataset, respectively, which are experimentally verified to be 2.15% and 1.66% higher compared to the original YOLOv4 algorithm. The detection speed reached 43.45 FPS and 43.68 FPS, respectively, with 5.29 Frames Per Second (FPS) and 5.30 FPS improvement.https://www.mdpi.com/2072-4292/15/15/3863object detectionspeedaccuracyremote sensing imagesbalance
spellingShingle Tianyi Xie
Wen Han
Sheng Xu
YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images
Remote Sensing
object detection
speed
accuracy
remote sensing images
balance
title YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images
title_full YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images
title_fullStr YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images
title_full_unstemmed YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images
title_short YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images
title_sort yolo rs a more accurate and faster object detection method for remote sensing images
topic object detection
speed
accuracy
remote sensing images
balance
url https://www.mdpi.com/2072-4292/15/15/3863
work_keys_str_mv AT tianyixie yolorsamoreaccurateandfasterobjectdetectionmethodforremotesensingimages
AT wenhan yolorsamoreaccurateandfasterobjectdetectionmethodforremotesensingimages
AT shengxu yolorsamoreaccurateandfasterobjectdetectionmethodforremotesensingimages