YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images
In recent years, object detection based on deep learning has been widely applied and developed. When using object detection methods to process remote sensing images, the trade-off between the speed and accuracy of models is necessary, because remote sensing images pose additional difficulties such a...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-08-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/15/15/3863 |
_version_ | 1827730901354676224 |
---|---|
author | Tianyi Xie Wen Han Sheng Xu |
author_facet | Tianyi Xie Wen Han Sheng Xu |
author_sort | Tianyi Xie |
collection | DOAJ |
description | In recent years, object detection based on deep learning has been widely applied and developed. When using object detection methods to process remote sensing images, the trade-off between the speed and accuracy of models is necessary, because remote sensing images pose additional difficulties such as complex backgrounds, small objects, and dense distribution to the detection task. This paper proposes YOLO-RS, an optimized object detection algorithm based on YOLOv4 to address the challenges. The Adaptively Spatial Feature Fusion (ASFF) structure is introduced after the feature enhancement network of YOLOv4. It assigns adaptive weight parameters to fuse multi-scale feature information, improving detection accuracy. Furthermore, optimizations are applied to the Spatial Pyramid Pooling (SPP) structure in YOLOv4. By incorporating residual connections and employing 1 × 1 convolutions after maximum pooling, both computation complexity and detection accuracy are improved. To enhance detection speed, Lightnet is introduced, inspired by Depthwise Separable Convolution for reducing model complexity. Additionally, the loss function in YOLOv4 is optimized by introducing the Intersection over Union loss function. This change replaces the aspect ratio loss term with the edge length loss, enhancing sensitivity to width and height, accelerating model convergence, and improving regression accuracy for detected frames. The mean Average Precision (mAP) values of the YOLO-RS model are 87.73% and 92.81% under the TGRS-HRRSD dataset and RSOD dataset, respectively, which are experimentally verified to be 2.15% and 1.66% higher compared to the original YOLOv4 algorithm. The detection speed reached 43.45 FPS and 43.68 FPS, respectively, with 5.29 Frames Per Second (FPS) and 5.30 FPS improvement. |
first_indexed | 2024-03-11T00:17:07Z |
format | Article |
id | doaj.art-0de24b73258a461db3f0d023e06e1dc8 |
institution | Directory Open Access Journal |
issn | 2072-4292 |
language | English |
last_indexed | 2024-03-11T00:17:07Z |
publishDate | 2023-08-01 |
publisher | MDPI AG |
record_format | Article |
series | Remote Sensing |
spelling | doaj.art-0de24b73258a461db3f0d023e06e1dc82023-11-18T23:31:52ZengMDPI AGRemote Sensing2072-42922023-08-011515386310.3390/rs15153863YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing ImagesTianyi Xie0Wen Han1Sheng Xu2College of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, ChinaCollege of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, ChinaCollege of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, ChinaIn recent years, object detection based on deep learning has been widely applied and developed. When using object detection methods to process remote sensing images, the trade-off between the speed and accuracy of models is necessary, because remote sensing images pose additional difficulties such as complex backgrounds, small objects, and dense distribution to the detection task. This paper proposes YOLO-RS, an optimized object detection algorithm based on YOLOv4 to address the challenges. The Adaptively Spatial Feature Fusion (ASFF) structure is introduced after the feature enhancement network of YOLOv4. It assigns adaptive weight parameters to fuse multi-scale feature information, improving detection accuracy. Furthermore, optimizations are applied to the Spatial Pyramid Pooling (SPP) structure in YOLOv4. By incorporating residual connections and employing 1 × 1 convolutions after maximum pooling, both computation complexity and detection accuracy are improved. To enhance detection speed, Lightnet is introduced, inspired by Depthwise Separable Convolution for reducing model complexity. Additionally, the loss function in YOLOv4 is optimized by introducing the Intersection over Union loss function. This change replaces the aspect ratio loss term with the edge length loss, enhancing sensitivity to width and height, accelerating model convergence, and improving regression accuracy for detected frames. The mean Average Precision (mAP) values of the YOLO-RS model are 87.73% and 92.81% under the TGRS-HRRSD dataset and RSOD dataset, respectively, which are experimentally verified to be 2.15% and 1.66% higher compared to the original YOLOv4 algorithm. The detection speed reached 43.45 FPS and 43.68 FPS, respectively, with 5.29 Frames Per Second (FPS) and 5.30 FPS improvement.https://www.mdpi.com/2072-4292/15/15/3863object detectionspeedaccuracyremote sensing imagesbalance |
spellingShingle | Tianyi Xie Wen Han Sheng Xu YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images Remote Sensing object detection speed accuracy remote sensing images balance |
title | YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images |
title_full | YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images |
title_fullStr | YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images |
title_full_unstemmed | YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images |
title_short | YOLO-RS: A More Accurate and Faster Object Detection Method for Remote Sensing Images |
title_sort | yolo rs a more accurate and faster object detection method for remote sensing images |
topic | object detection speed accuracy remote sensing images balance |
url | https://www.mdpi.com/2072-4292/15/15/3863 |
work_keys_str_mv | AT tianyixie yolorsamoreaccurateandfasterobjectdetectionmethodforremotesensingimages AT wenhan yolorsamoreaccurateandfasterobjectdetectionmethodforremotesensingimages AT shengxu yolorsamoreaccurateandfasterobjectdetectionmethodforremotesensingimages |