AF-SSD: An Accurate and Fast Single Shot Detector for High Spatial Remote Sensing Imagery

There are a large number of studies on geospatial object detection. However, many existing methods only focus on either accuracy or speed. Methods with both fast speed and high accuracy are of great importance in some scenes, like search and rescue, and military information acquisition. In remote se...

Full description

Bibliographic Details
Main Authors: Ruihong Yin, Wei Zhao, Xudong Fan, Yongfeng Yin
Format: Article
Language:English
Published: MDPI AG 2020-11-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/20/22/6530
_version_ 1797547863727669248
author Ruihong Yin
Wei Zhao
Xudong Fan
Yongfeng Yin
author_facet Ruihong Yin
Wei Zhao
Xudong Fan
Yongfeng Yin
author_sort Ruihong Yin
collection DOAJ
description There are a large number of studies on geospatial object detection. However, many existing methods only focus on either accuracy or speed. Methods with both fast speed and high accuracy are of great importance in some scenes, like search and rescue, and military information acquisition. In remote sensing images, there are some targets that are small and have few textures and low contrast compared with the background, which impose challenges on object detection. In this paper, we propose an accurate and fast single shot detector (AF-SSD) for high spatial remote sensing imagery to solve these problems. Firstly, we design a lightweight backbone to reduce the number of trainable parameters of the network. In this lightweight backbone, we also use some wide and deep convolutional blocks to extract more semantic information and keep the high detection precision. Secondly, a novel encoding–decoding module is employed to detect small targets accurately. With up-sampling and summation operations, the encoding–decoding module can add strong high-level semantic information to low-level features. Thirdly, we design a cascade structure with spatial and channel attention modules for targets with low contrast (named low-contrast targets) and few textures (named few-texture targets). The spatial attention module can extract long-range features for few-texture targets. By weighting each channel of a feature map, the channel attention module can guide the network to concentrate on easily identifiable features for low-contrast and few-texture targets. The experimental results on the NWPU VHR-10 dataset show that our proposed AF-SSD achieves superior detection performance: parameters 5.7 M, mAP 88.7%, and 0.035 s per image on average on an NVIDIA GTX-1080Ti GPU.
first_indexed 2024-03-10T14:51:22Z
format Article
id doaj.art-f31d35c796f548cabe100579ce102a0b
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T14:51:22Z
publishDate 2020-11-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-f31d35c796f548cabe100579ce102a0b2023-11-20T21:02:03ZengMDPI AGSensors1424-82202020-11-012022653010.3390/s20226530AF-SSD: An Accurate and Fast Single Shot Detector for High Spatial Remote Sensing ImageryRuihong Yin0Wei Zhao1Xudong Fan2Yongfeng Yin3School of Electronic and Information Engineering, Beihang University, Beijing 100191, ChinaSchool of Electronic and Information Engineering, Beihang University, Beijing 100191, ChinaSchool of Electronic and Information Engineering, Beihang University, Beijing 100191, ChinaSchool of Reliability and Systems Engineering, Beihang University, Beijing 100191, ChinaThere are a large number of studies on geospatial object detection. However, many existing methods only focus on either accuracy or speed. Methods with both fast speed and high accuracy are of great importance in some scenes, like search and rescue, and military information acquisition. In remote sensing images, there are some targets that are small and have few textures and low contrast compared with the background, which impose challenges on object detection. In this paper, we propose an accurate and fast single shot detector (AF-SSD) for high spatial remote sensing imagery to solve these problems. Firstly, we design a lightweight backbone to reduce the number of trainable parameters of the network. In this lightweight backbone, we also use some wide and deep convolutional blocks to extract more semantic information and keep the high detection precision. Secondly, a novel encoding–decoding module is employed to detect small targets accurately. With up-sampling and summation operations, the encoding–decoding module can add strong high-level semantic information to low-level features. Thirdly, we design a cascade structure with spatial and channel attention modules for targets with low contrast (named low-contrast targets) and few textures (named few-texture targets). The spatial attention module can extract long-range features for few-texture targets. By weighting each channel of a feature map, the channel attention module can guide the network to concentrate on easily identifiable features for low-contrast and few-texture targets. The experimental results on the NWPU VHR-10 dataset show that our proposed AF-SSD achieves superior detection performance: parameters 5.7 M, mAP 88.7%, and 0.035 s per image on average on an NVIDIA GTX-1080Ti GPU.https://www.mdpi.com/1424-8220/20/22/6530geospatial object detectionattention moduleencoding–decoding modulelightweight network
spellingShingle Ruihong Yin
Wei Zhao
Xudong Fan
Yongfeng Yin
AF-SSD: An Accurate and Fast Single Shot Detector for High Spatial Remote Sensing Imagery
Sensors
geospatial object detection
attention module
encoding–decoding module
lightweight network
title AF-SSD: An Accurate and Fast Single Shot Detector for High Spatial Remote Sensing Imagery
title_full AF-SSD: An Accurate and Fast Single Shot Detector for High Spatial Remote Sensing Imagery
title_fullStr AF-SSD: An Accurate and Fast Single Shot Detector for High Spatial Remote Sensing Imagery
title_full_unstemmed AF-SSD: An Accurate and Fast Single Shot Detector for High Spatial Remote Sensing Imagery
title_short AF-SSD: An Accurate and Fast Single Shot Detector for High Spatial Remote Sensing Imagery
title_sort af ssd an accurate and fast single shot detector for high spatial remote sensing imagery
topic geospatial object detection
attention module
encoding–decoding module
lightweight network
url https://www.mdpi.com/1424-8220/20/22/6530
work_keys_str_mv AT ruihongyin afssdanaccurateandfastsingleshotdetectorforhighspatialremotesensingimagery
AT weizhao afssdanaccurateandfastsingleshotdetectorforhighspatialremotesensingimagery
AT xudongfan afssdanaccurateandfastsingleshotdetectorforhighspatialremotesensingimagery
AT yongfengyin afssdanaccurateandfastsingleshotdetectorforhighspatialremotesensingimagery