Divide to Attend: A Multiple Receptive Field Attention Module for Object Detection in Remote Sensing Images

The study of remote sensing image object detection has excellent research value in environmental protection and public safety. However, the performance of the detectors is unsatisfactory due to the large variability of object size and complex background noise in remote sensing images. Therefore, it...

Full description

Bibliographic Details
Main Authors: Haotian Tan, Yu Jiong, Xueqiang Wan, Junjie Wang
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9858151/
_version_ 1828357015393533952
author Haotian Tan
Yu Jiong
Xueqiang Wan
Junjie Wang
author_facet Haotian Tan
Yu Jiong
Xueqiang Wan
Junjie Wang
author_sort Haotian Tan
collection DOAJ
description The study of remote sensing image object detection has excellent research value in environmental protection and public safety. However, the performance of the detectors is unsatisfactory due to the large variability of object size and complex background noise in remote sensing images. Therefore, it is essential to improve the detection performance of the detectors. Inspired by the idea of “divide and conquer”, we proposed a Multiple Receptive Field Attention (MRFA) to solve these problems and which is a plug-and-play attention method. First, we use the method of multiple receptive field feature map generation to convert the input feature map into four feature maps with different receptive fields. In this way, the small, medium, large, and immense objects in the input feature maps are “seen” in these feature maps, respectively. Then, we used the multiple attention map fusion method to focus objects of different sizes separately, which can effectively suppress noise in the background of remote sensing images. Experiments on remote sensing object detection datasets DIOR and HRRSD demonstrate that the performance of our method is better than other state-of-the-art attention modules. At the same time, the experiments on remote sensing image semantic segmentation dataset WHDLD and classification dataset AID prove the generalization and superiority of our method.
first_indexed 2024-04-14T03:08:25Z
format Article
id doaj.art-8cee2965933e4ed782915c13b6572abb
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-14T03:08:25Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-8cee2965933e4ed782915c13b6572abb2022-12-22T02:15:40ZengIEEEIEEE Access2169-35362022-01-0110872668728110.1109/ACCESS.2022.31993689858151Divide to Attend: A Multiple Receptive Field Attention Module for Object Detection in Remote Sensing ImagesHaotian Tan0https://orcid.org/0000-0001-8698-0115Yu Jiong1Xueqiang Wan2Junjie Wang3School of Information Science and Engineering, Xinjiang University, Urumqi, ChinaSchool of Information Science and Engineering, Xinjiang University, Urumqi, ChinaSchool of Software, Xinjiang University, Urumqi, ChinaSchool of Software, Xinjiang University, Urumqi, ChinaThe study of remote sensing image object detection has excellent research value in environmental protection and public safety. However, the performance of the detectors is unsatisfactory due to the large variability of object size and complex background noise in remote sensing images. Therefore, it is essential to improve the detection performance of the detectors. Inspired by the idea of “divide and conquer”, we proposed a Multiple Receptive Field Attention (MRFA) to solve these problems and which is a plug-and-play attention method. First, we use the method of multiple receptive field feature map generation to convert the input feature map into four feature maps with different receptive fields. In this way, the small, medium, large, and immense objects in the input feature maps are “seen” in these feature maps, respectively. Then, we used the multiple attention map fusion method to focus objects of different sizes separately, which can effectively suppress noise in the background of remote sensing images. Experiments on remote sensing object detection datasets DIOR and HRRSD demonstrate that the performance of our method is better than other state-of-the-art attention modules. At the same time, the experiments on remote sensing image semantic segmentation dataset WHDLD and classification dataset AID prove the generalization and superiority of our method.https://ieeexplore.ieee.org/document/9858151/Attention mechanismobject detectionremote sensing imagesimage processing
spellingShingle Haotian Tan
Yu Jiong
Xueqiang Wan
Junjie Wang
Divide to Attend: A Multiple Receptive Field Attention Module for Object Detection in Remote Sensing Images
IEEE Access
Attention mechanism
object detection
remote sensing images
image processing
title Divide to Attend: A Multiple Receptive Field Attention Module for Object Detection in Remote Sensing Images
title_full Divide to Attend: A Multiple Receptive Field Attention Module for Object Detection in Remote Sensing Images
title_fullStr Divide to Attend: A Multiple Receptive Field Attention Module for Object Detection in Remote Sensing Images
title_full_unstemmed Divide to Attend: A Multiple Receptive Field Attention Module for Object Detection in Remote Sensing Images
title_short Divide to Attend: A Multiple Receptive Field Attention Module for Object Detection in Remote Sensing Images
title_sort divide to attend a multiple receptive field attention module for object detection in remote sensing images
topic Attention mechanism
object detection
remote sensing images
image processing
url https://ieeexplore.ieee.org/document/9858151/
work_keys_str_mv AT haotiantan dividetoattendamultiplereceptivefieldattentionmoduleforobjectdetectioninremotesensingimages
AT yujiong dividetoattendamultiplereceptivefieldattentionmoduleforobjectdetectioninremotesensingimages
AT xueqiangwan dividetoattendamultiplereceptivefieldattentionmoduleforobjectdetectioninremotesensingimages
AT junjiewang dividetoattendamultiplereceptivefieldattentionmoduleforobjectdetectioninremotesensingimages