GCSANet: A Global Context Spatial Attention Deep Learning Network for Remote Sensing Scene Classification

Deep convolutional neural networks have become an indispensable method in remote sensing image scene classification because of their powerful feature extraction capabilities. However, the ability of the models to extract multiscale features and global features on surface objects of complex scenes is...

Bibliographic Details
Main Authors: Weitao Chen, Shubing Ouyang, Wei Tong, Xianju Li, Xiongwei Zheng, Lizhe Wang
Format: Article
Language: English
Published: IEEE 2022-01-01
Series: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
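Subjects: Attention mechanism; feature channel; global context information; remote sensing; scene classification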
Online Access: https://ieeexplore.ieee.org/document/9678028/
author Weitao Chen
Shubing Ouyang
Wei Tong
Xianju Li
Xiongwei Zheng
Lizhe Wang
author_sort Weitao Chen
collection DOAJ
description Deep convolutional neural networks have become indispensable in remote sensing image scene classification because of their powerful feature extraction capabilities. However, current models remain insufficient at extracting multiscale and global features of surface objects in complex scenes. We propose GCSANet, a framework based on global context spatial attention (GCSA) and densely connected convolutional networks, to extract multiscale global scene features. The mixup operation is used to augment remote sensing images by spatially mixing samples, which renders the discrete sample space continuous and improves smoothness in the neighborhood of the data space. The densely connected backbone network extracts the characteristics of multiscale surface objects and strengthens their internal dense connections. GCSA is introduced into the backbone to encode the context information of the remote sensing scene image into the local features. Experiments were performed on four remote sensing scene datasets to evaluate GCSANet. GCSANet achieved the highest classification precision on the AID and NWPU datasets and the second-best performance on the UC Merced dataset, indicating that it can effectively extract the global features of remote sensing images. In addition, GCSANet achieved the highest classification accuracy on the constructed mountain image scene dataset. These results show that GCSANet can effectively extract multiscale global scene features from complex remote sensing scenes. The source code for this method is available at https://github.com/ShubingOuyangcug/GCSANet.
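The mixup step mentioned in the description can be illustrated with a minimal sketch. This is not the authors' implementation (their code is in the linked repository); it assumes the commonly used formulation in which two samples and their one-hot labels are blended with a coefficient drawn from a Beta distribution. The function name `mixup_batch`, the array layout, and the default `alpha=0.2` are assumptions made for illustration only.

```python
import numpy as np

def mixup_batch(images, labels, alpha=0.2, rng=None):
    """Blend each sample with a randomly chosen partner from the same batch.

    images: float array of shape (N, H, W, C)
    labels: one-hot float array of shape (N, num_classes)
    alpha:  Beta-distribution parameter (illustrative default, not from the paper)
    """
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)            # mixing coefficient in [0, 1]
    perm = rng.permutation(len(images))     # partner index for every sample
    mixed_images = lam * images + (1.0 - lam) * images[perm]
    mixed_labels = lam * labels + (1.0 - lam) * labels[perm]
    return mixed_images, mixed_labels, lam

# Toy batch: four random 8x8 RGB "scenes" with three hypothetical classes.
rng = np.random.default_rng(0)
x = rng.random((4, 8, 8, 3))
y = np.eye(3)[[0, 1, 2, 0]]
mx, my, lam = mixup_batch(x, y, alpha=0.2, rng=rng)
```

Because the mixed labels are convex combinations of one-hot vectors, each row still sums to 1, which is what lets the training targets vary continuously between classes rather than jumping between discrete samples.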
first_indexed 2024-04-10T21:25:24Z
format Article
id doaj.art-bec92b7db0fb426080b08072628066dc
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-04-10T21:25:24Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-bec92b7db0fb426080b08072628066dc
2023-01-20T00:00:21Z
eng
IEEE
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
ISSN 2151-1535
2022-01-01
vol. 15, pp. 1150-1162
DOI 10.1109/JSTARS.2022.3141826
IEEE Xplore document 9678028
GCSANet: A Global Context Spatial Attention Deep Learning Network for Remote Sensing Scene Classification
Weitao Chen (https://orcid.org/0000-0002-6272-1618)
Shubing Ouyang (https://orcid.org/0000-0003-4737-4205)
Wei Tong (https://orcid.org/0000-0003-2873-7584)
Xianju Li (https://orcid.org/0000-0001-7785-2541)
Xiongwei Zheng
Lizhe Wang (https://orcid.org/0000-0003-2766-0845)
All authors: School of Computer Science, China University of Geosciences, Wuhan, China
https://ieeexplore.ieee.org/document/9678028/
Keywords: Attention mechanism; feature channel; global context information; remote sensing; scene classification
title GCSANet: A Global Context Spatial Attention Deep Learning Network for Remote Sensing Scene Classification
topic Attention mechanism
feature channel
global context information
remote sensing
scene classification
url https://ieeexplore.ieee.org/document/9678028/