An Attention Cascade Global–Local Network for Remote Sensing Scene Classification

Remote sensing image scene classification is an important task of remote sensing image interpretation, which has recently been well addressed by the convolutional neural network owing to its powerful learning ability. However, due to the multiple types of geographical information and redundant backg...

Full description

Bibliographic Details
Main Authors: Junge Shen, Tianwei Yu, Haopeng Yang, Ruxin Wang, Qi Wang
Format: Article
Language:English
Published: MDPI AG 2022-04-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/14/9/2042
_version_ 1797503025020928000
author Junge Shen
Tianwei Yu
Haopeng Yang
Ruxin Wang
Qi Wang
author_facet Junge Shen
Tianwei Yu
Haopeng Yang
Ruxin Wang
Qi Wang
author_sort Junge Shen
collection DOAJ
description Remote sensing image scene classification is an important task of remote sensing image interpretation, which has recently been well addressed by the convolutional neural network owing to its powerful learning ability. However, due to the multiple types of geographical information and redundant background information of the remote sensing images, most of the CNN-based methods, especially those based on a single CNN model and those ignoring the combination of global and local features, exhibit limited performance on accurate classification. To compensate for such insufficiency, we propose a new dual-model deep feature fusion method based on an attention cascade global–local network (ACGLNet). Specifically, we use two popular CNNs as the feature extractors to extract complementary multiscale features from the input image. Considering the characteristics of the global and local features, the proposed ACGLNet filters the redundant background information from the low-level features through the spatial attention mechanism, followed by which the locally attended features are fused with the high-level features. Then, bilinear fusion is employed to produce the fused representation of the dual model, which is finally fed to the classifier. Through extensive experiments on four public remote sensing scene datasets, including UCM, AID, PatternNet, and OPTIMAL-31, we demonstrate the feasibility of the proposed method and its superiority over the state-of-the-art scene classification methods.
first_indexed 2024-03-10T03:44:35Z
format Article
id doaj.art-12876796abac4580abaa8947f2db0a3c
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-10T03:44:35Z
publishDate 2022-04-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-12876796abac4580abaa8947f2db0a3c2023-11-23T09:09:41ZengMDPI AGRemote Sensing2072-42922022-04-01149204210.3390/rs14092042An Attention Cascade Global–Local Network for Remote Sensing Scene ClassificationJunge Shen0Tianwei Yu1Haopeng Yang2Ruxin Wang3Qi Wang4Unmanned System Research Institute, Northwestern Polytechnical University, Xi’an 710072, ChinaUnmanned System Research Institute, Northwestern Polytechnical University, Xi’an 710072, ChinaUnmanned System Research Institute, Northwestern Polytechnical University, Xi’an 710072, ChinaEngineering Research Center of Cyberspace, School of Software, Yunnan University, Kunming 650106, ChinaUnmanned System Research Institute, Northwestern Polytechnical University, Xi’an 710072, ChinaRemote sensing image scene classification is an important task of remote sensing image interpretation, which has recently been well addressed by the convolutional neural network owing to its powerful learning ability. However, due to the multiple types of geographical information and redundant background information of the remote sensing images, most of the CNN-based methods, especially those based on a single CNN model and those ignoring the combination of global and local features, exhibit limited performance on accurate classification. To compensate for such insufficiency, we propose a new dual-model deep feature fusion method based on an attention cascade global–local network (ACGLNet). Specifically, we use two popular CNNs as the feature extractors to extract complementary multiscale features from the input image. Considering the characteristics of the global and local features, the proposed ACGLNet filters the redundant background information from the low-level features through the spatial attention mechanism, followed by which the locally attended features are fused with the high-level features. Then, bilinear fusion is employed to produce the fused representation of the dual model, which is finally fed to the classifier. Through extensive experiments on four public remote sensing scene datasets, including UCM, AID, PatternNet, and OPTIMAL-31, we demonstrate the feasibility of the proposed method and its superiority over the state-of-the-art scene classification methods.https://www.mdpi.com/2072-4292/14/9/2042remote sensing scene classificationconvolutional neural networkneural architecture searchfeature fusion
spellingShingle Junge Shen
Tianwei Yu
Haopeng Yang
Ruxin Wang
Qi Wang
An Attention Cascade Global–Local Network for Remote Sensing Scene Classification
Remote Sensing
remote sensing scene classification
convolutional neural network
neural architecture search
feature fusion
title An Attention Cascade Global–Local Network for Remote Sensing Scene Classification
title_full An Attention Cascade Global–Local Network for Remote Sensing Scene Classification
title_fullStr An Attention Cascade Global–Local Network for Remote Sensing Scene Classification
title_full_unstemmed An Attention Cascade Global–Local Network for Remote Sensing Scene Classification
title_short An Attention Cascade Global–Local Network for Remote Sensing Scene Classification
title_sort attention cascade global local network for remote sensing scene classification
topic remote sensing scene classification
convolutional neural network
neural architecture search
feature fusion
url https://www.mdpi.com/2072-4292/14/9/2042
work_keys_str_mv AT jungeshen anattentioncascadegloballocalnetworkforremotesensingsceneclassification
AT tianweiyu anattentioncascadegloballocalnetworkforremotesensingsceneclassification
AT haopengyang anattentioncascadegloballocalnetworkforremotesensingsceneclassification
AT ruxinwang anattentioncascadegloballocalnetworkforremotesensingsceneclassification
AT qiwang anattentioncascadegloballocalnetworkforremotesensingsceneclassification
AT jungeshen attentioncascadegloballocalnetworkforremotesensingsceneclassification
AT tianweiyu attentioncascadegloballocalnetworkforremotesensingsceneclassification
AT haopengyang attentioncascadegloballocalnetworkforremotesensingsceneclassification
AT ruxinwang attentioncascadegloballocalnetworkforremotesensingsceneclassification
AT qiwang attentioncascadegloballocalnetworkforremotesensingsceneclassification