A Multiscale Attention Network for Remote Sensing Scene Images Classification

The remote sensing scene images classification has been of great value to civil and military fields. Deep learning models, especially the convolutional neural network (CNN), have achieved great success in this task, however, they may suffer from two challenges: first, the sizes of the category objec...

Full description

Bibliographic Details
Main Authors: Guokai Zhang, Weizhe Xu, Wei Zhao, Chenxi Huang, Eddie Ng Yk, Yongyong Chen, Jian Su
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9528049/
_version_ 1831762422380101632
author Guokai Zhang
Weizhe Xu
Wei Zhao
Chenxi Huang
Eddie Ng Yk
Yongyong Chen
Jian Su
author_facet Guokai Zhang
Weizhe Xu
Wei Zhao
Chenxi Huang
Eddie Ng Yk
Yongyong Chen
Jian Su
author_sort Guokai Zhang
collection DOAJ
description The remote sensing scene images classification has been of great value to civil and military fields. Deep learning models, especially the convolutional neural network (CNN), have achieved great success in this task, however, they may suffer from two challenges: first, the sizes of the category objects are usually different, but the conventional CNN extracts the features with fixed convolution extractor, which could cause the failure in learning the multiscale features; second, some image regions may not be useful during the feature learning process, therefore, how to guide the network to select and focus on the most relevant regions is crucially vital for remote sensing scene image classification. To address these two challenges, we propose a multiscale attention network (MSA-Network), which integrates a multiscale (MS) module and a channel and position attention (CPA) module to boost the performance of the remote sensing scene classification. The proposed MS module learns multiscale features by adopting various sizes of sliding windows from different depths’ layers and receptive fields. The CPA module is composed of two parts: the channel attention (CA) module and the position attention (PA) one. The CA module learns the global attention features from channel-level, and the PA module extracts the local attention features from pixel-level. Thus, fusing both of those two attention features, the network is apt to focus on the more critical and salient regions automatically. Extensive experiments on UC Merced, AID, NWPU-RESISC45 datasets demonstrate that the proposed MSA-Network outperforms several state-of-the-art methods.
first_indexed 2024-12-22T05:06:43Z
format Article
id doaj.art-75554dff98284939a18c6a702cef2509
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-12-22T05:06:43Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-75554dff98284939a18c6a702cef25092022-12-21T18:38:06ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01149530954510.1109/JSTARS.2021.31096619528049A Multiscale Attention Network for Remote Sensing Scene Images ClassificationGuokai Zhang0https://orcid.org/0000-0002-0952-8325Weizhe Xu1Wei Zhao2Chenxi Huang3https://orcid.org/0000-0002-2100-0259Eddie Ng Yk4https://orcid.org/0000-0002-5701-1080Yongyong Chen5https://orcid.org/0000-0003-1970-1993Jian Su6https://orcid.org/0000-0003-0634-4843School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai, ChinaSchool of Computer Science, University of Manchester, Manchester, U.K.School of Software Enginnering, Tongji University, Shanghai, ChinaSchool of Informatics, Xiamen University, Xiamen, ChinaSchool of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore, SingaporeSchool of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, ChinaSchool of Computer and Software, Nanjing University of Information Science and Technology, Nanjing, ChinaThe remote sensing scene images classification has been of great value to civil and military fields. Deep learning models, especially the convolutional neural network (CNN), have achieved great success in this task, however, they may suffer from two challenges: first, the sizes of the category objects are usually different, but the conventional CNN extracts the features with fixed convolution extractor, which could cause the failure in learning the multiscale features; second, some image regions may not be useful during the feature learning process, therefore, how to guide the network to select and focus on the most relevant regions is crucially vital for remote sensing scene image classification. To address these two challenges, we propose a multiscale attention network (MSA-Network), which integrates a multiscale (MS) module and a channel and position attention (CPA) module to boost the performance of the remote sensing scene classification. The proposed MS module learns multiscale features by adopting various sizes of sliding windows from different depths’ layers and receptive fields. The CPA module is composed of two parts: the channel attention (CA) module and the position attention (PA) one. The CA module learns the global attention features from channel-level, and the PA module extracts the local attention features from pixel-level. Thus, fusing both of those two attention features, the network is apt to focus on the more critical and salient regions automatically. Extensive experiments on UC Merced, AID, NWPU-RESISC45 datasets demonstrate that the proposed MSA-Network outperforms several state-of-the-art methods.https://ieeexplore.ieee.org/document/9528049/Remote sensing scenemulti-scaleattentionfeature fusion
spellingShingle Guokai Zhang
Weizhe Xu
Wei Zhao
Chenxi Huang
Eddie Ng Yk
Yongyong Chen
Jian Su
A Multiscale Attention Network for Remote Sensing Scene Images Classification
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Remote sensing scene
multi-scale
attention
feature fusion
title A Multiscale Attention Network for Remote Sensing Scene Images Classification
title_full A Multiscale Attention Network for Remote Sensing Scene Images Classification
title_fullStr A Multiscale Attention Network for Remote Sensing Scene Images Classification
title_full_unstemmed A Multiscale Attention Network for Remote Sensing Scene Images Classification
title_short A Multiscale Attention Network for Remote Sensing Scene Images Classification
title_sort multiscale attention network for remote sensing scene images classification
topic Remote sensing scene
multi-scale
attention
feature fusion
url https://ieeexplore.ieee.org/document/9528049/
work_keys_str_mv AT guokaizhang amultiscaleattentionnetworkforremotesensingsceneimagesclassification
AT weizhexu amultiscaleattentionnetworkforremotesensingsceneimagesclassification
AT weizhao amultiscaleattentionnetworkforremotesensingsceneimagesclassification
AT chenxihuang amultiscaleattentionnetworkforremotesensingsceneimagesclassification
AT eddiengyk amultiscaleattentionnetworkforremotesensingsceneimagesclassification
AT yongyongchen amultiscaleattentionnetworkforremotesensingsceneimagesclassification
AT jiansu amultiscaleattentionnetworkforremotesensingsceneimagesclassification
AT guokaizhang multiscaleattentionnetworkforremotesensingsceneimagesclassification
AT weizhexu multiscaleattentionnetworkforremotesensingsceneimagesclassification
AT weizhao multiscaleattentionnetworkforremotesensingsceneimagesclassification
AT chenxihuang multiscaleattentionnetworkforremotesensingsceneimagesclassification
AT eddiengyk multiscaleattentionnetworkforremotesensingsceneimagesclassification
AT yongyongchen multiscaleattentionnetworkforremotesensingsceneimagesclassification
AT jiansu multiscaleattentionnetworkforremotesensingsceneimagesclassification