Relation-Attention Networks for Remote Sensing Scene Classification

Remote sensing (RS) scene classification plays an important role in a wide range of RS applications. Recently, convolutional neural networks (CNNs) have been applied to the field of scene classification in RS images and achieved impressive performance. However, to classify RS scenes, most of the exi...

Full description

Bibliographic Details
Main Authors: Xin Wang, Lin Duan, Chen Ning, Huiyu Zhou
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9652121/
_version_ 1798034798834352128
author Xin Wang
Lin Duan
Chen Ning
Huiyu Zhou
author_facet Xin Wang
Lin Duan
Chen Ning
Huiyu Zhou
author_sort Xin Wang
collection DOAJ
description Remote sensing (RS) scene classification plays an important role in a wide range of RS applications. Recently, convolutional neural networks (CNNs) have been applied to the field of scene classification in RS images and achieved impressive performance. However, to classify RS scenes, most of the existing CNN methods either utilize the high-level features from the last convolutional layer of CNNs, missing much important information existing at the other levels, or directly fuse the features at different levels, bringing redundant and/or mutually exclusive information. Inspired by the attention mechanism of the human visual system, in this article, we explore a novel relation-attention model and design an end-to-end relation-attention network (RANet) to learn powerful feature representations of multiple levels to further improve the classification performance. First, we propose to extract convolutional features at different levels by pretrained CNNs. Second, a multiscale feature computation module is constructed to connect features at different levels and generate multiscale semantic features. Third, a novel relation-attention model is designed to focus on the critical features whilst avoiding the use of redundant and even distractive ones by exploiting the scale contextual information. Finally, the resulting relation-attention features are concatenated and fed into a softmax layer for the final classification. Experiments on four well-known RS scene classification datasets (UC-Merced, WHU-RS19, AID, and OPTIMAL-31) show that our method outperforms some state-of-the-art algorithms.
first_indexed 2024-04-11T20:49:23Z
format Article
id doaj.art-b90707a1c8b74e218849d679aa241636
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-04-11T20:49:23Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-b90707a1c8b74e218849d679aa2416362022-12-22T04:03:54ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352022-01-011542243910.1109/JSTARS.2021.31355669652121Relation-Attention Networks for Remote Sensing Scene ClassificationXin Wang0https://orcid.org/0000-0003-0203-9964Lin Duan1Chen Ning2https://orcid.org/0000-0002-3026-2496Huiyu Zhou3https://orcid.org/0000-0003-1634-9840College of Computer and Information, Hohai University, Nanjing, ChinaCollege of Computer and Information, Hohai University, Nanjing, ChinaSchool of Computer and Electronic Information, Nanjing Normal University, Nanjing, ChinaSchool of Informatics, University of Leicester, Leicester, U.K.Remote sensing (RS) scene classification plays an important role in a wide range of RS applications. Recently, convolutional neural networks (CNNs) have been applied to the field of scene classification in RS images and achieved impressive performance. However, to classify RS scenes, most of the existing CNN methods either utilize the high-level features from the last convolutional layer of CNNs, missing much important information existing at the other levels, or directly fuse the features at different levels, bringing redundant and/or mutually exclusive information. Inspired by the attention mechanism of the human visual system, in this article, we explore a novel relation-attention model and design an end-to-end relation-attention network (RANet) to learn powerful feature representations of multiple levels to further improve the classification performance. First, we propose to extract convolutional features at different levels by pretrained CNNs. Second, a multiscale feature computation module is constructed to connect features at different levels and generate multiscale semantic features. Third, a novel relation-attention model is designed to focus on the critical features whilst avoiding the use of redundant and even distractive ones by exploiting the scale contextual information. Finally, the resulting relation-attention features are concatenated and fed into a softmax layer for the final classification. Experiments on four well-known RS scene classification datasets (UC-Merced, WHU-RS19, AID, and OPTIMAL-31) show that our method outperforms some state-of-the-art algorithms.https://ieeexplore.ieee.org/document/9652121/Convolutional neural network (CNN)relation-attention network (RANet)remote sensing (RS)scene classification
spellingShingle Xin Wang
Lin Duan
Chen Ning
Huiyu Zhou
Relation-Attention Networks for Remote Sensing Scene Classification
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Convolutional neural network (CNN)
relation-attention network (RANet)
remote sensing (RS)
scene classification
title Relation-Attention Networks for Remote Sensing Scene Classification
title_full Relation-Attention Networks for Remote Sensing Scene Classification
title_fullStr Relation-Attention Networks for Remote Sensing Scene Classification
title_full_unstemmed Relation-Attention Networks for Remote Sensing Scene Classification
title_short Relation-Attention Networks for Remote Sensing Scene Classification
title_sort relation attention networks for remote sensing scene classification
topic Convolutional neural network (CNN)
relation-attention network (RANet)
remote sensing (RS)
scene classification
url https://ieeexplore.ieee.org/document/9652121/
work_keys_str_mv AT xinwang relationattentionnetworksforremotesensingsceneclassification
AT linduan relationattentionnetworksforremotesensingsceneclassification
AT chenning relationattentionnetworksforremotesensingsceneclassification
AT huiyuzhou relationattentionnetworksforremotesensingsceneclassification