Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation

Semantic segmentation of remote sensing imagery is a fundamental task in intelligent interpretation. Since deep convolutional neural networks (DCNNs) performed considerable insight in learning implicit representations from data, numerous works in recent years have transferred the DCNN-based model to...

Full description

Bibliographic Details
Main Authors:	Xin Li, Feng Xu, Runliang Xia, Xin Lyu, Hongmin Gao, Yao Tong
Format:	Article
Language:	English
Published:	MDPI AG 2021-07-01
Series:	Remote Sensing
Subjects:	semantic segmentation remote sensing imagery cross-level contextual information representation enhancement
Online Access:	https://www.mdpi.com/2072-4292/13/15/2986

_version_	1827686338469560320
author	Xin Li Feng Xu Runliang Xia Xin Lyu Hongmin Gao Yao Tong
author_facet	Xin Li Feng Xu Runliang Xia Xin Lyu Hongmin Gao Yao Tong
author_sort	Xin Li
collection	DOAJ
description	Semantic segmentation of remote sensing imagery is a fundamental task in intelligent interpretation. Since deep convolutional neural networks (DCNNs) performed considerable insight in learning implicit representations from data, numerous works in recent years have transferred the DCNN-based model to remote sensing data analysis. However, the wide-range observation areas, complex and diverse objects and illumination and imaging angle influence the pixels easily confused, leading to undesirable results. Therefore, a remote sensing imagery semantic segmentation neural network, named HCANet, is proposed to generate representative and discriminative representations for dense predictions. HCANet hybridizes cross-level contextual and attentive representations to emphasize the distinguishability of learned features. First of all, a cross-level contextual representation module (CCRM) is devised to exploit and harness the superpixel contextual information. Moreover, a hybrid representation enhancement module (HREM) is designed to fuse cross-level contextual and self-attentive representations flexibly. Furthermore, the decoder incorporates DUpsampling operation to boost the efficiency losslessly. The extensive experiments are implemented on the Vaihingen and Potsdam benchmarks. In addition, the results indicate that HCANet achieves excellent performance on overall accuracy and mean intersection over union. In addition, the ablation study further verifies the superiority of CCRM.
first_indexed	2024-03-10T09:08:49Z
format	Article
id	doaj.art-10cd51ad0db245b4b07455b3e06a4df7
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-10T09:08:49Z
publishDate	2021-07-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-10cd51ad0db245b4b07455b3e06a4df72023-11-22T06:07:10ZengMDPI AGRemote Sensing2072-42922021-07-011315298610.3390/rs13152986Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic SegmentationXin Li0Feng Xu1Runliang Xia2Xin Lyu3Hongmin Gao4Yao Tong5College of Computer and Information, Hohai University, Nanjing 211100, ChinaCollege of Computer and Information, Hohai University, Nanjing 211100, ChinaInformation Engineering Center, Yellow River Institute of Hydraulic Research, Zhengzhou 450003, ChinaCollege of Computer and Information, Hohai University, Nanjing 211100, ChinaCollege of Computer and Information, Hohai University, Nanjing 211100, ChinaSchool of Information Engineering, Zhengzhou University, Zhengzhou 450001, ChinaSemantic segmentation of remote sensing imagery is a fundamental task in intelligent interpretation. Since deep convolutional neural networks (DCNNs) performed considerable insight in learning implicit representations from data, numerous works in recent years have transferred the DCNN-based model to remote sensing data analysis. However, the wide-range observation areas, complex and diverse objects and illumination and imaging angle influence the pixels easily confused, leading to undesirable results. Therefore, a remote sensing imagery semantic segmentation neural network, named HCANet, is proposed to generate representative and discriminative representations for dense predictions. HCANet hybridizes cross-level contextual and attentive representations to emphasize the distinguishability of learned features. First of all, a cross-level contextual representation module (CCRM) is devised to exploit and harness the superpixel contextual information. Moreover, a hybrid representation enhancement module (HREM) is designed to fuse cross-level contextual and self-attentive representations flexibly. Furthermore, the decoder incorporates DUpsampling operation to boost the efficiency losslessly. The extensive experiments are implemented on the Vaihingen and Potsdam benchmarks. In addition, the results indicate that HCANet achieves excellent performance on overall accuracy and mean intersection over union. In addition, the ablation study further verifies the superiority of CCRM.https://www.mdpi.com/2072-4292/13/15/2986semantic segmentationremote sensing imagerycross-level contextual informationrepresentation enhancement
spellingShingle	Xin Li Feng Xu Runliang Xia Xin Lyu Hongmin Gao Yao Tong Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation Remote Sensing semantic segmentation remote sensing imagery cross-level contextual information representation enhancement
title	Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation
title_full	Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation
title_fullStr	Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation
title_full_unstemmed	Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation
title_short	Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation
title_sort	hybridizing cross level contextual and attentive representations for remote sensing imagery semantic segmentation
topic	semantic segmentation remote sensing imagery cross-level contextual information representation enhancement
url	https://www.mdpi.com/2072-4292/13/15/2986
work_keys_str_mv	AT xinli hybridizingcrosslevelcontextualandattentiverepresentationsforremotesensingimagerysemanticsegmentation AT fengxu hybridizingcrosslevelcontextualandattentiverepresentationsforremotesensingimagerysemanticsegmentation AT runliangxia hybridizingcrosslevelcontextualandattentiverepresentationsforremotesensingimagerysemanticsegmentation AT xinlyu hybridizingcrosslevelcontextualandattentiverepresentationsforremotesensingimagerysemanticsegmentation AT hongmingao hybridizingcrosslevelcontextualandattentiverepresentationsforremotesensingimagerysemanticsegmentation AT yaotong hybridizingcrosslevelcontextualandattentiverepresentationsforremotesensingimagerysemanticsegmentation

Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation

Similar Items