Combining Deep Semantic Segmentation Network and Graph Convolutional Neural Network for Semantic Segmentation of Remote Sensing Imagery

Although the deep semantic segmentation network (DSSN) has been widely used in remote sensing (RS) image semantic segmentation, it still does not fully consider the spatial relationship cues between objects when extracting deep visual features through convolutional filters and pooling layers. In fact, t...


Bibliographic Details
Main Authors:	Song Ouyang, Yansheng Li
Format:	Article
Language:	English
Published:	MDPI AG 2020-12-01
Series:	Remote Sensing
Subjects:	deep semantic segmentation network (DSSN); graph convolutional neural network (GCN); remote sensing (RS); semantic segmentation; spatial relationship
Online Access:	https://www.mdpi.com/2072-4292/13/1/119

_version_	1797542740139966464
author	Song Ouyang, Yansheng Li
author_sort	Song Ouyang
collection	DOAJ
description	Although the deep semantic segmentation network (DSSN) has been widely used in remote sensing (RS) image semantic segmentation, it still does not fully consider the spatial relationship cues between objects when extracting deep visual features through convolutional filters and pooling layers. In fact, the spatial distributions of objects from different classes are strongly correlated. For example, buildings tend to be close to roads. In view of the strong appearance extraction ability of the DSSN and the powerful topological relationship modeling capability of the graph convolutional neural network (GCN), this paper proposes a DSSN-GCN framework, which combines the advantages of both, for RS image semantic segmentation. To improve the appearance extraction ability, this paper proposes a new DSSN called the attention residual U-shaped network (AttResUNet), which uses residual blocks to encode feature maps and an attention module to refine the features. For the GCN, a graph is built in which the nodes are superpixels and the graph weights are calculated from the spectral and spatial information of the nodes. The AttResUNet is trained to extract high-level features that initialize the graph nodes. The GCN then combines these features with the spatial relationships between nodes to perform classification. Notably, the use of spatial relationship knowledge improves the performance and robustness of the classification module. In addition, because the GCN is modeled at the superpixel level, object boundaries are restored to a certain extent and there is less pixel-level noise in the final classification result. Extensive experiments on two public datasets show that the DSSN-GCN model outperforms the competitive baseline (i.e., the DSSN model) and that DSSN-GCN with AttResUNet achieves the best performance, which demonstrates the advantage of our method.
first_indexed	2024-03-10T13:34:50Z
format	Article
id	doaj.art-5c7f1306d8334b7caef4e294128c088f
institution	Directory of Open Access Journals
issn	2072-4292
language	English
last_indexed	2024-03-10T13:34:50Z
publishDate	2020-12-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-5c7f1306d8334b7caef4e294128c088f; 2023-11-21T07:35:15Z; eng; MDPI AG; Remote Sensing; 2072-4292; 2020-12-01; volume 13, issue 1, article 119; 10.3390/rs13010119; Combining Deep Semantic Segmentation Network and Graph Convolutional Neural Network for Semantic Segmentation of Remote Sensing Imagery; Song Ouyang (0); Yansheng Li (1); School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China; School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China; https://www.mdpi.com/2072-4292/13/1/119; deep semantic segmentation network (DSSN); graph convolutional neural network (GCN); remote sensing (RS); semantic segmentation; spatial relationship
title	Combining Deep Semantic Segmentation Network and Graph Convolutional Neural Network for Semantic Segmentation of Remote Sensing Imagery
topic	deep semantic segmentation network (DSSN); graph convolutional neural network (GCN); remote sensing (RS); semantic segmentation; spatial relationship
url	https://www.mdpi.com/2072-4292/13/1/119
work_keys_str_mv	AT songouyang combiningdeepsemanticsegmentationnetworkandgraphconvolutionalneuralnetworkforsemanticsegmentationofremotesensingimagery AT yanshengli combiningdeepsemanticsegmentationnetworkandgraphconvolutionalneuralnetworkforsemanticsegmentationofremotesensingimagery
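
Illustrative sketch (not part of the catalog record): the description field above says that graph nodes are superpixels, that graph weights come from spectral and spatial information, and that a GCN then combines node features with these relationships. The abstract does not give the formulas, so the following is only a minimal sketch under stated assumptions: a Gaussian kernel for the weights, and the widely used symmetric-normalized propagation rule for one GCN layer. All function names, parameters (sigma_spec, sigma_pos), and toy values are hypothetical and are not taken from the paper.

```python
import math

def gaussian_affinity(spectral, centroids, sigma_spec=1.0, sigma_pos=1.0):
    """Dense graph weights from spectral and spatial distances.

    Assumption: a product of two Gaussian kernels; the paper's exact
    weighting formula is not given in the abstract.
    """
    n = len(spectral)
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # self-loops are added later, during normalization
            d_spec = sum((a - b) ** 2 for a, b in zip(spectral[i], spectral[j]))
            d_pos = sum((a - b) ** 2 for a, b in zip(centroids[i], centroids[j]))
            weights[i][j] = (math.exp(-d_spec / sigma_spec ** 2)
                             * math.exp(-d_pos / sigma_pos ** 2))
    return weights

def normalize_adjacency(weights):
    """Symmetric normalization D^-1/2 (A + I) D^-1/2 of the weighted graph."""
    n = len(weights)
    a_hat = [[weights[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]  # each degree is >= 1 due to the self-loop
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def gcn_layer(a_norm, features, weight):
    """One propagation step: ReLU(A_norm @ H @ W)."""
    out = matmul(matmul(a_norm, features), weight)
    return [[max(0.0, v) for v in row] for row in out]

# Toy data: three superpixels. Nodes 0 and 1 are similar and adjacent,
# node 2 is spectrally different and far away.
spectral = [[0.2, 0.3], [0.25, 0.3], [0.9, 0.8]]   # mean band values per node
centroids = [[0.0, 0.0], [1.0, 0.0], [5.0, 5.0]]   # superpixel centers
features = [[1.0, 0.0], [0.8, 0.2], [0.0, 1.0]]    # stand-in for DSSN features
layer_weight = [[1.0, 0.0], [0.0, 1.0]]            # identity, for readability

adjacency = gaussian_affinity(spectral, centroids, sigma_spec=0.5, sigma_pos=2.0)
a_norm = normalize_adjacency(adjacency)
hidden = gcn_layer(a_norm, features, layer_weight)
```

In this toy setup, nodes 0 and 1 receive a much larger mutual weight than either has with node 2, so one propagation step mixes the features of nodes 0 and 1 while leaving node 2 almost unchanged. In practice the node features would come from the trained segmentation network and the layer weights would be learned.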


Similar Items