GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing Images

Semantic segmentation of high-resolution remote sensing images holds paramount importance in the field of remote sensing. To better excavate and fully fuse the features in high-resolution remote sensing images, this paper introduces a novel Global and Local Feature Fusion Network, abbreviated as GLF...

Full description

Bibliographic Details
Main Authors:	Wanying Song, Xinwei Zhou, Shiru Zhang, Yan Wu, Peng Zhang
Format:	Article
Language:	English
Published:	MDPI AG 2023-09-01
Series:	Remote Sensing
Subjects:	high-resolution remote sensing semantic segmentation global context information fine-grained feature feature fusion
Online Access:	https://www.mdpi.com/2072-4292/15/19/4649

_version_	1797575219764789248
author	Wanying Song Xinwei Zhou Shiru Zhang Yan Wu Peng Zhang
author_facet	Wanying Song Xinwei Zhou Shiru Zhang Yan Wu Peng Zhang
author_sort	Wanying Song
collection	DOAJ
description	Semantic segmentation of high-resolution remote sensing images holds paramount importance in the field of remote sensing. To better excavate and fully fuse the features in high-resolution remote sensing images, this paper introduces a novel Global and Local Feature Fusion Network, abbreviated as GLF-Net, by incorporating the extensive contextual information and refined fine-grained features. The proposed GLF-Net, devised as an encoder–decoder network, employs the powerful ResNet50 as its baseline model. It incorporates two pivotal components within the encoder phase: a Covariance Attention Module (CAM) and a Local Fine-Grained Extraction Module (LFM). And an additional wavelet self-attention module (WST) is integrated into the decoder stage. The CAM effectively extracts the features of different scales from various stages of the ResNet and then encodes them with graph convolutions. In this way, the proposed GLF-Net model can well capture the global contextual information with both universality and consistency. Additionally, the local feature extraction module refines the feature map by encoding the semantic and spatial information, thereby capturing the local fine-grained features in images. Furthermore, the WST maximizes the synergy between the high-frequency and the low-frequency information, facilitating the fusion of global and local features for better performance in semantic segmentation. The effectiveness of the proposed GLF-Net model is validated through experiments conducted on the ISPRS Potsdam and Vaihingen datasets. The results verify that it can greatly improve segmentation accuracy.
first_indexed	2024-03-10T21:36:39Z
format	Article
id	doaj.art-44c3c35372b64a44b401c7801da305ed
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-10T21:36:39Z
publishDate	2023-09-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-44c3c35372b64a44b401c7801da305ed2023-11-19T14:58:01ZengMDPI AGRemote Sensing2072-42922023-09-011519464910.3390/rs15194649GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing ImagesWanying Song0Xinwei Zhou1Shiru Zhang2Yan Wu3Peng Zhang4Xi’an Key Laboratory of Network Convergence Communication, School of Communication and Information Engineering, Xi’an University of Science and Technology, Xi’an 710054, ChinaXi’an Key Laboratory of Network Convergence Communication, School of Communication and Information Engineering, Xi’an University of Science and Technology, Xi’an 710054, ChinaXi’an Key Laboratory of Network Convergence Communication, School of Communication and Information Engineering, Xi’an University of Science and Technology, Xi’an 710054, ChinaSchool of Electronics Engineering, Xidian University, Xi’an 710071, ChinaSchool of Electronics Engineering, Xidian University, Xi’an 710071, ChinaSemantic segmentation of high-resolution remote sensing images holds paramount importance in the field of remote sensing. To better excavate and fully fuse the features in high-resolution remote sensing images, this paper introduces a novel Global and Local Feature Fusion Network, abbreviated as GLF-Net, by incorporating the extensive contextual information and refined fine-grained features. The proposed GLF-Net, devised as an encoder–decoder network, employs the powerful ResNet50 as its baseline model. It incorporates two pivotal components within the encoder phase: a Covariance Attention Module (CAM) and a Local Fine-Grained Extraction Module (LFM). And an additional wavelet self-attention module (WST) is integrated into the decoder stage. The CAM effectively extracts the features of different scales from various stages of the ResNet and then encodes them with graph convolutions. In this way, the proposed GLF-Net model can well capture the global contextual information with both universality and consistency. Additionally, the local feature extraction module refines the feature map by encoding the semantic and spatial information, thereby capturing the local fine-grained features in images. Furthermore, the WST maximizes the synergy between the high-frequency and the low-frequency information, facilitating the fusion of global and local features for better performance in semantic segmentation. The effectiveness of the proposed GLF-Net model is validated through experiments conducted on the ISPRS Potsdam and Vaihingen datasets. The results verify that it can greatly improve segmentation accuracy.https://www.mdpi.com/2072-4292/15/19/4649high-resolution remote sensingsemantic segmentationglobal context informationfine-grained featurefeature fusion
spellingShingle	Wanying Song Xinwei Zhou Shiru Zhang Yan Wu Peng Zhang GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing Images Remote Sensing high-resolution remote sensing semantic segmentation global context information fine-grained feature feature fusion
title	GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing Images
title_full	GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing Images
title_fullStr	GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing Images
title_full_unstemmed	GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing Images
title_short	GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing Images
title_sort	glf net a semantic segmentation model fusing global and local features for high resolution remote sensing images
topic	high-resolution remote sensing semantic segmentation global context information fine-grained feature feature fusion
url	https://www.mdpi.com/2072-4292/15/19/4649
work_keys_str_mv	AT wanyingsong glfnetasemanticsegmentationmodelfusingglobalandlocalfeaturesforhighresolutionremotesensingimages AT xinweizhou glfnetasemanticsegmentationmodelfusingglobalandlocalfeaturesforhighresolutionremotesensingimages AT shiruzhang glfnetasemanticsegmentationmodelfusingglobalandlocalfeaturesforhighresolutionremotesensingimages AT yanwu glfnetasemanticsegmentationmodelfusingglobalandlocalfeaturesforhighresolutionremotesensingimages AT pengzhang glfnetasemanticsegmentationmodelfusingglobalandlocalfeaturesforhighresolutionremotesensingimages

GLF-Net: A Semantic Segmentation Model Fusing Global and Local Features for High-Resolution Remote Sensing Images

Similar Items