Multi-Field Context Fusion Network for Semantic Segmentation of High-Spatial-Resolution Remote Sensing Images

High spatial resolution (HSR) remote sensing images have a wide range of application prospects in the fields of urban planning, agricultural planning and military training. Therefore, the research on the semantic segmentation of remote sensing images becomes extremely important. However, large data...

Full description

Bibliographic Details
Main Authors: Xinran Du, Shumeng He, Houqun Yang, Chunxiao Wang
Format: Article
Language:English
Published: MDPI AG 2022-11-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/14/22/5830
_version_ 1797464032877215744
author Xinran Du
Shumeng He
Houqun Yang
Chunxiao Wang
author_facet Xinran Du
Shumeng He
Houqun Yang
Chunxiao Wang
author_sort Xinran Du
collection DOAJ
description High spatial resolution (HSR) remote sensing images have a wide range of application prospects in the fields of urban planning, agricultural planning and military training. Therefore, the research on the semantic segmentation of remote sensing images becomes extremely important. However, large data volume and the complex background of HSR remote sensing images put great pressure on the algorithm efficiency. Although the pressure on the GPU can be relieved by down-sampling the image or cropping it into small patches for separate processing, the loss of local details or global contextual information can lead to limited segmentation accuracy. In this study, we propose a multi-field context fusion network (MCFNet), which can preserve both global and local information efficiently. The method consists of three modules: a backbone network, a patch selection module (PSM), and a multi-field context fusion module (FM). Specifically, we propose a confidence-based local selection criterion in the PSM, which adaptively selects local locations in the image that are poorly segmented. Subsequently, the FM dynamically aggregates the semantic information of multiple visual fields centered on that local location to enhance the segmentation of these local locations. Since MCFNet only performs segmentation enhancement on local locations in an image, it can improve segmentation accuracy without consuming excessive GPU memory. We implement our method on two high spatial resolution remote sensing image datasets, DeepGlobe and Potsdam, and compare the proposed method with state-of-the-art methods. The results show that the MCFNet method achieves the best balance in terms of segmentation accuracy, memory efficiency, and inference speed.
first_indexed 2024-03-09T18:02:12Z
format Article
id doaj.art-9931549c27144b6485431cc0511a39fc
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-09T18:02:12Z
publishDate 2022-11-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-9931549c27144b6485431cc0511a39fc2023-11-24T09:51:07ZengMDPI AGRemote Sensing2072-42922022-11-011422583010.3390/rs14225830Multi-Field Context Fusion Network for Semantic Segmentation of High-Spatial-Resolution Remote Sensing ImagesXinran Du0Shumeng He1Houqun Yang2Chunxiao Wang3College of Computer Science and Technology, Hainan University, Haikou 570100, ChinaCollege of Computer Science and Technology, Hainan University, Haikou 570100, ChinaCollege of Computer Science and Technology, Hainan University, Haikou 570100, ChinaHainan Geomatics Centre of Ministry of Natural Resources, Haikou 570203, ChinaHigh spatial resolution (HSR) remote sensing images have a wide range of application prospects in the fields of urban planning, agricultural planning and military training. Therefore, the research on the semantic segmentation of remote sensing images becomes extremely important. However, large data volume and the complex background of HSR remote sensing images put great pressure on the algorithm efficiency. Although the pressure on the GPU can be relieved by down-sampling the image or cropping it into small patches for separate processing, the loss of local details or global contextual information can lead to limited segmentation accuracy. In this study, we propose a multi-field context fusion network (MCFNet), which can preserve both global and local information efficiently. The method consists of three modules: a backbone network, a patch selection module (PSM), and a multi-field context fusion module (FM). Specifically, we propose a confidence-based local selection criterion in the PSM, which adaptively selects local locations in the image that are poorly segmented. Subsequently, the FM dynamically aggregates the semantic information of multiple visual fields centered on that local location to enhance the segmentation of these local locations. Since MCFNet only performs segmentation enhancement on local locations in an image, it can improve segmentation accuracy without consuming excessive GPU memory. We implement our method on two high spatial resolution remote sensing image datasets, DeepGlobe and Potsdam, and compare the proposed method with state-of-the-art methods. The results show that the MCFNet method achieves the best balance in terms of segmentation accuracy, memory efficiency, and inference speed.https://www.mdpi.com/2072-4292/14/22/5830semantic segmentationhigh spatial resolution remote sensing imagesmemory efficiency
spellingShingle Xinran Du
Shumeng He
Houqun Yang
Chunxiao Wang
Multi-Field Context Fusion Network for Semantic Segmentation of High-Spatial-Resolution Remote Sensing Images
Remote Sensing
semantic segmentation
high spatial resolution remote sensing images
memory efficiency
title Multi-Field Context Fusion Network for Semantic Segmentation of High-Spatial-Resolution Remote Sensing Images
title_full Multi-Field Context Fusion Network for Semantic Segmentation of High-Spatial-Resolution Remote Sensing Images
title_fullStr Multi-Field Context Fusion Network for Semantic Segmentation of High-Spatial-Resolution Remote Sensing Images
title_full_unstemmed Multi-Field Context Fusion Network for Semantic Segmentation of High-Spatial-Resolution Remote Sensing Images
title_short Multi-Field Context Fusion Network for Semantic Segmentation of High-Spatial-Resolution Remote Sensing Images
title_sort multi field context fusion network for semantic segmentation of high spatial resolution remote sensing images
topic semantic segmentation
high spatial resolution remote sensing images
memory efficiency
url https://www.mdpi.com/2072-4292/14/22/5830
work_keys_str_mv AT xinrandu multifieldcontextfusionnetworkforsemanticsegmentationofhighspatialresolutionremotesensingimages
AT shumenghe multifieldcontextfusionnetworkforsemanticsegmentationofhighspatialresolutionremotesensingimages
AT houqunyang multifieldcontextfusionnetworkforsemanticsegmentationofhighspatialresolutionremotesensingimages
AT chunxiaowang multifieldcontextfusionnetworkforsemanticsegmentationofhighspatialresolutionremotesensingimages