Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement

Extracting buildings from high-resolution remote sensing images is essential for many geospatial applications, such as building change detection, urban planning, and disaster emergency assessment. Due to the diversity of geometric shapes and the blurring of boundaries among buildings, it is still a...

Full description

Bibliographic Details
Main Authors: Hao Xu, Panpan Zhu, Xiaobo Luo, Tianshou Xie, Liqiang Zhang
Format: Article
Language:English
Published: MDPI AG 2022-01-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/14/3/564
_version_ 1797485059511418880
author Hao Xu
Panpan Zhu
Xiaobo Luo
Tianshou Xie
Liqiang Zhang
author_facet Hao Xu
Panpan Zhu
Xiaobo Luo
Tianshou Xie
Liqiang Zhang
author_sort Hao Xu
collection DOAJ
description Extracting buildings from high-resolution remote sensing images is essential for many geospatial applications, such as building change detection, urban planning, and disaster emergency assessment. Due to the diversity of geometric shapes and the blurring of boundaries among buildings, it is still a challenging task to accurately generate building footprints from the complex scenes of remote sensing images. The rapid development of convolutional neural networks is presenting both new opportunities and challenges with respect to the extraction of buildings from high-resolution remote sensing images. To capture multilevel contextual information, most deep learning methods extract buildings by integrating multilevel features. However, the differential responses between such multilevel features are often ignored, leading to blurred contours in the extraction results. In this study, we propose an end-to-end multitask building extraction method to address these issues; this approach utilizes the rich contextual features of remote sensing images to assist with building segmentation while ensuring that the shape of the extraction results is preserved. By combining boundary classification and boundary distance regression, clear contour and distance transformation maps are generated to further improve the accuracy of building extraction. Subsequently, multiple refinement modules are used to refine each part of the network to minimize the loss of image feature information. Experimental comparisons conducted on the SpaceNet and Massachusetts building datasets show that the proposed method outperforms other deep learning methods in terms of building extraction results.
first_indexed 2024-03-09T23:13:25Z
format Article
id doaj.art-20271b09ca86451a9ef65099b132dbe0
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-09T23:13:25Z
publishDate 2022-01-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-20271b09ca86451a9ef65099b132dbe02023-11-23T17:39:46ZengMDPI AGRemote Sensing2072-42922022-01-0114356410.3390/rs14030564Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary RefinementHao Xu0Panpan Zhu1Xiaobo Luo2Tianshou Xie3Liqiang Zhang4College of Computer Sciences and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaCollege of Computer Sciences and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaCollege of Computer Sciences and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaCollege of Computer Sciences and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaState Key Laboratory of Remote Sensing Science, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, ChinaExtracting buildings from high-resolution remote sensing images is essential for many geospatial applications, such as building change detection, urban planning, and disaster emergency assessment. Due to the diversity of geometric shapes and the blurring of boundaries among buildings, it is still a challenging task to accurately generate building footprints from the complex scenes of remote sensing images. The rapid development of convolutional neural networks is presenting both new opportunities and challenges with respect to the extraction of buildings from high-resolution remote sensing images. To capture multilevel contextual information, most deep learning methods extract buildings by integrating multilevel features. However, the differential responses between such multilevel features are often ignored, leading to blurred contours in the extraction results. In this study, we propose an end-to-end multitask building extraction method to address these issues; this approach utilizes the rich contextual features of remote sensing images to assist with building segmentation while ensuring that the shape of the extraction results is preserved. By combining boundary classification and boundary distance regression, clear contour and distance transformation maps are generated to further improve the accuracy of building extraction. Subsequently, multiple refinement modules are used to refine each part of the network to minimize the loss of image feature information. Experimental comparisons conducted on the SpaceNet and Massachusetts building datasets show that the proposed method outperforms other deep learning methods in terms of building extraction results.https://www.mdpi.com/2072-4292/14/3/564building extractionremote sensingdeep learningmultitask methodboundary information
spellingShingle Hao Xu
Panpan Zhu
Xiaobo Luo
Tianshou Xie
Liqiang Zhang
Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
Remote Sensing
building extraction
remote sensing
deep learning
multitask method
boundary information
title Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_full Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_fullStr Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_full_unstemmed Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_short Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_sort extracting buildings from remote sensing images using a multitask encoder decoder network with boundary refinement
topic building extraction
remote sensing
deep learning
multitask method
boundary information
url https://www.mdpi.com/2072-4292/14/3/564
work_keys_str_mv AT haoxu extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement
AT panpanzhu extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement
AT xiaoboluo extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement
AT tianshouxie extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement
AT liqiangzhang extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement