Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement

Extracting buildings from high-resolution remote sensing images is essential for many geospatial applications, such as building change detection, urban planning, and disaster emergency assessment. Due to the diversity of geometric shapes and the blurring of boundaries among buildings, it is still a...

Full description

Bibliographic Details
Main Authors:	Hao Xu, Panpan Zhu, Xiaobo Luo, Tianshou Xie, Liqiang Zhang
Format:	Article
Language:	English
Published:	MDPI AG 2022-01-01
Series:	Remote Sensing
Subjects:	building extraction remote sensing deep learning multitask method boundary information
Online Access:	https://www.mdpi.com/2072-4292/14/3/564

_version_	1797485059511418880
author	Hao Xu Panpan Zhu Xiaobo Luo Tianshou Xie Liqiang Zhang
author_facet	Hao Xu Panpan Zhu Xiaobo Luo Tianshou Xie Liqiang Zhang
author_sort	Hao Xu
collection	DOAJ
description	Extracting buildings from high-resolution remote sensing images is essential for many geospatial applications, such as building change detection, urban planning, and disaster emergency assessment. Due to the diversity of geometric shapes and the blurring of boundaries among buildings, it is still a challenging task to accurately generate building footprints from the complex scenes of remote sensing images. The rapid development of convolutional neural networks is presenting both new opportunities and challenges with respect to the extraction of buildings from high-resolution remote sensing images. To capture multilevel contextual information, most deep learning methods extract buildings by integrating multilevel features. However, the differential responses between such multilevel features are often ignored, leading to blurred contours in the extraction results. In this study, we propose an end-to-end multitask building extraction method to address these issues; this approach utilizes the rich contextual features of remote sensing images to assist with building segmentation while ensuring that the shape of the extraction results is preserved. By combining boundary classification and boundary distance regression, clear contour and distance transformation maps are generated to further improve the accuracy of building extraction. Subsequently, multiple refinement modules are used to refine each part of the network to minimize the loss of image feature information. Experimental comparisons conducted on the SpaceNet and Massachusetts building datasets show that the proposed method outperforms other deep learning methods in terms of building extraction results.
first_indexed	2024-03-09T23:13:25Z
format	Article
id	doaj.art-20271b09ca86451a9ef65099b132dbe0
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-09T23:13:25Z
publishDate	2022-01-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-20271b09ca86451a9ef65099b132dbe02023-11-23T17:39:46ZengMDPI AGRemote Sensing2072-42922022-01-0114356410.3390/rs14030564Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary RefinementHao Xu0Panpan Zhu1Xiaobo Luo2Tianshou Xie3Liqiang Zhang4College of Computer Sciences and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaCollege of Computer Sciences and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaCollege of Computer Sciences and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaCollege of Computer Sciences and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaState Key Laboratory of Remote Sensing Science, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, ChinaExtracting buildings from high-resolution remote sensing images is essential for many geospatial applications, such as building change detection, urban planning, and disaster emergency assessment. Due to the diversity of geometric shapes and the blurring of boundaries among buildings, it is still a challenging task to accurately generate building footprints from the complex scenes of remote sensing images. The rapid development of convolutional neural networks is presenting both new opportunities and challenges with respect to the extraction of buildings from high-resolution remote sensing images. To capture multilevel contextual information, most deep learning methods extract buildings by integrating multilevel features. However, the differential responses between such multilevel features are often ignored, leading to blurred contours in the extraction results. In this study, we propose an end-to-end multitask building extraction method to address these issues; this approach utilizes the rich contextual features of remote sensing images to assist with building segmentation while ensuring that the shape of the extraction results is preserved. By combining boundary classification and boundary distance regression, clear contour and distance transformation maps are generated to further improve the accuracy of building extraction. Subsequently, multiple refinement modules are used to refine each part of the network to minimize the loss of image feature information. Experimental comparisons conducted on the SpaceNet and Massachusetts building datasets show that the proposed method outperforms other deep learning methods in terms of building extraction results.https://www.mdpi.com/2072-4292/14/3/564building extractionremote sensingdeep learningmultitask methodboundary information
spellingShingle	Hao Xu Panpan Zhu Xiaobo Luo Tianshou Xie Liqiang Zhang Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement Remote Sensing building extraction remote sensing deep learning multitask method boundary information
title	Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_full	Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_fullStr	Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_full_unstemmed	Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_short	Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement
title_sort	extracting buildings from remote sensing images using a multitask encoder decoder network with boundary refinement
topic	building extraction remote sensing deep learning multitask method boundary information
url	https://www.mdpi.com/2072-4292/14/3/564
work_keys_str_mv	AT haoxu extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement AT panpanzhu extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement AT xiaoboluo extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement AT tianshouxie extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement AT liqiangzhang extractingbuildingsfromremotesensingimagesusingamultitaskencoderdecodernetworkwithboundaryrefinement

Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement

Similar Items