Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet

Dynamic monitoring of building environments is essential for observing rural land changes and socio-economic development, especially in agricultural countries, such as China. Rapid and accurate building extraction and floor area estimation at the village level are vital for the overall planning of r...

Full description

Bibliographic Details
Main Authors: Jie Zhou, Yaohui Liu, Gaozhong Nie, Hao Cheng, Xinyue Yang, Xiaoxian Chen, Lutz Gross
Format: Article
Language:English
Published: MDPI AG 2022-10-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/14/20/5175
_version_ 1797470110382817280
author Jie Zhou
Yaohui Liu
Gaozhong Nie
Hao Cheng
Xinyue Yang
Xiaoxian Chen
Lutz Gross
author_facet Jie Zhou
Yaohui Liu
Gaozhong Nie
Hao Cheng
Xinyue Yang
Xiaoxian Chen
Lutz Gross
author_sort Jie Zhou
collection DOAJ
description Dynamic monitoring of building environments is essential for observing rural land changes and socio-economic development, especially in agricultural countries, such as China. Rapid and accurate building extraction and floor area estimation at the village level are vital for the overall planning of rural development and intensive land use and the “beautiful countryside” construction policy in China. Traditional in situ field surveys are an effective way to collect building information but are time-consuming and labor-intensive. Moreover, rural buildings are usually covered by vegetation and trees, leading to incomplete boundaries. This paper proposes a comprehensive method to perform village-level homestead area estimation by combining unmanned aerial vehicle (UAV) photogrammetry and deep learning technology. First, to tackle the problem of complex surface feature scenes in remote sensing images, we proposed a novel Efficient Deep-wise Spatial Attention Network (EDSANet), which uses dual attention extraction and attention feature refinement to aggregate multi-level semantics and enhance the accuracy of building extraction, especially for high-spatial-resolution imagery. Qualitative and quantitative experiments were conducted with the newly built dataset (named the rural Weinan building dataset) with different deep learning networks to examine the performance of the EDSANet model in the task of rural building extraction. Then, the number of floors of each building was estimated using the normalized digital surface model (nDSM) generated from UAV oblique photogrammetry. The floor area of the entire village was rapidly calculated by multiplying the area of each building in the village by the number of floors. The case study was conducted in Helan village, Shannxi province, China. The results show that the overall accuracy of the building extraction from UAV images with the EDSANet model was 0.939 and that the precision reached 0.949. The buildings in Helan village primarily have two stories, and their total floor area is 3.1 × 10<sup>5</sup> m<sup>2</sup>. The field survey results verified that the accuracy of the nDSM model was 0.94; the RMSE was 0.243. The proposed workflow and experimental results highlight the potential of UAV oblique photogrammetry and deep learning for rapid and efficient village-level building extraction and floor area estimation in China, as well as worldwide.
first_indexed 2024-03-09T19:32:06Z
format Article
id doaj.art-cf6377563a54466db41c7587276d5a8a
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-09T19:32:06Z
publishDate 2022-10-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-cf6377563a54466db41c7587276d5a8a2023-11-24T02:20:36ZengMDPI AGRemote Sensing2072-42922022-10-011420517510.3390/rs14205175Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANetJie Zhou0Yaohui Liu1Gaozhong Nie2Hao Cheng3Xinyue Yang4Xiaoxian Chen5Lutz Gross6Institute of Geology, China Earthquake Administration, Beijing 100029, ChinaSchool of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan 250101, ChinaInstitute of Geology, China Earthquake Administration, Beijing 100029, ChinaSchool of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, ChinaSchool of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan 250101, ChinaSchool of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan 250101, ChinaSchool of Earth and Environmental Sciences, The University of Queensland, Brisbane, QLD 4072, AustraliaDynamic monitoring of building environments is essential for observing rural land changes and socio-economic development, especially in agricultural countries, such as China. Rapid and accurate building extraction and floor area estimation at the village level are vital for the overall planning of rural development and intensive land use and the “beautiful countryside” construction policy in China. Traditional in situ field surveys are an effective way to collect building information but are time-consuming and labor-intensive. Moreover, rural buildings are usually covered by vegetation and trees, leading to incomplete boundaries. This paper proposes a comprehensive method to perform village-level homestead area estimation by combining unmanned aerial vehicle (UAV) photogrammetry and deep learning technology. First, to tackle the problem of complex surface feature scenes in remote sensing images, we proposed a novel Efficient Deep-wise Spatial Attention Network (EDSANet), which uses dual attention extraction and attention feature refinement to aggregate multi-level semantics and enhance the accuracy of building extraction, especially for high-spatial-resolution imagery. Qualitative and quantitative experiments were conducted with the newly built dataset (named the rural Weinan building dataset) with different deep learning networks to examine the performance of the EDSANet model in the task of rural building extraction. Then, the number of floors of each building was estimated using the normalized digital surface model (nDSM) generated from UAV oblique photogrammetry. The floor area of the entire village was rapidly calculated by multiplying the area of each building in the village by the number of floors. The case study was conducted in Helan village, Shannxi province, China. The results show that the overall accuracy of the building extraction from UAV images with the EDSANet model was 0.939 and that the precision reached 0.949. The buildings in Helan village primarily have two stories, and their total floor area is 3.1 × 10<sup>5</sup> m<sup>2</sup>. The field survey results verified that the accuracy of the nDSM model was 0.94; the RMSE was 0.243. The proposed workflow and experimental results highlight the potential of UAV oblique photogrammetry and deep learning for rapid and efficient village-level building extraction and floor area estimation in China, as well as worldwide.https://www.mdpi.com/2072-4292/14/20/5175building extractionfloor area estimationrural Chinadeep learningUAV
spellingShingle Jie Zhou
Yaohui Liu
Gaozhong Nie
Hao Cheng
Xinyue Yang
Xiaoxian Chen
Lutz Gross
Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet
Remote Sensing
building extraction
floor area estimation
rural China
deep learning
UAV
title Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet
title_full Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet
title_fullStr Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet
title_full_unstemmed Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet
title_short Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet
title_sort building extraction and floor area estimation at the village level in rural china via a comprehensive method integrating uav photogrammetry and the novel edsanet
topic building extraction
floor area estimation
rural China
deep learning
UAV
url https://www.mdpi.com/2072-4292/14/20/5175
work_keys_str_mv AT jiezhou buildingextractionandfloorareaestimationatthevillagelevelinruralchinaviaacomprehensivemethodintegratinguavphotogrammetryandthenoveledsanet
AT yaohuiliu buildingextractionandfloorareaestimationatthevillagelevelinruralchinaviaacomprehensivemethodintegratinguavphotogrammetryandthenoveledsanet
AT gaozhongnie buildingextractionandfloorareaestimationatthevillagelevelinruralchinaviaacomprehensivemethodintegratinguavphotogrammetryandthenoveledsanet
AT haocheng buildingextractionandfloorareaestimationatthevillagelevelinruralchinaviaacomprehensivemethodintegratinguavphotogrammetryandthenoveledsanet
AT xinyueyang buildingextractionandfloorareaestimationatthevillagelevelinruralchinaviaacomprehensivemethodintegratinguavphotogrammetryandthenoveledsanet
AT xiaoxianchen buildingextractionandfloorareaestimationatthevillagelevelinruralchinaviaacomprehensivemethodintegratinguavphotogrammetryandthenoveledsanet
AT lutzgross buildingextractionandfloorareaestimationatthevillagelevelinruralchinaviaacomprehensivemethodintegratinguavphotogrammetryandthenoveledsanet