Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction

Rapidly developing remote sensing technology provides massive data for urban planning, mapping, and disaster management. As a carrier of human productive activities, buildings are essential to both urban dynamic monitoring and suburban construction inspection. Fully-convolutional-network-based metho...

Full description

Bibliographic Details
Main Authors: Wenjing Deng, Qian Shi, Jun Li
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9351600/
_version_ 1819092722654380032
author Wenjing Deng
Qian Shi
Jun Li
author_facet Wenjing Deng
Qian Shi
Jun Li
author_sort Wenjing Deng
collection DOAJ
description Rapidly developing remote sensing technology provides massive data for urban planning, mapping, and disaster management. As a carrier of human productive activities, buildings are essential to both urban dynamic monitoring and suburban construction inspection. Fully-convolutional-network-based methods have provided a paradigm for automatically extracting buildings from high-resolution imagery. However, high intraclass variance and complexity are two problems in building extraction. It is hard to identify different scales of buildings by using a single receptive field. For this purpose, in this article, we use the stable encoder- decoder architecture, combined with a grid-based attention gate and atrous spatial pyramid pooling module, to capture and restore features progressively and effectively. A modified ResNet50 encoder is also applied to extract features. The proposed method could learn gated features and distinguish buildings from complex surroundings such as trees. We evaluate our model on two building datasets, WHU aerial building dataset and our DB UAV rural building dataset. Experiments show that our model outperforms other five most recent models. The results also exhibit great potential for extracting buildings with different scales and validate the effectiveness of deep learning in practical scenarios.
first_indexed 2024-12-21T23:00:08Z
format Article
id doaj.art-469a890e6d48455e916f7f7a01c1bdaa
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-12-21T23:00:08Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-469a890e6d48455e916f7f7a01c1bdaa2022-12-21T18:47:18ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01142611262010.1109/JSTARS.2021.30580979351600Attention-Gate-Based Encoder–Decoder Network for Automatical Building ExtractionWenjing Deng0https://orcid.org/0000-0002-9435-7195Qian Shi1https://orcid.org/0000-0002-1276-0352Jun Li2https://orcid.org/0000-0003-1613-9448School of Geography and Planning, Guangdong Provincial Key Laboratory of Urbanization and Geosimulation, Sun Yat-sen University, Guangzhou, ChinaSchool of Geography and Planning, Guangdong Provincial Key Laboratory of Urbanization and Geosimulation, Sun Yat-sen University, Guangzhou, ChinaSchool of Geography and Planning, Guangdong Provincial Key Laboratory of Urbanization and Geosimulation, Sun Yat-sen University, Guangzhou, ChinaRapidly developing remote sensing technology provides massive data for urban planning, mapping, and disaster management. As a carrier of human productive activities, buildings are essential to both urban dynamic monitoring and suburban construction inspection. Fully-convolutional-network-based methods have provided a paradigm for automatically extracting buildings from high-resolution imagery. However, high intraclass variance and complexity are two problems in building extraction. It is hard to identify different scales of buildings by using a single receptive field. For this purpose, in this article, we use the stable encoder- decoder architecture, combined with a grid-based attention gate and atrous spatial pyramid pooling module, to capture and restore features progressively and effectively. A modified ResNet50 encoder is also applied to extract features. The proposed method could learn gated features and distinguish buildings from complex surroundings such as trees. We evaluate our model on two building datasets, WHU aerial building dataset and our DB UAV rural building dataset. Experiments show that our model outperforms other five most recent models. The results also exhibit great potential for extracting buildings with different scales and validate the effectiveness of deep learning in practical scenarios.https://ieeexplore.ieee.org/document/9351600/Attention gate (AG)building extractiondeep learningfully convolutional networks (FCNs)semantic segmentation
spellingShingle Wenjing Deng
Qian Shi
Jun Li
Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Attention gate (AG)
building extraction
deep learning
fully convolutional networks (FCNs)
semantic segmentation
title Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction
title_full Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction
title_fullStr Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction
title_full_unstemmed Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction
title_short Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction
title_sort attention gate based encoder x2013 decoder network for automatical building extraction
topic Attention gate (AG)
building extraction
deep learning
fully convolutional networks (FCNs)
semantic segmentation
url https://ieeexplore.ieee.org/document/9351600/
work_keys_str_mv AT wenjingdeng attentiongatebasedencoderx2013decodernetworkforautomaticalbuildingextraction
AT qianshi attentiongatebasedencoderx2013decodernetworkforautomaticalbuildingextraction
AT junli attentiongatebasedencoderx2013decodernetworkforautomaticalbuildingextraction