Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery

Advances in deep learning techniques for remote sensing as well as the increased availability of high-resolution data enable the extraction of more detailed information from aerial images. One promising task is the semantic segmentation of roof segments and their orientation. However, the lack of an...

Full description

Bibliographic Details
Main Authors: Florian L. Faltermeier, Sebastian Krapf, Bruno Willenborg, Thomas H. Kolbe
Format: Article
Language:English
Published: MDPI AG 2023-04-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/15/7/1931
_version_ 1827746466039332864
author Florian L. Faltermeier
Sebastian Krapf
Bruno Willenborg
Thomas H. Kolbe
author_facet Florian L. Faltermeier
Sebastian Krapf
Bruno Willenborg
Thomas H. Kolbe
author_sort Florian L. Faltermeier
collection DOAJ
description Advances in deep learning techniques for remote sensing as well as the increased availability of high-resolution data enable the extraction of more detailed information from aerial images. One promising task is the semantic segmentation of roof segments and their orientation. However, the lack of annotated data is a major barrier for deploying respective models on a large scale. Previous research demonstrated the viability of the deep learning approach for the task, but currently, published datasets are small-scale, manually labeled, and rare. Therefore, this paper extends the state of the art by presenting a novel method for the automated generation of large-scale datasets based on semantic 3D city models. Furthermore, we train a model on a dataset 50 times larger than existing datasets and achieve superior performance while applying it to a wider variety of buildings. We evaluate the approach by comparing networks trained on four dataset configurations, including an existing dataset and our novel large-scale dataset. The results show that the network performance measured as intersection over union can be increased from 0.60 for the existing dataset to 0.70 when the large-scale model is applied on the same region. The large-scale model performs superiorly even when applied to more diverse test samples, achieving 0.635. The novel approach contributes to solving the dataset bottleneck and consequently to improving semantic segmentation of roof segments. The resulting remotely sensed information is crucial for applications such as solar potential analysis or urban planning.
first_indexed 2024-03-11T05:25:45Z
format Article
id doaj.art-5e63d3337aa94ddb8270a0067a86f675
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-11T05:25:45Z
publishDate 2023-04-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-5e63d3337aa94ddb8270a0067a86f6752023-11-17T17:31:05ZengMDPI AGRemote Sensing2072-42922023-04-01157193110.3390/rs15071931Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial ImageryFlorian L. Faltermeier0Sebastian Krapf1Bruno Willenborg2Thomas H. Kolbe3Chair of Geoinformatics, TUM School of Engineering and Design, Technical University of Munich, 80333 Munich, GermanyInstitute of Automotive Technology, Department of Mechanical Engineering, TUM School of Engineering and Design, Technical University of Munich, 80333 Munich, GermanyChair of Geoinformatics, TUM School of Engineering and Design, Technical University of Munich, 80333 Munich, GermanyChair of Geoinformatics, TUM School of Engineering and Design, Technical University of Munich, 80333 Munich, GermanyAdvances in deep learning techniques for remote sensing as well as the increased availability of high-resolution data enable the extraction of more detailed information from aerial images. One promising task is the semantic segmentation of roof segments and their orientation. However, the lack of annotated data is a major barrier for deploying respective models on a large scale. Previous research demonstrated the viability of the deep learning approach for the task, but currently, published datasets are small-scale, manually labeled, and rare. Therefore, this paper extends the state of the art by presenting a novel method for the automated generation of large-scale datasets based on semantic 3D city models. Furthermore, we train a model on a dataset 50 times larger than existing datasets and achieve superior performance while applying it to a wider variety of buildings. We evaluate the approach by comparing networks trained on four dataset configurations, including an existing dataset and our novel large-scale dataset. The results show that the network performance measured as intersection over union can be increased from 0.60 for the existing dataset to 0.70 when the large-scale model is applied on the same region. The large-scale model performs superiorly even when applied to more diverse test samples, achieving 0.635. The novel approach contributes to solving the dataset bottleneck and consequently to improving semantic segmentation of roof segments. The resulting remotely sensed information is crucial for applications such as solar potential analysis or urban planning.https://www.mdpi.com/2072-4292/15/7/1931CityGML3D city modelsaerial imagesremote sensingdatasetlabeling
spellingShingle Florian L. Faltermeier
Sebastian Krapf
Bruno Willenborg
Thomas H. Kolbe
Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery
Remote Sensing
CityGML
3D city models
aerial images
remote sensing
dataset
labeling
title Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery
title_full Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery
title_fullStr Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery
title_full_unstemmed Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery
title_short Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery
title_sort improving semantic segmentation of roof segments using large scale datasets derived from 3d city models and high resolution aerial imagery
topic CityGML
3D city models
aerial images
remote sensing
dataset
labeling
url https://www.mdpi.com/2072-4292/15/7/1931
work_keys_str_mv AT florianlfaltermeier improvingsemanticsegmentationofroofsegmentsusinglargescaledatasetsderivedfrom3dcitymodelsandhighresolutionaerialimagery
AT sebastiankrapf improvingsemanticsegmentationofroofsegmentsusinglargescaledatasetsderivedfrom3dcitymodelsandhighresolutionaerialimagery
AT brunowillenborg improvingsemanticsegmentationofroofsegmentsusinglargescaledatasetsderivedfrom3dcitymodelsandhighresolutionaerialimagery
AT thomashkolbe improvingsemanticsegmentationofroofsegmentsusinglargescaledatasetsderivedfrom3dcitymodelsandhighresolutionaerialimagery