Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery
Advances in deep learning techniques for remote sensing as well as the increased availability of high-resolution data enable the extraction of more detailed information from aerial images. One promising task is the semantic segmentation of roof segments and their orientation. However, the lack of an...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-04-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/15/7/1931 |
_version_ | 1827746466039332864 |
---|---|
author | Florian L. Faltermeier Sebastian Krapf Bruno Willenborg Thomas H. Kolbe |
author_facet | Florian L. Faltermeier Sebastian Krapf Bruno Willenborg Thomas H. Kolbe |
author_sort | Florian L. Faltermeier |
collection | DOAJ |
description | Advances in deep learning techniques for remote sensing as well as the increased availability of high-resolution data enable the extraction of more detailed information from aerial images. One promising task is the semantic segmentation of roof segments and their orientation. However, the lack of annotated data is a major barrier for deploying respective models on a large scale. Previous research demonstrated the viability of the deep learning approach for the task, but currently, published datasets are small-scale, manually labeled, and rare. Therefore, this paper extends the state of the art by presenting a novel method for the automated generation of large-scale datasets based on semantic 3D city models. Furthermore, we train a model on a dataset 50 times larger than existing datasets and achieve superior performance while applying it to a wider variety of buildings. We evaluate the approach by comparing networks trained on four dataset configurations, including an existing dataset and our novel large-scale dataset. The results show that the network performance measured as intersection over union can be increased from 0.60 for the existing dataset to 0.70 when the large-scale model is applied on the same region. The large-scale model performs superiorly even when applied to more diverse test samples, achieving 0.635. The novel approach contributes to solving the dataset bottleneck and consequently to improving semantic segmentation of roof segments. The resulting remotely sensed information is crucial for applications such as solar potential analysis or urban planning. |
first_indexed | 2024-03-11T05:25:45Z |
format | Article |
id | doaj.art-5e63d3337aa94ddb8270a0067a86f675 |
institution | Directory Open Access Journal |
issn | 2072-4292 |
language | English |
last_indexed | 2024-03-11T05:25:45Z |
publishDate | 2023-04-01 |
publisher | MDPI AG |
record_format | Article |
series | Remote Sensing |
spelling | doaj.art-5e63d3337aa94ddb8270a0067a86f6752023-11-17T17:31:05ZengMDPI AGRemote Sensing2072-42922023-04-01157193110.3390/rs15071931Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial ImageryFlorian L. Faltermeier0Sebastian Krapf1Bruno Willenborg2Thomas H. Kolbe3Chair of Geoinformatics, TUM School of Engineering and Design, Technical University of Munich, 80333 Munich, GermanyInstitute of Automotive Technology, Department of Mechanical Engineering, TUM School of Engineering and Design, Technical University of Munich, 80333 Munich, GermanyChair of Geoinformatics, TUM School of Engineering and Design, Technical University of Munich, 80333 Munich, GermanyChair of Geoinformatics, TUM School of Engineering and Design, Technical University of Munich, 80333 Munich, GermanyAdvances in deep learning techniques for remote sensing as well as the increased availability of high-resolution data enable the extraction of more detailed information from aerial images. One promising task is the semantic segmentation of roof segments and their orientation. However, the lack of annotated data is a major barrier for deploying respective models on a large scale. Previous research demonstrated the viability of the deep learning approach for the task, but currently, published datasets are small-scale, manually labeled, and rare. Therefore, this paper extends the state of the art by presenting a novel method for the automated generation of large-scale datasets based on semantic 3D city models. Furthermore, we train a model on a dataset 50 times larger than existing datasets and achieve superior performance while applying it to a wider variety of buildings. We evaluate the approach by comparing networks trained on four dataset configurations, including an existing dataset and our novel large-scale dataset. The results show that the network performance measured as intersection over union can be increased from 0.60 for the existing dataset to 0.70 when the large-scale model is applied on the same region. The large-scale model performs superiorly even when applied to more diverse test samples, achieving 0.635. The novel approach contributes to solving the dataset bottleneck and consequently to improving semantic segmentation of roof segments. The resulting remotely sensed information is crucial for applications such as solar potential analysis or urban planning.https://www.mdpi.com/2072-4292/15/7/1931CityGML3D city modelsaerial imagesremote sensingdatasetlabeling |
spellingShingle | Florian L. Faltermeier Sebastian Krapf Bruno Willenborg Thomas H. Kolbe Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery Remote Sensing CityGML 3D city models aerial images remote sensing dataset labeling |
title | Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery |
title_full | Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery |
title_fullStr | Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery |
title_full_unstemmed | Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery |
title_short | Improving Semantic Segmentation of Roof Segments Using Large-Scale Datasets Derived from 3D City Models and High-Resolution Aerial Imagery |
title_sort | improving semantic segmentation of roof segments using large scale datasets derived from 3d city models and high resolution aerial imagery |
topic | CityGML 3D city models aerial images remote sensing dataset labeling |
url | https://www.mdpi.com/2072-4292/15/7/1931 |
work_keys_str_mv | AT florianlfaltermeier improvingsemanticsegmentationofroofsegmentsusinglargescaledatasetsderivedfrom3dcitymodelsandhighresolutionaerialimagery AT sebastiankrapf improvingsemanticsegmentationofroofsegmentsusinglargescaledatasetsderivedfrom3dcitymodelsandhighresolutionaerialimagery AT brunowillenborg improvingsemanticsegmentationofroofsegmentsusinglargescaledatasetsderivedfrom3dcitymodelsandhighresolutionaerialimagery AT thomashkolbe improvingsemanticsegmentationofroofsegmentsusinglargescaledatasetsderivedfrom3dcitymodelsandhighresolutionaerialimagery |