BUILDING SECTION INSTANCE SEGMENTATION WITH COMBINED CLASSICAL AND DEEP LEARNING METHODS

In big cities, the complexity of urban infrastructure is very high. In city centers, one construction can consist of several building sections of different heights or roof geometries. Most of the existing approaches detect those buildings as a single construction in the form of binary building segme...

Full description

Bibliographic Details
Main Authors: P. Schuegraf, J. Schnell, C. Henry, K. Bittner
Format: Article
Language:English
Published: Copernicus Publications 2022-05-01
Series:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/V-2-2022/407/2022/isprs-annals-V-2-2022-407-2022.pdf
_version_ 1818253422494744576
author P. Schuegraf
J. Schnell
C. Henry
K. Bittner
author_facet P. Schuegraf
J. Schnell
C. Henry
K. Bittner
author_sort P. Schuegraf
collection DOAJ
description In big cities, the complexity of urban infrastructure is very high. In city centers, one construction can consist of several building sections of different heights or roof geometries. Most of the existing approaches detect those buildings as a single construction in the form of binary building segmentation maps or as one instance of object-oriented segmentation. However, reconstructing complex buildings consisting of several parts requires a higher level of detail. In this work, we present a methodology for individual building section instance segmentation on satellite imagery. We show that fully convolutional networks (FCNs) can tackle the issue much better than the state-of-the-art Mask-RCNN. A ground truth raster image with pixel value 1 for building sections and 2 for their touching borders was generated to train models on predicting both classes as a semantic output. The semantic outputs were then post-processed with the help of morphology and watershed labeling to generate segmentation on the instance level. The combination of a deep learning-based approach and a classical image processing algorithm allowed us to fulfill the segmentation task on the instance level and reach high-quality results with an mAP of up to 42 %.
first_indexed 2024-12-12T16:39:49Z
format Article
id doaj.art-f3084f45617a43429e8f4891483bbfd9
institution Directory Open Access Journal
issn 2194-9042
2194-9050
language English
last_indexed 2024-12-12T16:39:49Z
publishDate 2022-05-01
publisher Copernicus Publications
record_format Article
series ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
spelling doaj.art-f3084f45617a43429e8f4891483bbfd92022-12-22T00:18:36ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502022-05-01V-2-202240741410.5194/isprs-annals-V-2-2022-407-2022BUILDING SECTION INSTANCE SEGMENTATION WITH COMBINED CLASSICAL AND DEEP LEARNING METHODSP. Schuegraf0J. Schnell1C. Henry2K. Bittner3Remote Sensing Technology Institute, German Aerospace Center (DLR), Wessling, GermanyEnvironmental Engineering, Technical University of Darmstadt (TU Darmstadt), Darmstadt, GermanyRemote Sensing Technology Institute, German Aerospace Center (DLR), Wessling, GermanyRemote Sensing Technology Institute, German Aerospace Center (DLR), Wessling, GermanyIn big cities, the complexity of urban infrastructure is very high. In city centers, one construction can consist of several building sections of different heights or roof geometries. Most of the existing approaches detect those buildings as a single construction in the form of binary building segmentation maps or as one instance of object-oriented segmentation. However, reconstructing complex buildings consisting of several parts requires a higher level of detail. In this work, we present a methodology for individual building section instance segmentation on satellite imagery. We show that fully convolutional networks (FCNs) can tackle the issue much better than the state-of-the-art Mask-RCNN. A ground truth raster image with pixel value 1 for building sections and 2 for their touching borders was generated to train models on predicting both classes as a semantic output. The semantic outputs were then post-processed with the help of morphology and watershed labeling to generate segmentation on the instance level. The combination of a deep learning-based approach and a classical image processing algorithm allowed us to fulfill the segmentation task on the instance level and reach high-quality results with an mAP of up to 42 %.https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/V-2-2022/407/2022/isprs-annals-V-2-2022-407-2022.pdf
spellingShingle P. Schuegraf
J. Schnell
C. Henry
K. Bittner
BUILDING SECTION INSTANCE SEGMENTATION WITH COMBINED CLASSICAL AND DEEP LEARNING METHODS
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
title BUILDING SECTION INSTANCE SEGMENTATION WITH COMBINED CLASSICAL AND DEEP LEARNING METHODS
title_full BUILDING SECTION INSTANCE SEGMENTATION WITH COMBINED CLASSICAL AND DEEP LEARNING METHODS
title_fullStr BUILDING SECTION INSTANCE SEGMENTATION WITH COMBINED CLASSICAL AND DEEP LEARNING METHODS
title_full_unstemmed BUILDING SECTION INSTANCE SEGMENTATION WITH COMBINED CLASSICAL AND DEEP LEARNING METHODS
title_short BUILDING SECTION INSTANCE SEGMENTATION WITH COMBINED CLASSICAL AND DEEP LEARNING METHODS
title_sort building section instance segmentation with combined classical and deep learning methods
url https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/V-2-2022/407/2022/isprs-annals-V-2-2022-407-2022.pdf
work_keys_str_mv AT pschuegraf buildingsectioninstancesegmentationwithcombinedclassicalanddeeplearningmethods
AT jschnell buildingsectioninstancesegmentationwithcombinedclassicalanddeeplearningmethods
AT chenry buildingsectioninstancesegmentationwithcombinedclassicalanddeeplearningmethods
AT kbittner buildingsectioninstancesegmentationwithcombinedclassicalanddeeplearningmethods