A 3D BUILDING INDOOR-OUTDOOR BENCHMARK FOR SEMANTIC SEGMENTATION

Bibliographic Details
Main Authors: Y. Cao, M. Scaioni
Format: Article
Language: English
Published: Copernicus Publications 2023-12-01
Series: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access: https://isprs-archives.copernicus.org/articles/XLVIII-1-W2-2023/147/2023/isprs-archives-XLVIII-1-W2-2023-147-2023.pdf
_version_ 1827585388821086208
author Y. Cao
M. Scaioni
collection DOAJ
description Both machine learning (ML) and deep learning (DL) algorithms require high-quality training samples and precise, thorough annotations to work effectively. The 3D building indoor-outdoor dataset (BIO dataset), a highly accurate dataset with a high level of detail and high coverage for 3D building point cloud and mesh semantic segmentation, is established as a canonical benchmark dataset. It contains 100 building models, in which building structural elements are annotated into 11 semantic categories. Each building in the dataset has an average of 75,587 triangular faces, and the total area of the dataset is 481,769 square meters. Furthermore, semantic segmentation of the dataset was carried out using the Random Forest ML algorithm to verify the dataset’s accessibility. A weighted F1 score of 96.64% was obtained with 10% of the segments of each building randomly chosen as training data. For applications involving building geometry data, the BIO dataset can support a broad class of recently developed ML and DL methodologies.
first_indexed 2024-03-08T23:45:49Z
format Article
id doaj.art-8c61291a107149e1a335d1dc46222a98
institution Directory Open Access Journal
issn 1682-1750
2194-9034
language English
last_indexed 2024-03-08T23:45:49Z
publishDate 2023-12-01
publisher Copernicus Publications
record_format Article
series The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
spelling doaj.art-8c61291a107149e1a335d1dc46222a98
2023-12-13T23:30:11Z
eng
Copernicus Publications
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
1682-1750
2194-9034
2023-12-01
XLVIII-1-W2-2023
147
153
10.5194/isprs-archives-XLVIII-1-W2-2023-147-2023
A 3D BUILDING INDOOR-OUTDOOR BENCHMARK FOR SEMANTIC SEGMENTATION
Y. Cao (Department of Architecture, Built Environment and Construction Engineering, Politecnico di Milano, via Ponzio 31, 20133, Milan, Italy)
M. Scaioni (Department of Architecture, Built Environment and Construction Engineering, Politecnico di Milano, via Ponzio 31, 20133, Milan, Italy)
https://isprs-archives.copernicus.org/articles/XLVIII-1-W2-2023/147/2023/isprs-archives-XLVIII-1-W2-2023-147-2023.pdf
title A 3D BUILDING INDOOR-OUTDOOR BENCHMARK FOR SEMANTIC SEGMENTATION
url https://isprs-archives.copernicus.org/articles/XLVIII-1-W2-2023/147/2023/isprs-archives-XLVIII-1-W2-2023-147-2023.pdf