Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets

Although semantic segmentation of remote-sensing (RS) images using deep-learning networks has demonstrated its effectiveness recently, compared with natural-image datasets, obtaining RS images under the same conditions to construct data labels is difficult. Indeed, small datasets limit the effective...

Full description

Bibliographic Details
Main Authors:	Ahram Song, Yongil Kim
Format:	Article
Language:	English
Published:	MDPI AG 2020-10-01
Series:	ISPRS International Journal of Geo-Information
Subjects:	semantic segmentation deep learning big dataset ISPRS Potsdam dataset Cityscape dataset
Online Access:	https://www.mdpi.com/2220-9964/9/10/601

_version_	1797551264454672384
author	Ahram Song Yongil Kim
author_facet	Ahram Song Yongil Kim
author_sort	Ahram Song
collection	DOAJ
description	Although semantic segmentation of remote-sensing (RS) images using deep-learning networks has demonstrated its effectiveness recently, compared with natural-image datasets, obtaining RS images under the same conditions to construct data labels is difficult. Indeed, small datasets limit the effective learning of deep-learning networks. To address this problem, we propose a combined U-net model that is trained using a combined weighted loss function and can handle heterogeneous datasets. The network consists of encoder and decoder blocks. The convolutional layers that form the encoder blocks are shared with the heterogeneous datasets, and the decoder blocks are assigned separate training weights. Herein, the International Society for Photogrammetry and Remote Sensing (ISPRS) Potsdam and Cityscape datasets are used as the RS and natural-image datasets, respectively. When the layers are shared, only visible bands of the ISPRS Potsdam data are used. Experimental results show that when same-sized heterogeneous datasets are used, the semantic segmentation accuracy of the Potsdam data obtained using our proposed method is lower than that obtained using only the Potsdam data (four bands) with other methods, such as SegNet, DeepLab-V3+, and the simplified version of U-net. However, the segmentation accuracy of the Potsdam images is improved when the larger Cityscape dataset is used. The combined U-net model can effectively train heterogeneous datasets and overcome the insufficient training data problem in the context of RS-image datasets. Furthermore, it is expected that the proposed method can not only be applied to segmentation tasks of aerial images but also to tasks with various purposes of using big heterogeneous datasets.
first_indexed	2024-03-10T15:42:07Z
format	Article
id	doaj.art-20eb07496b7541cc95d7fe08d01c7a75
institution	Directory Open Access Journal
issn	2220-9964
language	English
last_indexed	2024-03-10T15:42:07Z
publishDate	2020-10-01
publisher	MDPI AG
record_format	Article
series	ISPRS International Journal of Geo-Information
spelling	doaj.art-20eb07496b7541cc95d7fe08d01c7a752023-11-20T16:41:58ZengMDPI AGISPRS International Journal of Geo-Information2220-99642020-10-0191060110.3390/ijgi9100601Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape DatasetsAhram Song0Yongil Kim1Department of Civil and Environmental Engineering, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, KoreaDepartment of Civil and Environmental Engineering, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, KoreaAlthough semantic segmentation of remote-sensing (RS) images using deep-learning networks has demonstrated its effectiveness recently, compared with natural-image datasets, obtaining RS images under the same conditions to construct data labels is difficult. Indeed, small datasets limit the effective learning of deep-learning networks. To address this problem, we propose a combined U-net model that is trained using a combined weighted loss function and can handle heterogeneous datasets. The network consists of encoder and decoder blocks. The convolutional layers that form the encoder blocks are shared with the heterogeneous datasets, and the decoder blocks are assigned separate training weights. Herein, the International Society for Photogrammetry and Remote Sensing (ISPRS) Potsdam and Cityscape datasets are used as the RS and natural-image datasets, respectively. When the layers are shared, only visible bands of the ISPRS Potsdam data are used. Experimental results show that when same-sized heterogeneous datasets are used, the semantic segmentation accuracy of the Potsdam data obtained using our proposed method is lower than that obtained using only the Potsdam data (four bands) with other methods, such as SegNet, DeepLab-V3+, and the simplified version of U-net. However, the segmentation accuracy of the Potsdam images is improved when the larger Cityscape dataset is used. The combined U-net model can effectively train heterogeneous datasets and overcome the insufficient training data problem in the context of RS-image datasets. Furthermore, it is expected that the proposed method can not only be applied to segmentation tasks of aerial images but also to tasks with various purposes of using big heterogeneous datasets.https://www.mdpi.com/2220-9964/9/10/601semantic segmentationdeep learningbig datasetISPRS Potsdam datasetCityscape dataset
spellingShingle	Ahram Song Yongil Kim Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets ISPRS International Journal of Geo-Information semantic segmentation deep learning big dataset ISPRS Potsdam dataset Cityscape dataset
title	Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets
title_full	Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets
title_fullStr	Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets
title_full_unstemmed	Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets
title_short	Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets
title_sort	semantic segmentation of remote sensing imagery using heterogeneous big data international society for photogrammetry and remote sensing potsdam and cityscape datasets
topic	semantic segmentation deep learning big dataset ISPRS Potsdam dataset Cityscape dataset
url	https://www.mdpi.com/2220-9964/9/10/601
work_keys_str_mv	AT ahramsong semanticsegmentationofremotesensingimageryusingheterogeneousbigdatainternationalsocietyforphotogrammetryandremotesensingpotsdamandcityscapedatasets AT yongilkim semanticsegmentationofremotesensingimageryusingheterogeneousbigdatainternationalsocietyforphotogrammetryandremotesensingpotsdamandcityscapedatasets

Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets

Similar Items