SEMANTIC SEGMENTATION OF AERIAL IMAGERY VIA MULTI-SCALE SHUFFLING CONVOLUTIONAL NEURAL NETWORKS WITH DEEP SUPERVISION

In this paper, we address the semantic segmentation of aerial imagery based on the use of multi-modal data given in the form of true orthophotos and the corresponding Digital Surface Models (DSMs). We present the Deeply-supervised Shuffling Convolutional Neural Network (DSCNN) representing a multi-s...

Full description

Bibliographic Details
Main Authors: K. Chen, M. Weinmann, X. Sun, M. Yan, S. Hinz, B. Jutzi
Format: Article
Language:English
Published: Copernicus Publications 2018-09-01
Series:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-1/29/2018/isprs-annals-IV-1-29-2018.pdf
_version_ 1818529461056831488
author K. Chen
K. Chen
M. Weinmann
X. Sun
M. Yan
S. Hinz
B. Jutzi
M. Weinmann
author_facet K. Chen
K. Chen
M. Weinmann
X. Sun
M. Yan
S. Hinz
B. Jutzi
M. Weinmann
author_sort K. Chen
collection DOAJ
description In this paper, we address the semantic segmentation of aerial imagery based on the use of multi-modal data given in the form of true orthophotos and the corresponding Digital Surface Models (DSMs). We present the Deeply-supervised Shuffling Convolutional Neural Network (DSCNN) representing a multi-scale extension of the Shuffling Convolutional Neural Network (SCNN) with deep supervision. Thereby, we take the advantage of the SCNN involving the shuffling operator to effectively upsample feature maps and then fuse multiscale features derived from the intermediate layers of the SCNN, which results in the Multi-scale Shuffling Convolutional Neural Network (MSCNN). Based on the MSCNN, we derive the DSCNN by introducing additional losses into the intermediate layers of the MSCNN. In addition, we investigate the impact of using different sets of hand-crafted radiometric and geometric features derived from the true orthophotos and the DSMs on the semantic segmentation task. For performance evaluation, we use a commonly used benchmark dataset. The achieved results reveal that both multi-scale fusion and deep supervision contribute to an improvement in performance. Furthermore, the use of a diversity of hand-crafted radiometric and geometric features as input for the DSCNN does not provide the best numerical results, but smoother and improved detections for several objects.
first_indexed 2024-12-11T17:07:10Z
format Article
id doaj.art-2d6e428f313d4f329dced04a4cb83db1
institution Directory Open Access Journal
issn 2194-9042
2194-9050
language English
last_indexed 2024-12-11T17:07:10Z
publishDate 2018-09-01
publisher Copernicus Publications
record_format Article
series ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
spelling doaj.art-2d6e428f313d4f329dced04a4cb83db12022-12-22T00:57:39ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502018-09-01IV-1293610.5194/isprs-annals-IV-1-29-2018SEMANTIC SEGMENTATION OF AERIAL IMAGERY VIA MULTI-SCALE SHUFFLING CONVOLUTIONAL NEURAL NETWORKS WITH DEEP SUPERVISIONK. Chen0K. Chen1M. Weinmann2X. Sun3M. Yan4S. Hinz5B. Jutzi6M. Weinmann7Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing, P. R. ChinaUniversity of Chinese Academy of Sciences, Beijing, P. R. ChinaInstitute of Computer Science II, University of Bonn, Bonn, GermanyKey Laboratory of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing, P. R. ChinaKey Laboratory of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing, P. R. ChinaInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology, Karlsruhe, GermanyInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology, Karlsruhe, GermanyInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology, Karlsruhe, GermanyIn this paper, we address the semantic segmentation of aerial imagery based on the use of multi-modal data given in the form of true orthophotos and the corresponding Digital Surface Models (DSMs). We present the Deeply-supervised Shuffling Convolutional Neural Network (DSCNN) representing a multi-scale extension of the Shuffling Convolutional Neural Network (SCNN) with deep supervision. Thereby, we take the advantage of the SCNN involving the shuffling operator to effectively upsample feature maps and then fuse multiscale features derived from the intermediate layers of the SCNN, which results in the Multi-scale Shuffling Convolutional Neural Network (MSCNN). Based on the MSCNN, we derive the DSCNN by introducing additional losses into the intermediate layers of the MSCNN. In addition, we investigate the impact of using different sets of hand-crafted radiometric and geometric features derived from the true orthophotos and the DSMs on the semantic segmentation task. For performance evaluation, we use a commonly used benchmark dataset. The achieved results reveal that both multi-scale fusion and deep supervision contribute to an improvement in performance. Furthermore, the use of a diversity of hand-crafted radiometric and geometric features as input for the DSCNN does not provide the best numerical results, but smoother and improved detections for several objects.https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-1/29/2018/isprs-annals-IV-1-29-2018.pdf
spellingShingle K. Chen
K. Chen
M. Weinmann
X. Sun
M. Yan
S. Hinz
B. Jutzi
M. Weinmann
SEMANTIC SEGMENTATION OF AERIAL IMAGERY VIA MULTI-SCALE SHUFFLING CONVOLUTIONAL NEURAL NETWORKS WITH DEEP SUPERVISION
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
title SEMANTIC SEGMENTATION OF AERIAL IMAGERY VIA MULTI-SCALE SHUFFLING CONVOLUTIONAL NEURAL NETWORKS WITH DEEP SUPERVISION
title_full SEMANTIC SEGMENTATION OF AERIAL IMAGERY VIA MULTI-SCALE SHUFFLING CONVOLUTIONAL NEURAL NETWORKS WITH DEEP SUPERVISION
title_fullStr SEMANTIC SEGMENTATION OF AERIAL IMAGERY VIA MULTI-SCALE SHUFFLING CONVOLUTIONAL NEURAL NETWORKS WITH DEEP SUPERVISION
title_full_unstemmed SEMANTIC SEGMENTATION OF AERIAL IMAGERY VIA MULTI-SCALE SHUFFLING CONVOLUTIONAL NEURAL NETWORKS WITH DEEP SUPERVISION
title_short SEMANTIC SEGMENTATION OF AERIAL IMAGERY VIA MULTI-SCALE SHUFFLING CONVOLUTIONAL NEURAL NETWORKS WITH DEEP SUPERVISION
title_sort semantic segmentation of aerial imagery via multi scale shuffling convolutional neural networks with deep supervision
url https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-1/29/2018/isprs-annals-IV-1-29-2018.pdf
work_keys_str_mv AT kchen semanticsegmentationofaerialimageryviamultiscaleshufflingconvolutionalneuralnetworkswithdeepsupervision
AT kchen semanticsegmentationofaerialimageryviamultiscaleshufflingconvolutionalneuralnetworkswithdeepsupervision
AT mweinmann semanticsegmentationofaerialimageryviamultiscaleshufflingconvolutionalneuralnetworkswithdeepsupervision
AT xsun semanticsegmentationofaerialimageryviamultiscaleshufflingconvolutionalneuralnetworkswithdeepsupervision
AT myan semanticsegmentationofaerialimageryviamultiscaleshufflingconvolutionalneuralnetworkswithdeepsupervision
AT shinz semanticsegmentationofaerialimageryviamultiscaleshufflingconvolutionalneuralnetworkswithdeepsupervision
AT bjutzi semanticsegmentationofaerialimageryviamultiscaleshufflingconvolutionalneuralnetworkswithdeepsupervision
AT mweinmann semanticsegmentationofaerialimageryviamultiscaleshufflingconvolutionalneuralnetworkswithdeepsupervision