CPISNet: Delving into Consistent Proposals of Instance Segmentation Network for High-Resolution Aerial Images

Instance segmentation of high-resolution aerial images is challenging when compared to object detection and semantic segmentation in remote sensing applications. It adopts boundary-aware mask predictions, instead of traditional bounding boxes, to locate the objects-of-interest in pixel-wise. Meanwhi...

Full description

Bibliographic Details
Main Authors: Xiangfeng Zeng, Shunjun Wei, Jinshan Wei, Zichen Zhou, Jun Shi, Xiaoling Zhang, Fan Fan
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/13/14/2788
_version_ 1797526118959415296
author Xiangfeng Zeng
Shunjun Wei
Jinshan Wei
Zichen Zhou
Jun Shi
Xiaoling Zhang
Fan Fan
author_facet Xiangfeng Zeng
Shunjun Wei
Jinshan Wei
Zichen Zhou
Jun Shi
Xiaoling Zhang
Fan Fan
author_sort Xiangfeng Zeng
collection DOAJ
description Instance segmentation of high-resolution aerial images is challenging when compared to object detection and semantic segmentation in remote sensing applications. It adopts boundary-aware mask predictions, instead of traditional bounding boxes, to locate the objects-of-interest in pixel-wise. Meanwhile, instance segmentation can distinguish the densely distributed objects within a certain category by a different color, which is unavailable in semantic segmentation. Despite the distinct advantages, there are rare methods which are dedicated to the high-quality instance segmentation for high-resolution aerial images. In this paper, a novel instance segmentation method, termed consistent proposals of instance segmentation network (CPISNet), for high-resolution aerial images is proposed. Following top-down instance segmentation formula, it adopts the adaptive feature extraction network (AFEN) to extract the multi-level bottom-up augmented feature maps in design space level. Then, elaborated RoI extractor (ERoIE) is designed to extract the mask RoIs via the refined bounding boxes from proposal consistent cascaded (PCC) architecture and multi-level features from AFEN. Finally, the convolution block with shortcut connection is responsible for generating the binary mask for instance segmentation. Experimental conclusions can be drawn on the iSAID and NWPU VHR-10 instance segmentation dataset: (1) Each individual module in CPISNet acts on the whole instance segmentation utility; (2) CPISNet* exceeds vanilla Mask R-CNN 3.4%/3.8% <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi><mi>P</mi></mrow></semantics></math></inline-formula> on iSAID validation/test set and 9.2% <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi><mi>P</mi></mrow></semantics></math></inline-formula> on NWPU VHR-10 instance segmentation dataset; (3) The aliasing masks, missing segmentations, false alarms, and poorly segmented masks can be avoided to some extent for CPISNet; (4) CPISNet receives high precision of instance segmentation for aerial images and interprets the objects with fitting boundary.
first_indexed 2024-03-10T09:24:42Z
format Article
id doaj.art-6457186517294aa7a0b76323613d4c3c
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-10T09:24:42Z
publishDate 2021-07-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-6457186517294aa7a0b76323613d4c3c2023-11-22T04:52:23ZengMDPI AGRemote Sensing2072-42922021-07-011314278810.3390/rs13142788CPISNet: Delving into Consistent Proposals of Instance Segmentation Network for High-Resolution Aerial ImagesXiangfeng Zeng0Shunjun Wei1Jinshan Wei2Zichen Zhou3Jun Shi4Xiaoling Zhang5Fan Fan6School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, ChinaInstance segmentation of high-resolution aerial images is challenging when compared to object detection and semantic segmentation in remote sensing applications. It adopts boundary-aware mask predictions, instead of traditional bounding boxes, to locate the objects-of-interest in pixel-wise. Meanwhile, instance segmentation can distinguish the densely distributed objects within a certain category by a different color, which is unavailable in semantic segmentation. Despite the distinct advantages, there are rare methods which are dedicated to the high-quality instance segmentation for high-resolution aerial images. In this paper, a novel instance segmentation method, termed consistent proposals of instance segmentation network (CPISNet), for high-resolution aerial images is proposed. Following top-down instance segmentation formula, it adopts the adaptive feature extraction network (AFEN) to extract the multi-level bottom-up augmented feature maps in design space level. Then, elaborated RoI extractor (ERoIE) is designed to extract the mask RoIs via the refined bounding boxes from proposal consistent cascaded (PCC) architecture and multi-level features from AFEN. Finally, the convolution block with shortcut connection is responsible for generating the binary mask for instance segmentation. Experimental conclusions can be drawn on the iSAID and NWPU VHR-10 instance segmentation dataset: (1) Each individual module in CPISNet acts on the whole instance segmentation utility; (2) CPISNet* exceeds vanilla Mask R-CNN 3.4%/3.8% <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi><mi>P</mi></mrow></semantics></math></inline-formula> on iSAID validation/test set and 9.2% <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi><mi>P</mi></mrow></semantics></math></inline-formula> on NWPU VHR-10 instance segmentation dataset; (3) The aliasing masks, missing segmentations, false alarms, and poorly segmented masks can be avoided to some extent for CPISNet; (4) CPISNet receives high precision of instance segmentation for aerial images and interprets the objects with fitting boundary.https://www.mdpi.com/2072-4292/13/14/2788instance segmentationaerial imagesregion proposalsconvolutional neural networks
spellingShingle Xiangfeng Zeng
Shunjun Wei
Jinshan Wei
Zichen Zhou
Jun Shi
Xiaoling Zhang
Fan Fan
CPISNet: Delving into Consistent Proposals of Instance Segmentation Network for High-Resolution Aerial Images
Remote Sensing
instance segmentation
aerial images
region proposals
convolutional neural networks
title CPISNet: Delving into Consistent Proposals of Instance Segmentation Network for High-Resolution Aerial Images
title_full CPISNet: Delving into Consistent Proposals of Instance Segmentation Network for High-Resolution Aerial Images
title_fullStr CPISNet: Delving into Consistent Proposals of Instance Segmentation Network for High-Resolution Aerial Images
title_full_unstemmed CPISNet: Delving into Consistent Proposals of Instance Segmentation Network for High-Resolution Aerial Images
title_short CPISNet: Delving into Consistent Proposals of Instance Segmentation Network for High-Resolution Aerial Images
title_sort cpisnet delving into consistent proposals of instance segmentation network for high resolution aerial images
topic instance segmentation
aerial images
region proposals
convolutional neural networks
url https://www.mdpi.com/2072-4292/13/14/2788
work_keys_str_mv AT xiangfengzeng cpisnetdelvingintoconsistentproposalsofinstancesegmentationnetworkforhighresolutionaerialimages
AT shunjunwei cpisnetdelvingintoconsistentproposalsofinstancesegmentationnetworkforhighresolutionaerialimages
AT jinshanwei cpisnetdelvingintoconsistentproposalsofinstancesegmentationnetworkforhighresolutionaerialimages
AT zichenzhou cpisnetdelvingintoconsistentproposalsofinstancesegmentationnetworkforhighresolutionaerialimages
AT junshi cpisnetdelvingintoconsistentproposalsofinstancesegmentationnetworkforhighresolutionaerialimages
AT xiaolingzhang cpisnetdelvingintoconsistentproposalsofinstancesegmentationnetworkforhighresolutionaerialimages
AT fanfan cpisnetdelvingintoconsistentproposalsofinstancesegmentationnetworkforhighresolutionaerialimages