An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation
Abstract To improve the accuracy of remote-sensing image semantic segmentation in complex scenario, an improved DeepLabv3+ lightweight neural network is proposed. Specifically, the lightweight network MobileNetv2 is used as the backbone network. In atrous spatial pyramid pooling (ASPP), to alleviate...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Springer
2023-12-01
|
Series: | Complex & Intelligent Systems |
Subjects: | |
Online Access: | https://doi.org/10.1007/s40747-023-01304-z |
_version_ | 1797233216986284032 |
---|---|
author | Hui Chen Yuanshou Qin Xinyuan Liu Haitao Wang Jinling Zhao |
author_facet | Hui Chen Yuanshou Qin Xinyuan Liu Haitao Wang Jinling Zhao |
author_sort | Hui Chen |
collection | DOAJ |
description | Abstract To improve the accuracy of remote-sensing image semantic segmentation in complex scenario, an improved DeepLabv3+ lightweight neural network is proposed. Specifically, the lightweight network MobileNetv2 is used as the backbone network. In atrous spatial pyramid pooling (ASPP), to alleviate the gridding effect, the Dilated Convolution in original DeepLabv3+ network is replaced with the Hybrid Dilated Convolution (HDC) module. In addition, the traditional spatial mean pooling is replaced by the strip pooling module (SPN) to improve the local segmentation effect. In the decoder, to obtain the rich low-level target edge information, the ResNet50 residual network is added after the low-level feature fusion. To enhance the shallow semantic information, the efficient and lightweight Normalization-based Attention Module (NAM) is added to capture the feature information of small target objects. The results show that, under the INRIA Aerial Image Dataset and same parameter setting, the Mean Pixel Accuracy (MPA) and Mean Intersection over Union (MIoU) are generally best than DeepLabv3+ , U-Net, and PSP-Net, which are respectively improved by 1.22%, − 0.22%, and 2.22% and 2.17%, 1.35%, and 3.42%. Our proposed method has also a good performance on the small object segmentation and multi-object segmentation. What’s more, it significantly converges faster with fewer model parameters and stronger computing power while ensuring the segmentation effect. It is proved to be robust and can provide a methodological reference for high-precision remote-sensing image semantic segmentation. |
first_indexed | 2024-04-24T16:12:39Z |
format | Article |
id | doaj.art-dc85669217044289bf6d88746d25406b |
institution | Directory Open Access Journal |
issn | 2199-4536 2198-6053 |
language | English |
last_indexed | 2024-04-24T16:12:39Z |
publishDate | 2023-12-01 |
publisher | Springer |
record_format | Article |
series | Complex & Intelligent Systems |
spelling | doaj.art-dc85669217044289bf6d88746d25406b2024-03-31T11:39:07ZengSpringerComplex & Intelligent Systems2199-45362198-60532023-12-011022839284910.1007/s40747-023-01304-zAn improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentationHui Chen0Yuanshou Qin1Xinyuan Liu2Haitao Wang3Jinling Zhao4School of Internet, Anhui UniversitySchool of Internet, Anhui UniversitySchool of Internet, Anhui UniversitySchool of Internet, Anhui UniversityNational Engineering Research Center for Analysis and Application of Agro-Ecological Big Data, Anhui UniversityAbstract To improve the accuracy of remote-sensing image semantic segmentation in complex scenario, an improved DeepLabv3+ lightweight neural network is proposed. Specifically, the lightweight network MobileNetv2 is used as the backbone network. In atrous spatial pyramid pooling (ASPP), to alleviate the gridding effect, the Dilated Convolution in original DeepLabv3+ network is replaced with the Hybrid Dilated Convolution (HDC) module. In addition, the traditional spatial mean pooling is replaced by the strip pooling module (SPN) to improve the local segmentation effect. In the decoder, to obtain the rich low-level target edge information, the ResNet50 residual network is added after the low-level feature fusion. To enhance the shallow semantic information, the efficient and lightweight Normalization-based Attention Module (NAM) is added to capture the feature information of small target objects. The results show that, under the INRIA Aerial Image Dataset and same parameter setting, the Mean Pixel Accuracy (MPA) and Mean Intersection over Union (MIoU) are generally best than DeepLabv3+ , U-Net, and PSP-Net, which are respectively improved by 1.22%, − 0.22%, and 2.22% and 2.17%, 1.35%, and 3.42%. Our proposed method has also a good performance on the small object segmentation and multi-object segmentation. What’s more, it significantly converges faster with fewer model parameters and stronger computing power while ensuring the segmentation effect. It is proved to be robust and can provide a methodological reference for high-precision remote-sensing image semantic segmentation.https://doi.org/10.1007/s40747-023-01304-zRemote-sensing imageSemantic segmentationDeepLabv3+ Deep learningLightweight network |
spellingShingle | Hui Chen Yuanshou Qin Xinyuan Liu Haitao Wang Jinling Zhao An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation Complex & Intelligent Systems Remote-sensing image Semantic segmentation DeepLabv3+ Deep learning Lightweight network |
title | An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation |
title_full | An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation |
title_fullStr | An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation |
title_full_unstemmed | An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation |
title_short | An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation |
title_sort | improved deeplabv3 lightweight network for remote sensing image semantic segmentation |
topic | Remote-sensing image Semantic segmentation DeepLabv3+ Deep learning Lightweight network |
url | https://doi.org/10.1007/s40747-023-01304-z |
work_keys_str_mv | AT huichen animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT yuanshouqin animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT xinyuanliu animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT haitaowang animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT jinlingzhao animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT huichen improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT yuanshouqin improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT xinyuanliu improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT haitaowang improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation AT jinlingzhao improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation |