An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation

Abstract To improve the accuracy of remote-sensing image semantic segmentation in complex scenario, an improved DeepLabv3+ lightweight neural network is proposed. Specifically, the lightweight network MobileNetv2 is used as the backbone network. In atrous spatial pyramid pooling (ASPP), to alleviate...

Full description

Bibliographic Details
Main Authors: Hui Chen, Yuanshou Qin, Xinyuan Liu, Haitao Wang, Jinling Zhao
Format: Article
Language:English
Published: Springer 2023-12-01
Series:Complex & Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1007/s40747-023-01304-z
_version_ 1797233216986284032
author Hui Chen
Yuanshou Qin
Xinyuan Liu
Haitao Wang
Jinling Zhao
author_facet Hui Chen
Yuanshou Qin
Xinyuan Liu
Haitao Wang
Jinling Zhao
author_sort Hui Chen
collection DOAJ
description Abstract To improve the accuracy of remote-sensing image semantic segmentation in complex scenario, an improved DeepLabv3+ lightweight neural network is proposed. Specifically, the lightweight network MobileNetv2 is used as the backbone network. In atrous spatial pyramid pooling (ASPP), to alleviate the gridding effect, the Dilated Convolution in original DeepLabv3+ network is replaced with the Hybrid Dilated Convolution (HDC) module. In addition, the traditional spatial mean pooling is replaced by the strip pooling module (SPN) to improve the local segmentation effect. In the decoder, to obtain the rich low-level target edge information, the ResNet50 residual network is added after the low-level feature fusion. To enhance the shallow semantic information, the efficient and lightweight Normalization-based Attention Module (NAM) is added to capture the feature information of small target objects. The results show that, under the INRIA Aerial Image Dataset and same parameter setting, the Mean Pixel Accuracy (MPA) and Mean Intersection over Union (MIoU) are generally best than DeepLabv3+ , U-Net, and PSP-Net, which are respectively improved by 1.22%, − 0.22%, and 2.22% and 2.17%, 1.35%, and 3.42%. Our proposed method has also a good performance on the small object segmentation and multi-object segmentation. What’s more, it significantly converges faster with fewer model parameters and stronger computing power while ensuring the segmentation effect. It is proved to be robust and can provide a methodological reference for high-precision remote-sensing image semantic segmentation.
first_indexed 2024-04-24T16:12:39Z
format Article
id doaj.art-dc85669217044289bf6d88746d25406b
institution Directory Open Access Journal
issn 2199-4536
2198-6053
language English
last_indexed 2024-04-24T16:12:39Z
publishDate 2023-12-01
publisher Springer
record_format Article
series Complex & Intelligent Systems
spelling doaj.art-dc85669217044289bf6d88746d25406b2024-03-31T11:39:07ZengSpringerComplex & Intelligent Systems2199-45362198-60532023-12-011022839284910.1007/s40747-023-01304-zAn improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentationHui Chen0Yuanshou Qin1Xinyuan Liu2Haitao Wang3Jinling Zhao4School of Internet, Anhui UniversitySchool of Internet, Anhui UniversitySchool of Internet, Anhui UniversitySchool of Internet, Anhui UniversityNational Engineering Research Center for Analysis and Application of Agro-Ecological Big Data, Anhui UniversityAbstract To improve the accuracy of remote-sensing image semantic segmentation in complex scenario, an improved DeepLabv3+ lightweight neural network is proposed. Specifically, the lightweight network MobileNetv2 is used as the backbone network. In atrous spatial pyramid pooling (ASPP), to alleviate the gridding effect, the Dilated Convolution in original DeepLabv3+ network is replaced with the Hybrid Dilated Convolution (HDC) module. In addition, the traditional spatial mean pooling is replaced by the strip pooling module (SPN) to improve the local segmentation effect. In the decoder, to obtain the rich low-level target edge information, the ResNet50 residual network is added after the low-level feature fusion. To enhance the shallow semantic information, the efficient and lightweight Normalization-based Attention Module (NAM) is added to capture the feature information of small target objects. The results show that, under the INRIA Aerial Image Dataset and same parameter setting, the Mean Pixel Accuracy (MPA) and Mean Intersection over Union (MIoU) are generally best than DeepLabv3+ , U-Net, and PSP-Net, which are respectively improved by 1.22%, − 0.22%, and 2.22% and 2.17%, 1.35%, and 3.42%. Our proposed method has also a good performance on the small object segmentation and multi-object segmentation. What’s more, it significantly converges faster with fewer model parameters and stronger computing power while ensuring the segmentation effect. It is proved to be robust and can provide a methodological reference for high-precision remote-sensing image semantic segmentation.https://doi.org/10.1007/s40747-023-01304-zRemote-sensing imageSemantic segmentationDeepLabv3+ Deep learningLightweight network
spellingShingle Hui Chen
Yuanshou Qin
Xinyuan Liu
Haitao Wang
Jinling Zhao
An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation
Complex & Intelligent Systems
Remote-sensing image
Semantic segmentation
DeepLabv3+ 
Deep learning
Lightweight network
title An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation
title_full An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation
title_fullStr An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation
title_full_unstemmed An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation
title_short An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation
title_sort improved deeplabv3 lightweight network for remote sensing image semantic segmentation
topic Remote-sensing image
Semantic segmentation
DeepLabv3+ 
Deep learning
Lightweight network
url https://doi.org/10.1007/s40747-023-01304-z
work_keys_str_mv AT huichen animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT yuanshouqin animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT xinyuanliu animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT haitaowang animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT jinlingzhao animproveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT huichen improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT yuanshouqin improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT xinyuanliu improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT haitaowang improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation
AT jinlingzhao improveddeeplabv3lightweightnetworkforremotesensingimagesemanticsegmentation