Sar Ship Detection Based on Convnext with Multi-Pooling Channel Attention and Feature Intensification Pyramid Network

The advancements in ship detection technology using convolutional neural networks (CNNs) regarding synthetic aperture radar (SAR) images have been significant. Yet, there are still some limitations in the existing detection algorithms. First, the backbones cannot generate high-quality multiscale fea...

Full description

Bibliographic Details
Main Authors: Fanming Wei, Xiao Wang
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/17/7641
_version_ 1827727751752187904
author Fanming Wei
Xiao Wang
author_facet Fanming Wei
Xiao Wang
author_sort Fanming Wei
collection DOAJ
description The advancements in ship detection technology using convolutional neural networks (CNNs) regarding synthetic aperture radar (SAR) images have been significant. Yet, there are still some limitations in the existing detection algorithms. First, the backbones cannot generate high-quality multiscale feature maps. Second, there is a lack of suitable attention mechanisms to suppress false alarms. Third, the current feature intensification algorithms are unable to effectively enhance the shallow feature’s semantic information, which hinders the detection of small ships. Fourth, top-level feature maps have rich semantic information; however, as a result of the reduction of channels, the semantic information is weakened. These four problems lead to poor performance in SAR ship detection and recognition. To address the mentioned issues, we put forward a new approach that has the following characteristics. First, we use Convnext as the backbone to generate high-quality multiscale feature maps. Second, to suppress false alarms, the multi-pooling channel attention (MPCA) is designed to generate a corresponding weight for each channel, suppressing redundant feature maps, and further optimizing the feature maps generated by Convnext. Third, a feature intensification pyramid network (FIPN) is specifically designed to intensify the feature maps, especially the shallow feature maps. Fourth, a top-level feature intensification (TLFI) is also proposed to compensate for semantic information loss within the top-level feature maps by utilizing semantic information from different spaces. The experimental dataset employed is the SAR Ship Detection Dataset (SSDD), and the experimental findings display that our approach exhibits superiority compared to other advanced approaches. The overall Average Precision (AP) reaches up to 95.6% on the SSDD, which improves the accuracy by at least 1.7% compared to the current excellent methods.
first_indexed 2024-03-10T23:12:25Z
format Article
id doaj.art-75590b7e0cb94900afb54360296e63e1
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T23:12:25Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-75590b7e0cb94900afb54360296e63e12023-11-19T08:52:35ZengMDPI AGSensors1424-82202023-09-012317764110.3390/s23177641Sar Ship Detection Based on Convnext with Multi-Pooling Channel Attention and Feature Intensification Pyramid NetworkFanming Wei0Xiao Wang1College of Computer and Information Engineering, Nanjing Tech University, Nanjing 211816, ChinaCollege of Computer and Information Engineering, Nanjing Tech University, Nanjing 211816, ChinaThe advancements in ship detection technology using convolutional neural networks (CNNs) regarding synthetic aperture radar (SAR) images have been significant. Yet, there are still some limitations in the existing detection algorithms. First, the backbones cannot generate high-quality multiscale feature maps. Second, there is a lack of suitable attention mechanisms to suppress false alarms. Third, the current feature intensification algorithms are unable to effectively enhance the shallow feature’s semantic information, which hinders the detection of small ships. Fourth, top-level feature maps have rich semantic information; however, as a result of the reduction of channels, the semantic information is weakened. These four problems lead to poor performance in SAR ship detection and recognition. To address the mentioned issues, we put forward a new approach that has the following characteristics. First, we use Convnext as the backbone to generate high-quality multiscale feature maps. Second, to suppress false alarms, the multi-pooling channel attention (MPCA) is designed to generate a corresponding weight for each channel, suppressing redundant feature maps, and further optimizing the feature maps generated by Convnext. Third, a feature intensification pyramid network (FIPN) is specifically designed to intensify the feature maps, especially the shallow feature maps. Fourth, a top-level feature intensification (TLFI) is also proposed to compensate for semantic information loss within the top-level feature maps by utilizing semantic information from different spaces. The experimental dataset employed is the SAR Ship Detection Dataset (SSDD), and the experimental findings display that our approach exhibits superiority compared to other advanced approaches. The overall Average Precision (AP) reaches up to 95.6% on the SSDD, which improves the accuracy by at least 1.7% compared to the current excellent methods.https://www.mdpi.com/1424-8220/23/17/7641Convnextfeature intensification pyramid network (FIPN)SAR ship detectionmulti-pooling channel attention (MPCA)top-level feature intensification (TLFI) module
spellingShingle Fanming Wei
Xiao Wang
Sar Ship Detection Based on Convnext with Multi-Pooling Channel Attention and Feature Intensification Pyramid Network
Sensors
Convnext
feature intensification pyramid network (FIPN)
SAR ship detection
multi-pooling channel attention (MPCA)
top-level feature intensification (TLFI) module
title Sar Ship Detection Based on Convnext with Multi-Pooling Channel Attention and Feature Intensification Pyramid Network
title_full Sar Ship Detection Based on Convnext with Multi-Pooling Channel Attention and Feature Intensification Pyramid Network
title_fullStr Sar Ship Detection Based on Convnext with Multi-Pooling Channel Attention and Feature Intensification Pyramid Network
title_full_unstemmed Sar Ship Detection Based on Convnext with Multi-Pooling Channel Attention and Feature Intensification Pyramid Network
title_short Sar Ship Detection Based on Convnext with Multi-Pooling Channel Attention and Feature Intensification Pyramid Network
title_sort sar ship detection based on convnext with multi pooling channel attention and feature intensification pyramid network
topic Convnext
feature intensification pyramid network (FIPN)
SAR ship detection
multi-pooling channel attention (MPCA)
top-level feature intensification (TLFI) module
url https://www.mdpi.com/1424-8220/23/17/7641
work_keys_str_mv AT fanmingwei sarshipdetectionbasedonconvnextwithmultipoolingchannelattentionandfeatureintensificationpyramidnetwork
AT xiaowang sarshipdetectionbasedonconvnextwithmultipoolingchannelattentionandfeatureintensificationpyramidnetwork