Two-Stage Underwater Object Detection Network Using Swin Transformer

Underwater object detection plays an essential role in ocean exploration, and the increasing amount of underwater object image data makes the study of advanced underwater object detection algorithms of great practical significance. However, there are problems with colour offset, low contrast, and ta...

Full description

Bibliographic Details
Main Authors: Jia Liu, Shuang Liu, Shujuan Xu, Changjun Zhou
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9938441/
_version_ 1828130308427350016
author Jia Liu
Shuang Liu
Shujuan Xu
Changjun Zhou
author_facet Jia Liu
Shuang Liu
Shujuan Xu
Changjun Zhou
author_sort Jia Liu
collection DOAJ
description Underwater object detection plays an essential role in ocean exploration, and the increasing amount of underwater object image data makes the study of advanced underwater object detection algorithms of great practical significance. However, there are problems with colour offset, low contrast, and target blur in underwater image data. An underwater object detection algorithm based on Faster R-CNN is proposed to solve these problems. First, the Swin Transformer is used as the backbone network of the algorithm. Second, by adding the path aggregation network, the deep feature map and the shallow feature map are superimposed and fused. Third, online hard example mining, makes the training process more efficient. Fourth, the ROI pooling is improved to ROI align, eliminating the two quantization errors of ROI pooling and improving the detection performance. Compared with other algorithms, the proposed algorithm’s based on improved Faster-RCNN on URPC2018 dataset is improved to 80.54%, and basically solve the problem of missed detection and false detection of objects of different sizes in a complex environment.
first_indexed 2024-04-11T16:32:10Z
format Article
id doaj.art-060ed1ec8b0e425eb8e4aa4d4c3ae239
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T16:32:10Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-060ed1ec8b0e425eb8e4aa4d4c3ae2392022-12-22T04:13:59ZengIEEEIEEE Access2169-35362022-01-011011723511724710.1109/ACCESS.2022.32195929938441Two-Stage Underwater Object Detection Network Using Swin TransformerJia Liu0https://orcid.org/0000-0001-8598-8019Shuang Liu1https://orcid.org/0000-0002-0095-4328Shujuan Xu2https://orcid.org/0000-0001-5815-457XChangjun Zhou3https://orcid.org/0000-0002-0129-2231School of Computer Science and Engineering, Dalian Minzu University, Dalian, Liaoning, ChinaSchool of Computer Science and Engineering, Dalian Minzu University, Dalian, Liaoning, ChinaSchool of Computer Science and Engineering, Dalian Minzu University, Dalian, Liaoning, ChinaCollege of Mathematics and Computer Science, Zhejiang Normal University, Zhejiang, Jinhua, ChinaUnderwater object detection plays an essential role in ocean exploration, and the increasing amount of underwater object image data makes the study of advanced underwater object detection algorithms of great practical significance. However, there are problems with colour offset, low contrast, and target blur in underwater image data. An underwater object detection algorithm based on Faster R-CNN is proposed to solve these problems. First, the Swin Transformer is used as the backbone network of the algorithm. Second, by adding the path aggregation network, the deep feature map and the shallow feature map are superimposed and fused. Third, online hard example mining, makes the training process more efficient. Fourth, the ROI pooling is improved to ROI align, eliminating the two quantization errors of ROI pooling and improving the detection performance. Compared with other algorithms, the proposed algorithm’s based on improved Faster-RCNN on URPC2018 dataset is improved to 80.54%, and basically solve the problem of missed detection and false detection of objects of different sizes in a complex environment.https://ieeexplore.ieee.org/document/9938441/Deep learningunderwater target detectionswin transformerfeature fusionfaster R-CNNOHEM
spellingShingle Jia Liu
Shuang Liu
Shujuan Xu
Changjun Zhou
Two-Stage Underwater Object Detection Network Using Swin Transformer
IEEE Access
Deep learning
underwater target detection
swin transformer
feature fusion
faster R-CNN
OHEM
title Two-Stage Underwater Object Detection Network Using Swin Transformer
title_full Two-Stage Underwater Object Detection Network Using Swin Transformer
title_fullStr Two-Stage Underwater Object Detection Network Using Swin Transformer
title_full_unstemmed Two-Stage Underwater Object Detection Network Using Swin Transformer
title_short Two-Stage Underwater Object Detection Network Using Swin Transformer
title_sort two stage underwater object detection network using swin transformer
topic Deep learning
underwater target detection
swin transformer
feature fusion
faster R-CNN
OHEM
url https://ieeexplore.ieee.org/document/9938441/
work_keys_str_mv AT jialiu twostageunderwaterobjectdetectionnetworkusingswintransformer
AT shuangliu twostageunderwaterobjectdetectionnetworkusingswintransformer
AT shujuanxu twostageunderwaterobjectdetectionnetworkusingswintransformer
AT changjunzhou twostageunderwaterobjectdetectionnetworkusingswintransformer