HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection

As marine observation technology develops rapidly, underwater optical image object detection is beginning to occupy an important role in many tasks, such as naval coastal defense tasks, aquaculture, etc. However, in the complex marine environment, the images captured by an optical imaging system are...

Full description

Bibliographic Details
Main Authors: Gangqi Chen, Zhaoyong Mao, Kai Wang, Junge Shen
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/15/4/1076
_version_ 1797618351930867712
author Gangqi Chen
Zhaoyong Mao
Kai Wang
Junge Shen
author_facet Gangqi Chen
Zhaoyong Mao
Kai Wang
Junge Shen
author_sort Gangqi Chen
collection DOAJ
description As marine observation technology develops rapidly, underwater optical image object detection is beginning to occupy an important role in many tasks, such as naval coastal defense tasks, aquaculture, etc. However, in the complex marine environment, the images captured by an optical imaging system are usually severely degraded. Therefore, how to detect objects accurately and quickly under such conditions is a critical problem that needs to be solved. In this manuscript, a novel framework for underwater object detection based on a hybrid transformer network is proposed. First, a lightweight hybrid transformer-based network is presented that can extract global contextual information. Second, a fine-grained feature pyramid network is used to overcome the issues of feeble signal disappearance. Third, the test-time-augmentation method is applied for inference without introducing additional parameters. Extensive experiments have shown that the approach we have proposed is able to detect feeble and small objects in an efficient and effective way. Furthermore, our model significantly outperforms the latest advanced detectors with respect to both the number of parameters and the mAP by a considerable margin. Specifically, our detector outperforms the baseline model by 6.3 points, and the model parameters are reduced by 28.5 M.
first_indexed 2024-03-11T08:12:49Z
format Article
id doaj.art-6178615cc5744b8e9699c50e64ec4114
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-11T08:12:49Z
publishDate 2023-02-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-6178615cc5744b8e9699c50e64ec41142023-11-16T23:03:23ZengMDPI AGRemote Sensing2072-42922023-02-01154107610.3390/rs15041076HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object DetectionGangqi Chen0Zhaoyong Mao1Kai Wang2Junge Shen3School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, ChinaUnmanned System Research Institute, Northwestern Polytechnical University, Xi’an 710072, ChinaHenan Key Laboratory of Underwater Intelligent Equipment, Zhengzhou 710072, ChinaUnmanned System Research Institute, Northwestern Polytechnical University, Xi’an 710072, ChinaAs marine observation technology develops rapidly, underwater optical image object detection is beginning to occupy an important role in many tasks, such as naval coastal defense tasks, aquaculture, etc. However, in the complex marine environment, the images captured by an optical imaging system are usually severely degraded. Therefore, how to detect objects accurately and quickly under such conditions is a critical problem that needs to be solved. In this manuscript, a novel framework for underwater object detection based on a hybrid transformer network is proposed. First, a lightweight hybrid transformer-based network is presented that can extract global contextual information. Second, a fine-grained feature pyramid network is used to overcome the issues of feeble signal disappearance. Third, the test-time-augmentation method is applied for inference without introducing additional parameters. Extensive experiments have shown that the approach we have proposed is able to detect feeble and small objects in an efficient and effective way. Furthermore, our model significantly outperforms the latest advanced detectors with respect to both the number of parameters and the mAP by a considerable margin. Specifically, our detector outperforms the baseline model by 6.3 points, and the model parameters are reduced by 28.5 M.https://www.mdpi.com/2072-4292/15/4/1076deep learningunderwater object detectiontransformerlightweightfeeble and small object
spellingShingle Gangqi Chen
Zhaoyong Mao
Kai Wang
Junge Shen
HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection
Remote Sensing
deep learning
underwater object detection
transformer
lightweight
feeble and small object
title HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection
title_full HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection
title_fullStr HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection
title_full_unstemmed HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection
title_short HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection
title_sort htdet a hybrid transformer based approach for underwater small object detection
topic deep learning
underwater object detection
transformer
lightweight
feeble and small object
url https://www.mdpi.com/2072-4292/15/4/1076
work_keys_str_mv AT gangqichen htdetahybridtransformerbasedapproachforunderwatersmallobjectdetection
AT zhaoyongmao htdetahybridtransformerbasedapproachforunderwatersmallobjectdetection
AT kaiwang htdetahybridtransformerbasedapproachforunderwatersmallobjectdetection
AT jungeshen htdetahybridtransformerbasedapproachforunderwatersmallobjectdetection