UMOTMA: Underwater multiple object tracking with memory aggregation

Underwater multi-object tracking (UMOT) is an important technology in marine animal ethology. It is affected by complex factors such as scattering, background interference, and occlusion, which makes it a challenging computer vision task. As a result, the stable continuation of trajectories among di...

Full description

Bibliographic Details
Main Authors:	Zhicheng Hao, Jun Qiu, Haimiao Zhang, Guangbo Ren, Chang Liu
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2022-11-01
Series:	Frontiers in Marine Science
Subjects:	artificial intelligence underwater multiple object tracking marine environment long-short term memory vision transformer
Online Access:	https://www.frontiersin.org/articles/10.3389/fmars.2022.1071618/full

_version_	1811211392532873216
author	Zhicheng Hao Jun Qiu Haimiao Zhang Guangbo Ren Chang Liu
author_facet	Zhicheng Hao Jun Qiu Haimiao Zhang Guangbo Ren Chang Liu
author_sort	Zhicheng Hao
collection	DOAJ
description	Underwater multi-object tracking (UMOT) is an important technology in marine animal ethology. It is affected by complex factors such as scattering, background interference, and occlusion, which makes it a challenging computer vision task. As a result, the stable continuation of trajectories among different targets has been the key to the tracking performance of UMOT tasks. To solve such challenges, we propose an underwater multi-object tracking algorithm based on memory aggregation (UMOTMA) to effectively associate multiple frames with targets. First, we propose a long short-term memory (LSTM)-based memory aggregation module (LSMAM) to enhance memory utilization between multiple frames. Next, LSMAM embeds LSTM into the transformer structure to save and aggregate features between multiple frames. Then, an underwater image enhancement module ME is introduced to process the original underwater images, which improves the quality and visibility of the underwater images so that the model can extract better features from the images. Finally, LSMAM and ME are integrated with a backbone network to implement the entire algorithm framework, which can fully utilize the historical information of the tracked targets. Experiments on the UMOT datasets and the underwater fish school datasets show that UMOTMA generally outperforms existing models and can maintain the stability of the target trajectory while ensuring high-quality detection. The code is available via Github.
first_indexed	2024-04-12T05:13:35Z
format	Article
id	doaj.art-2cb5f9160c1746adad9b41f93be79f89
institution	Directory Open Access Journal
issn	2296-7745
language	English
last_indexed	2024-04-12T05:13:35Z
publishDate	2022-11-01
publisher	Frontiers Media S.A.
record_format	Article
series	Frontiers in Marine Science
spelling	doaj.art-2cb5f9160c1746adad9b41f93be79f892022-12-22T03:46:42ZengFrontiers Media S.A.Frontiers in Marine Science2296-77452022-11-01910.3389/fmars.2022.10716181071618UMOTMA: Underwater multiple object tracking with memory aggregationZhicheng Hao0Jun Qiu1Haimiao Zhang2Guangbo Ren3Chang Liu4Institute of Applied Mathematics, Beijing Information Science and Technology University, Beijing, ChinaInstitute of Applied Mathematics, Beijing Information Science and Technology University, Beijing, ChinaInstitute of Applied Mathematics, Beijing Information Science and Technology University, Beijing, ChinaFirst Institute of Oceanography, Ministry of Natural Resources, Qingdao, Shandong, ChinaInstitute of Applied Mathematics, Beijing Information Science and Technology University, Beijing, ChinaUnderwater multi-object tracking (UMOT) is an important technology in marine animal ethology. It is affected by complex factors such as scattering, background interference, and occlusion, which makes it a challenging computer vision task. As a result, the stable continuation of trajectories among different targets has been the key to the tracking performance of UMOT tasks. To solve such challenges, we propose an underwater multi-object tracking algorithm based on memory aggregation (UMOTMA) to effectively associate multiple frames with targets. First, we propose a long short-term memory (LSTM)-based memory aggregation module (LSMAM) to enhance memory utilization between multiple frames. Next, LSMAM embeds LSTM into the transformer structure to save and aggregate features between multiple frames. Then, an underwater image enhancement module ME is introduced to process the original underwater images, which improves the quality and visibility of the underwater images so that the model can extract better features from the images. Finally, LSMAM and ME are integrated with a backbone network to implement the entire algorithm framework, which can fully utilize the historical information of the tracked targets. Experiments on the UMOT datasets and the underwater fish school datasets show that UMOTMA generally outperforms existing models and can maintain the stability of the target trajectory while ensuring high-quality detection. The code is available via Github.https://www.frontiersin.org/articles/10.3389/fmars.2022.1071618/fullartificial intelligenceunderwater multiple object trackingmarine environmentlong-short term memoryvision transformer
spellingShingle	Zhicheng Hao Jun Qiu Haimiao Zhang Guangbo Ren Chang Liu UMOTMA: Underwater multiple object tracking with memory aggregation Frontiers in Marine Science artificial intelligence underwater multiple object tracking marine environment long-short term memory vision transformer
title	UMOTMA: Underwater multiple object tracking with memory aggregation
title_full	UMOTMA: Underwater multiple object tracking with memory aggregation
title_fullStr	UMOTMA: Underwater multiple object tracking with memory aggregation
title_full_unstemmed	UMOTMA: Underwater multiple object tracking with memory aggregation
title_short	UMOTMA: Underwater multiple object tracking with memory aggregation
title_sort	umotma underwater multiple object tracking with memory aggregation
topic	artificial intelligence underwater multiple object tracking marine environment long-short term memory vision transformer
url	https://www.frontiersin.org/articles/10.3389/fmars.2022.1071618/full
work_keys_str_mv	AT zhichenghao umotmaunderwatermultipleobjecttrackingwithmemoryaggregation AT junqiu umotmaunderwatermultipleobjecttrackingwithmemoryaggregation AT haimiaozhang umotmaunderwatermultipleobjecttrackingwithmemoryaggregation AT guangboren umotmaunderwatermultipleobjecttrackingwithmemoryaggregation AT changliu umotmaunderwatermultipleobjecttrackingwithmemoryaggregation

UMOTMA: Underwater multiple object tracking with memory aggregation

Similar Items