Siamese Transformer Network for Real-Time Aerial Object Tracking

Recently, deep learning (DL) based trackers have attracted tremendous interest for their high performance. Despite the remarkable success, most trackers utilizing deep convolution features commonly neglect tracking speed, which is crucial for aerial tracking on mobile devices. In this paper, we prop...

Full description

Bibliographic Details
Main Authors: Haijun Wang, Shengyan Zhang
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9908547/
_version_ 1817986242338357248
author Haijun Wang
Shengyan Zhang
author_facet Haijun Wang
Shengyan Zhang
author_sort Haijun Wang
collection DOAJ
description Recently, deep learning (DL) based trackers have attracted tremendous interest for their high performance. Despite the remarkable success, most trackers utilizing deep convolution features commonly neglect tracking speed, which is crucial for aerial tracking on mobile devices. In this paper, we propose an efficient and effective transformer based aerial tracker in the framework of Siamese, which inherits the merits from both transformer and Siamese architectures. Specifically, the outputs from multiple convolution layers are fed into transformer to construct robust features of template patch and search patch, respectively. Consequently, the interdependencies between low-level information and semantic information are interactively fused to improve the ability of encoding target appearance. Finally, traditional depth-wise cross correlation is introduced to generate a similarity map for object location and bounding box regression. Extensive experimental results on three popular benchmarks (DTB70, UAV123@10fps, and UAV20L) have demonstrated that our proposed tracker outperforms other 12 state-of-the-art trackers and achieves a real-time tracking speed of 71.3 frames per second (FPS) on GPU, which can be applied in mobile platform.
first_indexed 2024-04-14T00:07:51Z
format Article
id doaj.art-862fe681a48d4e018513c31d7ce49dab
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-14T00:07:51Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-862fe681a48d4e018513c31d7ce49dab2022-12-22T02:23:28ZengIEEEIEEE Access2169-35362022-01-011010520110521310.1109/ACCESS.2022.32115169908547Siamese Transformer Network for Real-Time Aerial Object TrackingHaijun Wang0https://orcid.org/0000-0003-2481-9662Shengyan Zhang1Aviation Information Technology Research and Development, Binzhou University, Binzhou, ChinaAviation Information Technology Research and Development, Binzhou University, Binzhou, ChinaRecently, deep learning (DL) based trackers have attracted tremendous interest for their high performance. Despite the remarkable success, most trackers utilizing deep convolution features commonly neglect tracking speed, which is crucial for aerial tracking on mobile devices. In this paper, we propose an efficient and effective transformer based aerial tracker in the framework of Siamese, which inherits the merits from both transformer and Siamese architectures. Specifically, the outputs from multiple convolution layers are fed into transformer to construct robust features of template patch and search patch, respectively. Consequently, the interdependencies between low-level information and semantic information are interactively fused to improve the ability of encoding target appearance. Finally, traditional depth-wise cross correlation is introduced to generate a similarity map for object location and bounding box regression. Extensive experimental results on three popular benchmarks (DTB70, UAV123@10fps, and UAV20L) have demonstrated that our proposed tracker outperforms other 12 state-of-the-art trackers and achieves a real-time tracking speed of 71.3 frames per second (FPS) on GPU, which can be applied in mobile platform.https://ieeexplore.ieee.org/document/9908547/Aerial object trackingtransformerself-attentionSiamese networkdeep neural network
spellingShingle Haijun Wang
Shengyan Zhang
Siamese Transformer Network for Real-Time Aerial Object Tracking
IEEE Access
Aerial object tracking
transformer
self-attention
Siamese network
deep neural network
title Siamese Transformer Network for Real-Time Aerial Object Tracking
title_full Siamese Transformer Network for Real-Time Aerial Object Tracking
title_fullStr Siamese Transformer Network for Real-Time Aerial Object Tracking
title_full_unstemmed Siamese Transformer Network for Real-Time Aerial Object Tracking
title_short Siamese Transformer Network for Real-Time Aerial Object Tracking
title_sort siamese transformer network for real time aerial object tracking
topic Aerial object tracking
transformer
self-attention
Siamese network
deep neural network
url https://ieeexplore.ieee.org/document/9908547/
work_keys_str_mv AT haijunwang siamesetransformernetworkforrealtimeaerialobjecttracking
AT shengyanzhang siamesetransformernetworkforrealtimeaerialobjecttracking