Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object

Object tracking has been a difficult problem in the field of vision in recent years. The core task is to continuously locate an object in video sequences and mark its location with bounding boxes. Most of the existing tracking methods use the idea of object detection, and separate the video sequence...

Full description

Bibliographic Details
Main Author: GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian
Format: Article
Language:zho
Published: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press 2021-06-01
Series:Jisuanji kexue yu tansuo
Subjects:
Online Access:http://fcst.ceaj.org/CN/abstract/abstract2719.shtml
_version_ 1818432872845934592
author GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian
author_facet GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian
author_sort GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian
collection DOAJ
description Object tracking has been a difficult problem in the field of vision in recent years. The core task is to continuously locate an object in video sequences and mark its location with bounding boxes. Most of the existing tracking methods use the idea of object detection, and separate the video sequence by frame to detect the target separately. Although this strategy makes full use of the current frame information, it ignores the spatio-temporal correlation information among frames. However, the spatio-temporal correlation information is the key of adapting to the change of the target??s appearance and fully representing the target. To solve this problem, this paper proposes a spatio-temporal siamese network (STSiam) based on spatio-temporal correlation. STSiam uses the spatio-temporal correlation information for target locating and real-time tracking in two stages: object localization and object repre-sentation. In the stage of object localization, STSiam adaptively captures the features of the target and its surroun-ding area, and updates the target matching template to ensure that it is not affected by appearance changes. In the stage of object representation, STSiam pays attention to the spatial correlation information between corresponding regions in different frames. By using the object localization, STSiam locates the target area and learns the target bounding box correction parameters to ensure that the bounding box fits the target as closely as possible. The model's network architecture is based on offline training, and it is no need to update model parameters during online tracking to ensure its real-time tracking speed. Extensive experiments on visual tracking benchmarks including OTB2015, VOT2016, VOT2018 and LaSOT demonstrate that STSiam achieves state-of-the-art performance in terms of accu-racy, robustness and speed compared with existing methods.
first_indexed 2024-12-14T16:12:06Z
format Article
id doaj.art-ad1583e908d04c599c074e0de9fbf3c0
institution Directory Open Access Journal
issn 1673-9418
language zho
last_indexed 2024-12-14T16:12:06Z
publishDate 2021-06-01
publisher Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
record_format Article
series Jisuanji kexue yu tansuo
spelling doaj.art-ad1583e908d04c599c074e0de9fbf3c02022-12-21T22:54:59ZzhoJournal of Computer Engineering and Applications Beijing Co., Ltd., Science PressJisuanji kexue yu tansuo1673-94182021-06-011561049106110.3778/j.issn.1673-9418.2007002Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking ObjectGUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian0Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing 100044, ChinaObject tracking has been a difficult problem in the field of vision in recent years. The core task is to continuously locate an object in video sequences and mark its location with bounding boxes. Most of the existing tracking methods use the idea of object detection, and separate the video sequence by frame to detect the target separately. Although this strategy makes full use of the current frame information, it ignores the spatio-temporal correlation information among frames. However, the spatio-temporal correlation information is the key of adapting to the change of the target??s appearance and fully representing the target. To solve this problem, this paper proposes a spatio-temporal siamese network (STSiam) based on spatio-temporal correlation. STSiam uses the spatio-temporal correlation information for target locating and real-time tracking in two stages: object localization and object repre-sentation. In the stage of object localization, STSiam adaptively captures the features of the target and its surroun-ding area, and updates the target matching template to ensure that it is not affected by appearance changes. In the stage of object representation, STSiam pays attention to the spatial correlation information between corresponding regions in different frames. By using the object localization, STSiam locates the target area and learns the target bounding box correction parameters to ensure that the bounding box fits the target as closely as possible. The model's network architecture is based on offline training, and it is no need to update model parameters during online tracking to ensure its real-time tracking speed. Extensive experiments on visual tracking benchmarks including OTB2015, VOT2016, VOT2018 and LaSOT demonstrate that STSiam achieves state-of-the-art performance in terms of accu-racy, robustness and speed compared with existing methods.http://fcst.ceaj.org/CN/abstract/abstract2719.shtmlspatio-temporal correlationfeature; trackingobject localizationobject representation
spellingShingle GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian
Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object
Jisuanji kexue yu tansuo
spatio-temporal correlation
feature; tracking
object localization
object representation
title Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object
title_full Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object
title_fullStr Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object
title_full_unstemmed Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object
title_short Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object
title_sort spatio temporal correlation based adaptive feature learning of tracking object
topic spatio-temporal correlation
feature; tracking
object localization
object representation
url http://fcst.ceaj.org/CN/abstract/abstract2719.shtml
work_keys_str_mv AT guomingzhecaizixinwangxinyuejinglipingyujian spatiotemporalcorrelationbasedadaptivefeaturelearningoftrackingobject