Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object
Object tracking has been a difficult problem in the field of vision in recent years. The core task is to continuously locate an object in video sequences and mark its location with bounding boxes. Most of the existing tracking methods use the idea of object detection, and separate the video sequence...
Main Author: | |
---|---|
Format: | Article |
Language: | zho |
Published: | Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press, 2021-06-01 |
Series: | Jisuanji kexue yu tansuo |
Subjects: | |
Online Access: | http://fcst.ceaj.org/CN/abstract/abstract2719.shtml |
_version_ | 1818432872845934592 |
---|---|
author | GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian |
author_facet | GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian |
author_sort | GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian |
collection | DOAJ |
description | Object tracking has been a challenging problem in computer vision in recent years. The core task is to continuously locate an object in a video sequence and mark its location with a bounding box. Most existing tracking methods borrow the idea of object detection: they split the video sequence into frames and detect the target in each frame separately. Although this strategy makes full use of the information in the current frame, it ignores the spatio-temporal correlation among frames. That correlation, however, is the key to adapting to changes in the target's appearance and to fully representing the target. To solve this problem, this paper proposes a spatio-temporal siamese network (STSiam) based on spatio-temporal correlation. STSiam uses spatio-temporal correlation information for target locating and real-time tracking in two stages: object localization and object representation. In the object localization stage, STSiam adaptively captures the features of the target and its surrounding area, and updates the target matching template so that matching is not affected by appearance changes. In the object representation stage, STSiam attends to the spatial correlation between corresponding regions in different frames. Using the localization result, STSiam locates the target area and learns bounding-box correction parameters so that the bounding box fits the target as closely as possible. The network is trained offline, and model parameters need not be updated during online tracking, which preserves real-time tracking speed. Extensive experiments on visual tracking benchmarks including OTB2015, VOT2016, VOT2018 and LaSOT demonstrate that STSiam achieves state-of-the-art performance in terms of accuracy, robustness and speed compared with existing methods. |
first_indexed | 2024-12-14T16:12:06Z |
format | Article |
id | doaj.art-ad1583e908d04c599c074e0de9fbf3c0 |
institution | Directory Open Access Journal |
issn | 1673-9418 |
language | zho |
last_indexed | 2024-12-14T16:12:06Z |
publishDate | 2021-06-01 |
publisher | Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press |
record_format | Article |
series | Jisuanji kexue yu tansuo |
spelling | doaj.art-ad1583e908d04c599c074e0de9fbf3c0 2022-12-21T22:54:59Z zho Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press Jisuanji kexue yu tansuo 1673-9418 2021-06-01 15 6 1049 1061 10.3778/j.issn.1673-9418.2007002 Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing 100044, China http://fcst.ceaj.org/CN/abstract/abstract2719.shtml spatio-temporal correlation; feature; tracking; object localization; object representation |
spellingShingle | GUO Mingzhe, CAI Zixin, WANG Xinyue, JING Liping, YU Jian Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object Jisuanji kexue yu tansuo spatio-temporal correlation feature; tracking object localization object representation |
title | Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object |
title_full | Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object |
title_fullStr | Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object |
title_full_unstemmed | Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object |
title_short | Spatio-Temporal Correlation Based Adaptive Feature Learning of Tracking Object |
title_sort | spatio temporal correlation based adaptive feature learning of tracking object |
topic | spatio-temporal correlation; feature; tracking; object localization; object representation |
url | http://fcst.ceaj.org/CN/abstract/abstract2719.shtml |
work_keys_str_mv | AT guomingzhecaizixinwangxinyuejinglipingyujian spatiotemporalcorrelationbasedadaptivefeaturelearningoftrackingobject |
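
The localization stage described in the abstract (match a template against a search region, then refresh the template so it follows appearance changes) can be illustrated with a minimal sketch. This is not the STSiam implementation: the record gives no network details, so raw 2-D grids stand in for learned features, and the function names (`cross_correlate`, `locate`, `crop`, `update_template`) and the linear blending rule with its `rate` parameter are assumptions made for illustration only.

```python
from typing import List, Tuple

Grid = List[List[float]]


def cross_correlate(search: Grid, template: Grid) -> Grid:
    """Slide the template over the search region; return the response map."""
    th, tw = len(template), len(template[0])
    sh, sw = len(search), len(search[0])
    response = []
    for y in range(sh - th + 1):
        row = []
        for x in range(sw - tw + 1):
            # Unnormalized correlation score at offset (y, x).
            score = sum(
                search[y + i][x + j] * template[i][j]
                for i in range(th)
                for j in range(tw)
            )
            row.append(score)
        response.append(row)
    return response


def locate(response: Grid) -> Tuple[int, int]:
    """Return (row, col) of the strongest response."""
    best = max(
        ((v, y, x) for y, row in enumerate(response) for x, v in enumerate(row)),
        key=lambda t: t[0],
    )
    return best[1], best[2]


def crop(search: Grid, top: int, left: int, height: int, width: int) -> Grid:
    """Cut the matched patch out of the search region."""
    return [row[left:left + width] for row in search[top:top + height]]


def update_template(template: Grid, patch: Grid, rate: float = 0.1) -> Grid:
    """Blend the old template with the new patch (assumed update rule)."""
    return [
        [(1.0 - rate) * t + rate * p for t, p in zip(trow, prow)]
        for trow, prow in zip(template, patch)
    ]


# Toy frame: the diagonal pattern of the template appears at row 1, column 2.
template = [[1.0, 0.0], [0.0, 1.0]]
search = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.9, 0.0],
    [0.0, 0.0, 0.0, 0.8],
    [0.0, 0.0, 0.0, 0.0],
]
response = cross_correlate(search, template)
top, left = locate(response)
print((top, left))  # (1, 2)
template = update_template(template, crop(search, top, left, 2, 2), rate=0.5)
```

In this toy setup the template drifts toward the slightly dimmer pattern seen in the new frame, which is the intuition behind updating the matching template over time. A real siamese tracker would correlate deep feature maps rather than raw values, and would learn how to update the template during offline training.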