High-resolution satellite video single object tracking based on thicksiam framework

High-resolution satellite videos realize the short-dated gaze observation of the designated area on the ground, and its emergence has improved the temporal resolution of remote sensing data to the second level. Single object tracking (SOT) task in satellite video has attracted considerable attention...

Full description

Bibliographic Details
Main Authors: Xiaodong Zhang, Kun Zhu, Guanzhou Chen, Puyun Liao, Xiaoliang Tan, Tong Wang, Xianwei Li
Format: Article
Language:English
Published: Taylor & Francis Group 2023-12-01
Series:GIScience & Remote Sensing
Subjects:
Online Access:http://dx.doi.org/10.1080/15481603.2022.2163063
_version_ 1797679036864921600
author Xiaodong Zhang
Kun Zhu
Guanzhou Chen
Puyun Liao
Xiaoliang Tan
Tong Wang
Xianwei Li
author_facet Xiaodong Zhang
Kun Zhu
Guanzhou Chen
Puyun Liao
Xiaoliang Tan
Tong Wang
Xianwei Li
author_sort Xiaodong Zhang
collection DOAJ
description High-resolution satellite videos realize the short-dated gaze observation of the designated area on the ground, and its emergence has improved the temporal resolution of remote sensing data to the second level. Single object tracking (SOT) task in satellite video has attracted considerable attention. However, it faces challenges such as complex background, poor object feature representation, and lack of publicly available datasets. To cope with these challenges, a ThickSiam framework consisting of a Thickened Residual Block Siamese Network (TRBS-Net) for extracting robust semantic features to obtain the initial tracking results and a Remoulded Kalman Filter (RKF) module for simultaneously correcting the trajectory and size of the targets is designed in this work. The results of TRBS-Net and RKF modules are combined by an N-frame-convergence mechanism to achieve accurate tracking results. Ablation experiments are implemented on our annotated dataset to evaluate the performance of the proposed ThickSiam framework and other 19 state-of-the-art trackers. The comparison results show that our ThickSiam tracker obtains a precision value of 0.991 and a success value of 0.755 while running at 56.849 FPS implemented on one NVIDIA GTX1070Ti GPU.
first_indexed 2024-03-11T23:08:35Z
format Article
id doaj.art-9b3efe3b09ff469bbc8a3f5f05864abf
institution Directory Open Access Journal
issn 1548-1603
1943-7226
language English
last_indexed 2024-03-11T23:08:35Z
publishDate 2023-12-01
publisher Taylor & Francis Group
record_format Article
series GIScience & Remote Sensing
spelling doaj.art-9b3efe3b09ff469bbc8a3f5f05864abf2023-09-21T12:43:09ZengTaylor & Francis GroupGIScience & Remote Sensing1548-16031943-72262023-12-0160110.1080/15481603.2022.21630632163063High-resolution satellite video single object tracking based on thicksiam frameworkXiaodong Zhang0Kun Zhu1Guanzhou Chen2Puyun Liao3Xiaoliang Tan4Tong Wang5Xianwei Li6Wuhan UniversityWuhan UniversityWuhan UniversityWuhan UniversityWuhan UniversityWuhan UniversityWuhan UniversityHigh-resolution satellite videos realize the short-dated gaze observation of the designated area on the ground, and its emergence has improved the temporal resolution of remote sensing data to the second level. Single object tracking (SOT) task in satellite video has attracted considerable attention. However, it faces challenges such as complex background, poor object feature representation, and lack of publicly available datasets. To cope with these challenges, a ThickSiam framework consisting of a Thickened Residual Block Siamese Network (TRBS-Net) for extracting robust semantic features to obtain the initial tracking results and a Remoulded Kalman Filter (RKF) module for simultaneously correcting the trajectory and size of the targets is designed in this work. The results of TRBS-Net and RKF modules are combined by an N-frame-convergence mechanism to achieve accurate tracking results. Ablation experiments are implemented on our annotated dataset to evaluate the performance of the proposed ThickSiam framework and other 19 state-of-the-art trackers. The comparison results show that our ThickSiam tracker obtains a precision value of 0.991 and a success value of 0.755 while running at 56.849 FPS implemented on one NVIDIA GTX1070Ti GPU.http://dx.doi.org/10.1080/15481603.2022.2163063high-resolution satellite videossingle object trackingsiamese networkkalman filter
spellingShingle Xiaodong Zhang
Kun Zhu
Guanzhou Chen
Puyun Liao
Xiaoliang Tan
Tong Wang
Xianwei Li
High-resolution satellite video single object tracking based on thicksiam framework
GIScience & Remote Sensing
high-resolution satellite videos
single object tracking
siamese network
kalman filter
title High-resolution satellite video single object tracking based on thicksiam framework
title_full High-resolution satellite video single object tracking based on thicksiam framework
title_fullStr High-resolution satellite video single object tracking based on thicksiam framework
title_full_unstemmed High-resolution satellite video single object tracking based on thicksiam framework
title_short High-resolution satellite video single object tracking based on thicksiam framework
title_sort high resolution satellite video single object tracking based on thicksiam framework
topic high-resolution satellite videos
single object tracking
siamese network
kalman filter
url http://dx.doi.org/10.1080/15481603.2022.2163063
work_keys_str_mv AT xiaodongzhang highresolutionsatellitevideosingleobjecttrackingbasedonthicksiamframework
AT kunzhu highresolutionsatellitevideosingleobjecttrackingbasedonthicksiamframework
AT guanzhouchen highresolutionsatellitevideosingleobjecttrackingbasedonthicksiamframework
AT puyunliao highresolutionsatellitevideosingleobjecttrackingbasedonthicksiamframework
AT xiaoliangtan highresolutionsatellitevideosingleobjecttrackingbasedonthicksiamframework
AT tongwang highresolutionsatellitevideosingleobjecttrackingbasedonthicksiamframework
AT xianweili highresolutionsatellitevideosingleobjecttrackingbasedonthicksiamframework