Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification

Abstract Aircraft tracking in aerial videos is a challenge due to the small size and appearance affinity with other targets. Recently, the tracking‐by‐detection trackers, such as simple online and realtime tracking (SORT), attracted the attention of many researchers due to the great evolution in dee...

Full description

Bibliographic Details
Main Authors: M. Ahmed, Ali Maher, X. Bai
Format: Article
Language:English
Published: Wiley 2023-02-01
Series:IET Image Processing
Online Access:https://doi.org/10.1049/ipr2.12665
_version_ 1828012437918449664
author M. Ahmed
Ali Maher
X. Bai
author_facet M. Ahmed
Ali Maher
X. Bai
author_sort M. Ahmed
collection DOAJ
description Abstract Aircraft tracking in aerial videos is a challenge due to the small size and appearance affinity with other targets. Recently, the tracking‐by‐detection trackers, such as simple online and realtime tracking (SORT), attracted the attention of many researchers due to the great evolution in deep learning‐based object detectors. SORT depends on motion cues in solving the data association problem by matching high‐score detection bounding boxes to tracklets. However, motion cues are insufficient to track occluded or similar aircraft efficiently. In addition, most occluded aircrafts have a low‐confidence percentage. The main objective of this paper is to propose an F‐SORT tracker, an enhanced SORT tracker with the following improvements. Firstly, fused RetinaNet for small aerial targets is used as a detection framework to boost the detectability of SORT. Then, aircraft appearance features are extracted from the shallowest prediction layer and utilized as additional cues to improve the data association algorithm. Finally, the extracted features distinguish aircraft online from backgrounds via a K‐nearest neighbour classifier. Experiments on an airport dataset show that F‐SORT can significantly improve aircraft tracking. F‐SORT outperforms other state‐of‐the‐art trackers, achieving 72.75%, 59.63%, and 82.89% on MOTA, HOTA, and IDF1. In addition, F‐SORT classifying algorithm boosts other trackers when applied.
first_indexed 2024-04-10T09:32:33Z
format Article
id doaj.art-f0a4eb947e294b3481453666be154bf5
institution Directory Open Access Journal
issn 1751-9659
1751-9667
language English
last_indexed 2024-04-10T09:32:33Z
publishDate 2023-02-01
publisher Wiley
record_format Article
series IET Image Processing
spelling doaj.art-f0a4eb947e294b3481453666be154bf52023-02-19T04:18:32ZengWileyIET Image Processing1751-96591751-96672023-02-0117368770810.1049/ipr2.12665Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classificationM. Ahmed0Ali Maher1X. Bai2Image Processing Center Beihang University Beijing ChinaMilitary Technical College Cairo EgyptImage Processing Center Beihang University Beijing ChinaAbstract Aircraft tracking in aerial videos is a challenge due to the small size and appearance affinity with other targets. Recently, the tracking‐by‐detection trackers, such as simple online and realtime tracking (SORT), attracted the attention of many researchers due to the great evolution in deep learning‐based object detectors. SORT depends on motion cues in solving the data association problem by matching high‐score detection bounding boxes to tracklets. However, motion cues are insufficient to track occluded or similar aircraft efficiently. In addition, most occluded aircrafts have a low‐confidence percentage. The main objective of this paper is to propose an F‐SORT tracker, an enhanced SORT tracker with the following improvements. Firstly, fused RetinaNet for small aerial targets is used as a detection framework to boost the detectability of SORT. Then, aircraft appearance features are extracted from the shallowest prediction layer and utilized as additional cues to improve the data association algorithm. Finally, the extracted features distinguish aircraft online from backgrounds via a K‐nearest neighbour classifier. Experiments on an airport dataset show that F‐SORT can significantly improve aircraft tracking. F‐SORT outperforms other state‐of‐the‐art trackers, achieving 72.75%, 59.63%, and 82.89% on MOTA, HOTA, and IDF1. In addition, F‐SORT classifying algorithm boosts other trackers when applied.https://doi.org/10.1049/ipr2.12665
spellingShingle M. Ahmed
Ali Maher
X. Bai
Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification
IET Image Processing
title Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification
title_full Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification
title_fullStr Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification
title_full_unstemmed Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification
title_short Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification
title_sort aircraft tracking in aerial videos based on fused retinanet and low score detection classification
url https://doi.org/10.1049/ipr2.12665
work_keys_str_mv AT mahmed aircrafttrackinginaerialvideosbasedonfusedretinanetandlowscoredetectionclassification
AT alimaher aircrafttrackinginaerialvideosbasedonfusedretinanetandlowscoredetectionclassification
AT xbai aircrafttrackinginaerialvideosbasedonfusedretinanetandlowscoredetectionclassification