Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification
Abstract Aircraft tracking in aerial videos is a challenge due to the small size and appearance affinity with other targets. Recently, the tracking‐by‐detection trackers, such as simple online and realtime tracking (SORT), attracted the attention of many researchers due to the great evolution in dee...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2023-02-01
|
Series: | IET Image Processing |
Online Access: | https://doi.org/10.1049/ipr2.12665 |
_version_ | 1828012437918449664 |
---|---|
author | M. Ahmed Ali Maher X. Bai |
author_facet | M. Ahmed Ali Maher X. Bai |
author_sort | M. Ahmed |
collection | DOAJ |
description | Abstract Aircraft tracking in aerial videos is a challenge due to the small size and appearance affinity with other targets. Recently, the tracking‐by‐detection trackers, such as simple online and realtime tracking (SORT), attracted the attention of many researchers due to the great evolution in deep learning‐based object detectors. SORT depends on motion cues in solving the data association problem by matching high‐score detection bounding boxes to tracklets. However, motion cues are insufficient to track occluded or similar aircraft efficiently. In addition, most occluded aircrafts have a low‐confidence percentage. The main objective of this paper is to propose an F‐SORT tracker, an enhanced SORT tracker with the following improvements. Firstly, fused RetinaNet for small aerial targets is used as a detection framework to boost the detectability of SORT. Then, aircraft appearance features are extracted from the shallowest prediction layer and utilized as additional cues to improve the data association algorithm. Finally, the extracted features distinguish aircraft online from backgrounds via a K‐nearest neighbour classifier. Experiments on an airport dataset show that F‐SORT can significantly improve aircraft tracking. F‐SORT outperforms other state‐of‐the‐art trackers, achieving 72.75%, 59.63%, and 82.89% on MOTA, HOTA, and IDF1. In addition, F‐SORT classifying algorithm boosts other trackers when applied. |
first_indexed | 2024-04-10T09:32:33Z |
format | Article |
id | doaj.art-f0a4eb947e294b3481453666be154bf5 |
institution | Directory Open Access Journal |
issn | 1751-9659 1751-9667 |
language | English |
last_indexed | 2024-04-10T09:32:33Z |
publishDate | 2023-02-01 |
publisher | Wiley |
record_format | Article |
series | IET Image Processing |
spelling | doaj.art-f0a4eb947e294b3481453666be154bf52023-02-19T04:18:32ZengWileyIET Image Processing1751-96591751-96672023-02-0117368770810.1049/ipr2.12665Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classificationM. Ahmed0Ali Maher1X. Bai2Image Processing Center Beihang University Beijing ChinaMilitary Technical College Cairo EgyptImage Processing Center Beihang University Beijing ChinaAbstract Aircraft tracking in aerial videos is a challenge due to the small size and appearance affinity with other targets. Recently, the tracking‐by‐detection trackers, such as simple online and realtime tracking (SORT), attracted the attention of many researchers due to the great evolution in deep learning‐based object detectors. SORT depends on motion cues in solving the data association problem by matching high‐score detection bounding boxes to tracklets. However, motion cues are insufficient to track occluded or similar aircraft efficiently. In addition, most occluded aircrafts have a low‐confidence percentage. The main objective of this paper is to propose an F‐SORT tracker, an enhanced SORT tracker with the following improvements. Firstly, fused RetinaNet for small aerial targets is used as a detection framework to boost the detectability of SORT. Then, aircraft appearance features are extracted from the shallowest prediction layer and utilized as additional cues to improve the data association algorithm. Finally, the extracted features distinguish aircraft online from backgrounds via a K‐nearest neighbour classifier. Experiments on an airport dataset show that F‐SORT can significantly improve aircraft tracking. F‐SORT outperforms other state‐of‐the‐art trackers, achieving 72.75%, 59.63%, and 82.89% on MOTA, HOTA, and IDF1. In addition, F‐SORT classifying algorithm boosts other trackers when applied.https://doi.org/10.1049/ipr2.12665 |
spellingShingle | M. Ahmed Ali Maher X. Bai Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification IET Image Processing |
title | Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification |
title_full | Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification |
title_fullStr | Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification |
title_full_unstemmed | Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification |
title_short | Aircraft tracking in aerial videos based on fused RetinaNet and low‐score detection classification |
title_sort | aircraft tracking in aerial videos based on fused retinanet and low score detection classification |
url | https://doi.org/10.1049/ipr2.12665 |
work_keys_str_mv | AT mahmed aircrafttrackinginaerialvideosbasedonfusedretinanetandlowscoredetectionclassification AT alimaher aircrafttrackinginaerialvideosbasedonfusedretinanetandlowscoredetectionclassification AT xbai aircrafttrackinginaerialvideosbasedonfusedretinanetandlowscoredetectionclassification |