PAE: Portable Appearance Extension for Multiple Object Detection and Tracking in Traffic Scenes

Multi-object tracking (MOT) is an important field in computer vision that provides a critical understanding of video analysis in various applications, such as vehicle tracking in intelligent transportation systems (ITS). Several deep learning-based approaches have been introduced to basic motion and...

Full description

Bibliographic Details
Main Authors: Ibrahim Soliman Mohamed, Lim Kim Chuan
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9737499/
_version_ 1811271673401311232
author Ibrahim Soliman Mohamed
Lim Kim Chuan
author_facet Ibrahim Soliman Mohamed
Lim Kim Chuan
author_sort Ibrahim Soliman Mohamed
collection DOAJ
description Multi-object tracking (MOT) is an important field in computer vision that provides a critical understanding of video analysis in various applications, such as vehicle tracking in intelligent transportation systems (ITS). Several deep learning-based approaches have been introduced to basic motion and IoU trackers by extracting appearance features to assist in challenging situations such as lossy detection and occlusion. This study proposes a portable appearance extension (PAE) for single-stage object detection to jointly detect and extract appearance embeddings using a shared model. Furthermore, a novel training framework with a single image and without re-identification annotations is presented using an augmentation module, saving a tremendous amount of human labeling effort and increasing the real-world application adoption rate. Using UA-DETRAC dataset, RetinaNet-PAE and SSD-PAE achieve comparable results with current state-of-the-art models, where RetinaNet-PAE prioritizes detection and tracking performance with a 58.0% HOTA score and 4 FPS. In contrast, SSD-PAE prioritizes latency performance with a 47.3% HOTA score and 40 FPS.
first_indexed 2024-04-12T22:24:08Z
format Article
id doaj.art-83c3d913975f462a8d46be461b6182ab
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-12T22:24:08Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-83c3d913975f462a8d46be461b6182ab2022-12-22T03:14:13ZengIEEEIEEE Access2169-35362022-01-0110372573726810.1109/ACCESS.2022.31604249737499PAE: Portable Appearance Extension for Multiple Object Detection and Tracking in Traffic ScenesIbrahim Soliman Mohamed0https://orcid.org/0000-0003-4924-8851Lim Kim Chuan1https://orcid.org/0000-0002-0374-1993Machine Learning and Signal Processing, Centre for Telecommunication Research & Innovation, Faculty of Electronics and Computer Engineering, Universiti Teknikal Malaysia Melaka, Durian Tunggal, MalaysiaMachine Learning and Signal Processing, Centre for Telecommunication Research & Innovation, Faculty of Electronics and Computer Engineering, Universiti Teknikal Malaysia Melaka, Durian Tunggal, MalaysiaMulti-object tracking (MOT) is an important field in computer vision that provides a critical understanding of video analysis in various applications, such as vehicle tracking in intelligent transportation systems (ITS). Several deep learning-based approaches have been introduced to basic motion and IoU trackers by extracting appearance features to assist in challenging situations such as lossy detection and occlusion. This study proposes a portable appearance extension (PAE) for single-stage object detection to jointly detect and extract appearance embeddings using a shared model. Furthermore, a novel training framework with a single image and without re-identification annotations is presented using an augmentation module, saving a tremendous amount of human labeling effort and increasing the real-world application adoption rate. Using UA-DETRAC dataset, RetinaNet-PAE and SSD-PAE achieve comparable results with current state-of-the-art models, where RetinaNet-PAE prioritizes detection and tracking performance with a 58.0% HOTA score and 4 FPS. In contrast, SSD-PAE prioritizes latency performance with a 47.3% HOTA score and 40 FPS.https://ieeexplore.ieee.org/document/9737499/Computer visionobject detection and trackingappearance embeddingvehicle tracking
spellingShingle Ibrahim Soliman Mohamed
Lim Kim Chuan
PAE: Portable Appearance Extension for Multiple Object Detection and Tracking in Traffic Scenes
IEEE Access
Computer vision
object detection and tracking
appearance embedding
vehicle tracking
title PAE: Portable Appearance Extension for Multiple Object Detection and Tracking in Traffic Scenes
title_full PAE: Portable Appearance Extension for Multiple Object Detection and Tracking in Traffic Scenes
title_fullStr PAE: Portable Appearance Extension for Multiple Object Detection and Tracking in Traffic Scenes
title_full_unstemmed PAE: Portable Appearance Extension for Multiple Object Detection and Tracking in Traffic Scenes
title_short PAE: Portable Appearance Extension for Multiple Object Detection and Tracking in Traffic Scenes
title_sort pae portable appearance extension for multiple object detection and tracking in traffic scenes
topic Computer vision
object detection and tracking
appearance embedding
vehicle tracking
url https://ieeexplore.ieee.org/document/9737499/
work_keys_str_mv AT ibrahimsolimanmohamed paeportableappearanceextensionformultipleobjectdetectionandtrackingintrafficscenes
AT limkimchuan paeportableappearanceextensionformultipleobjectdetectionandtrackingintrafficscenes