LMOT: Efficient Light-Weight Detection and Tracking in Crowds

Multi-object tracking is a vital component in various robotics and computer vision applications. However, existing multi-object tracking techniques trade off computation runtime for tracking accuracy leading to challenges in deploying such pipelines in real-time applications. This paper introduces a...

Full description

Bibliographic Details
Main Authors:	Rana Mostafa, Hoda Baraka, Abdelmoniem Bayoumi
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Multi-object tracking pedestrian tracking joint detection and tracking object detection deep learning
Online Access:	https://ieeexplore.ieee.org/document/9852199/

_version_	1828299641391677440
author	Rana Mostafa Hoda Baraka Abdelmoniem Bayoumi
author_facet	Rana Mostafa Hoda Baraka Abdelmoniem Bayoumi
author_sort	Rana Mostafa
collection	DOAJ
description	Multi-object tracking is a vital component in various robotics and computer vision applications. However, existing multi-object tracking techniques trade off computation runtime for tracking accuracy leading to challenges in deploying such pipelines in real-time applications. This paper introduces a novel real-time model, LMOT, i.e., Light-weight Multi-Object Tracker, that performs joint pedestrian detection and tracking. LMOT introduces a simplified DLA-34 encoder network to extract detection features for the current image that are computationally efficient. Furthermore, we generate efficient tracking features using a linear transformer for the prior image frame and its corresponding detection heatmap. After that, LMOT fuses both detection and tracking feature maps in a multi-layer scheme and performs a two-stage online data association relying on the Kalman filter to generate tracklets. We evaluated our model on the challenging real-world MOT16/17/20 datasets, showing LMOT significantly outperforms the state-of-the-art trackers concerning runtime while maintaining high robustness. LMOT is approximately ten times faster than state-of-the-art trackers while being only 3.8% behind in performance accuracy on average leading to a much computationally lighter model.
first_indexed	2024-04-13T12:57:38Z
format	Article
id	doaj.art-55277569243046348e990e4d951b6083
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-04-13T12:57:38Z
publishDate	2022-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-55277569243046348e990e4d951b60832022-12-22T02:46:00ZengIEEEIEEE Access2169-35362022-01-0110830858309510.1109/ACCESS.2022.31971579852199LMOT: Efficient Light-Weight Detection and Tracking in CrowdsRana Mostafa0https://orcid.org/0000-0001-9097-1449Hoda Baraka1Abdelmoniem Bayoumi2https://orcid.org/0000-0002-1334-8095Department of Computer Engineering, Faculty of Engineering, Cairo University, Giza, EgyptDepartment of Computer Engineering, Faculty of Engineering, Cairo University, Giza, EgyptDepartment of Computer Engineering, Faculty of Engineering, Cairo University, Giza, EgyptMulti-object tracking is a vital component in various robotics and computer vision applications. However, existing multi-object tracking techniques trade off computation runtime for tracking accuracy leading to challenges in deploying such pipelines in real-time applications. This paper introduces a novel real-time model, LMOT, i.e., Light-weight Multi-Object Tracker, that performs joint pedestrian detection and tracking. LMOT introduces a simplified DLA-34 encoder network to extract detection features for the current image that are computationally efficient. Furthermore, we generate efficient tracking features using a linear transformer for the prior image frame and its corresponding detection heatmap. After that, LMOT fuses both detection and tracking feature maps in a multi-layer scheme and performs a two-stage online data association relying on the Kalman filter to generate tracklets. We evaluated our model on the challenging real-world MOT16/17/20 datasets, showing LMOT significantly outperforms the state-of-the-art trackers concerning runtime while maintaining high robustness. LMOT is approximately ten times faster than state-of-the-art trackers while being only 3.8% behind in performance accuracy on average leading to a much computationally lighter model.https://ieeexplore.ieee.org/document/9852199/Multi-object trackingpedestrian trackingjoint detection and trackingobject detectiondeep learning
spellingShingle	Rana Mostafa Hoda Baraka Abdelmoniem Bayoumi LMOT: Efficient Light-Weight Detection and Tracking in Crowds IEEE Access Multi-object tracking pedestrian tracking joint detection and tracking object detection deep learning
title	LMOT: Efficient Light-Weight Detection and Tracking in Crowds
title_full	LMOT: Efficient Light-Weight Detection and Tracking in Crowds
title_fullStr	LMOT: Efficient Light-Weight Detection and Tracking in Crowds
title_full_unstemmed	LMOT: Efficient Light-Weight Detection and Tracking in Crowds
title_short	LMOT: Efficient Light-Weight Detection and Tracking in Crowds
title_sort	lmot efficient light weight detection and tracking in crowds
topic	Multi-object tracking pedestrian tracking joint detection and tracking object detection deep learning
url	https://ieeexplore.ieee.org/document/9852199/
work_keys_str_mv	AT ranamostafa lmotefficientlightweightdetectionandtrackingincrowds AT hodabaraka lmotefficientlightweightdetectionandtrackingincrowds AT abdelmoniembayoumi lmotefficientlightweightdetectionandtrackingincrowds

LMOT: Efficient Light-Weight Detection and Tracking in Crowds

Similar Items