Orientation- and Scale-Invariant Multi-Vehicle Detection and Tracking from Unmanned Aerial Videos

Along with the advancement of light-weight sensing and processing technologies, unmanned aerial vehicles (UAVs) have recently become popular platforms for intelligent traffic monitoring and control. UAV-mounted cameras can capture traffic-flow videos from various perspectives providing a comprehensi...

Bibliographic Details
Main Authors: Jie Wang, Sandra Simeonova, Mozhdeh Shahbazi
Format: Article
Language: English
Published: MDPI AG 2019-09-01
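Series: Remote Sensing
Subjects: traffic monitoring; vehicle detection; multi-vehicle tracking; vehicle re-identification; unmanned aerial vehicles; deep convolutional neural network
Online Access: https://www.mdpi.com/2072-4292/11/18/2155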
_version_ 1818959986685902848
author Jie Wang
Sandra Simeonova
Mozhdeh Shahbazi
collection DOAJ
description Along with the advancement of lightweight sensing and processing technologies, unmanned aerial vehicles (UAVs) have recently become popular platforms for intelligent traffic monitoring and control. UAV-mounted cameras can capture traffic-flow videos from various perspectives, providing comprehensive insight into road conditions. To analyze traffic flow from remotely captured videos, a reliable and accurate vehicle detection-and-tracking approach is required. In this paper, we propose a deep-learning framework for vehicle detection and tracking from UAV videos for monitoring traffic flow in complex road structures. This approach is designed to be invariant to significant orientation and scale variations in the videos. Detection is performed by fine-tuning a state-of-the-art object detector, You Only Look Once (YOLOv3), on several custom-labeled traffic datasets. Vehicle tracking follows a tracking-by-detection paradigm, in which deep appearance features are used for vehicle re-identification and Kalman filtering is used for motion estimation. The proposed methodology is tested on a variety of real videos collected by UAVs under various conditions, e.g., in late afternoons with long vehicle shadows, at dawn with vehicle lights on, over roundabouts and interchange roads where vehicle directions change considerably, and from various viewpoints where vehicles’ appearance undergoes substantial perspective distortions. The proposed tracking-by-detection approach runs at 11 frames per second on color videos of 2720p resolution. Experiments demonstrated an average detection F1-score of 92.1% and an average multiple-object tracking accuracy (MOTA) of 81.3%. The proposed approach also addresses the frequent identity switching of state-of-the-art multi-object trackers, with only one identity switch per 305 tracked vehicles.
first_indexed 2024-12-20T11:50:21Z
format Article
id doaj.art-c882567b72e4471c8813b274406d788d
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-12-20T11:50:21Z
publishDate 2019-09-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-c882567b72e4471c8813b274406d788d
2022-12-21T19:41:48Z
eng
MDPI AG
Remote Sensing
2072-4292
2019-09-01
volume 11, issue 18, article 2155
DOI 10.3390/rs11182155
rs11182155
Orientation- and Scale-Invariant Multi-Vehicle Detection and Tracking from Unmanned Aerial Videos
Jie Wang (0)
Sandra Simeonova (1)
Mozhdeh Shahbazi (2)
Department of Geomatics Engineering, University of Calgary, 2500 University Drive NW, Calgary, AB T2N 1N4, Canada (affiliation of all three authors)
https://www.mdpi.com/2072-4292/11/18/2155
title Orientation- and Scale-Invariant Multi-Vehicle Detection and Tracking from Unmanned Aerial Videos
topic traffic monitoring
vehicle detection
multi-vehicle tracking
vehicle re-identification
unmanned aerial vehicles
deep convolutional neural network
url https://www.mdpi.com/2072-4292/11/18/2155
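
Note on the reported metrics: the description quotes an average F1-score for detection and an average MOTA for tracking but does not define them. The sketch below uses the commonly cited definitions (F1 as the harmonic mean of precision and recall; MOTA as one minus the ratio of misses, false positives, and identity switches to ground-truth objects). The function names and the counts in the usage lines are hypothetical and are not taken from the article, whose exact evaluation protocol is not given in this record.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall from detection counts."""
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def mota(fn: int, fp: int, id_switches: int, num_gt: int) -> float:
    """Multiple-object tracking accuracy summed over all frames.

    num_gt is the total number of ground-truth objects over all frames.
    """
    if num_gt <= 0:
        raise ValueError("num_gt must be positive")
    return 1.0 - (fn + fp + id_switches) / num_gt


# Hypothetical counts, for illustration only.
example_f1 = f1_score(tp=90, fp=10, fn=10)
example_mota = mota(fn=10, fp=5, id_switches=5, num_gt=100)
```

Because identity switches enter MOTA only as one term in the numerator, a tracker can score well on MOTA while still switching identities often, which is presumably why the description reports the switch rate separately.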