Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor

In this work, we address the problem of multi-vehicle detection and tracking for traffic monitoring applications. We preset a novel intelligent visual sensor for tracking-by-detection with simultaneous pose estimation. Essentially, we adapt an Extended Kalman Filter (EKF) to work not only with the d...

Full description

Bibliographic Details
Main Authors: Roberto J. López-Sastre, Carlos Herranz-Perdiguero, Ricardo Guerrero-Gómez-Olmedo, Daniel Oñoro-Rubio, Saturnino Maldonado-Bascón
Format: Article
Language:English
Published: MDPI AG 2019-09-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/19/19/4062
_version_ 1817992813891026944
author Roberto J. López-Sastre
Carlos Herranz-Perdiguero
Ricardo Guerrero-Gómez-Olmedo
Daniel Oñoro-Rubio
Saturnino Maldonado-Bascón
author_facet Roberto J. López-Sastre
Carlos Herranz-Perdiguero
Ricardo Guerrero-Gómez-Olmedo
Daniel Oñoro-Rubio
Saturnino Maldonado-Bascón
author_sort Roberto J. López-Sastre
collection DOAJ
description In this work, we address the problem of multi-vehicle detection and tracking for traffic monitoring applications. We preset a novel intelligent visual sensor for tracking-by-detection with simultaneous pose estimation. Essentially, we adapt an Extended Kalman Filter (EKF) to work not only with the detections of the vehicles but also with their estimated coarse viewpoints, directly obtained with the vision sensor. We show that enhancing the tracking with observations of the vehicle pose, results in a better estimation of the vehicles trajectories. For the simultaneous object detection and viewpoint estimation task, we present and evaluate two independent solutions. One is based on a fast GPU implementation of a Histogram of Oriented Gradients (HOG) detector with Support Vector Machines (SVMs). For the second, we adequately modify and train the Faster R-CNN deep learning model, in order to recover from it not only the object localization but also an estimation of its pose. Finally, we publicly release a challenging dataset, the GRAM Road Traffic Monitoring (GRAM-RTM), which has been especially designed for evaluating multi-vehicle tracking approaches within the context of traffic monitoring applications. It comprises more than 700 unique vehicles annotated across more than 40.300 frames of three videos. We expect the GRAM-RTM becomes a benchmark in vehicle detection and tracking, providing the computer vision and intelligent transportation systems communities with a standard set of images, annotations and evaluation procedures for multi-vehicle tracking. We present a thorough experimental evaluation of our approaches with the GRAM-RTM, which will be useful for establishing further comparisons. The results obtained confirm that the simultaneous integration of vehicle localizations and pose estimations as observations in an EKF, improves the tracking results.
first_indexed 2024-04-14T01:31:18Z
format Article
id doaj.art-af266947dd06450e9e18a83cc6f0a544
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-04-14T01:31:18Z
publishDate 2019-09-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-af266947dd06450e9e18a83cc6f0a5442022-12-22T02:20:12ZengMDPI AGSensors1424-82202019-09-011919406210.3390/s19194062s19194062Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation SensorRoberto J. López-Sastre0Carlos Herranz-Perdiguero1Ricardo Guerrero-Gómez-Olmedo2Daniel Oñoro-Rubio3Saturnino Maldonado-Bascón4GRAM, Department of Signal Theory and Communications, University of Alcalá, 28805 Alcalá de Henares, SpainGRAM, Department of Signal Theory and Communications, University of Alcalá, 28805 Alcalá de Henares, SpainBBVA Next Technologies, 28050 Madrid, SpainNEC Labs Europe, Kurfürsten-Anlage 36, 69115 Heidelberg, GermanyGRAM, Department of Signal Theory and Communications, University of Alcalá, 28805 Alcalá de Henares, SpainIn this work, we address the problem of multi-vehicle detection and tracking for traffic monitoring applications. We preset a novel intelligent visual sensor for tracking-by-detection with simultaneous pose estimation. Essentially, we adapt an Extended Kalman Filter (EKF) to work not only with the detections of the vehicles but also with their estimated coarse viewpoints, directly obtained with the vision sensor. We show that enhancing the tracking with observations of the vehicle pose, results in a better estimation of the vehicles trajectories. For the simultaneous object detection and viewpoint estimation task, we present and evaluate two independent solutions. One is based on a fast GPU implementation of a Histogram of Oriented Gradients (HOG) detector with Support Vector Machines (SVMs). For the second, we adequately modify and train the Faster R-CNN deep learning model, in order to recover from it not only the object localization but also an estimation of its pose. Finally, we publicly release a challenging dataset, the GRAM Road Traffic Monitoring (GRAM-RTM), which has been especially designed for evaluating multi-vehicle tracking approaches within the context of traffic monitoring applications. It comprises more than 700 unique vehicles annotated across more than 40.300 frames of three videos. We expect the GRAM-RTM becomes a benchmark in vehicle detection and tracking, providing the computer vision and intelligent transportation systems communities with a standard set of images, annotations and evaluation procedures for multi-vehicle tracking. We present a thorough experimental evaluation of our approaches with the GRAM-RTM, which will be useful for establishing further comparisons. The results obtained confirm that the simultaneous integration of vehicle localizations and pose estimations as observations in an EKF, improves the tracking results.https://www.mdpi.com/1424-8220/19/19/4062traffic monitoring sensorvehicle trackingvehicle detectiontracking by detectionviewpoint estimationsmart city
spellingShingle Roberto J. López-Sastre
Carlos Herranz-Perdiguero
Ricardo Guerrero-Gómez-Olmedo
Daniel Oñoro-Rubio
Saturnino Maldonado-Bascón
Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor
Sensors
traffic monitoring sensor
vehicle tracking
vehicle detection
tracking by detection
viewpoint estimation
smart city
title Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor
title_full Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor
title_fullStr Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor
title_full_unstemmed Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor
title_short Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor
title_sort boosting multi vehicle tracking with a joint object detection and viewpoint estimation sensor
topic traffic monitoring sensor
vehicle tracking
vehicle detection
tracking by detection
viewpoint estimation
smart city
url https://www.mdpi.com/1424-8220/19/19/4062
work_keys_str_mv AT robertojlopezsastre boostingmultivehicletrackingwithajointobjectdetectionandviewpointestimationsensor
AT carlosherranzperdiguero boostingmultivehicletrackingwithajointobjectdetectionandviewpointestimationsensor
AT ricardoguerrerogomezolmedo boostingmultivehicletrackingwithajointobjectdetectionandviewpointestimationsensor
AT danielonororubio boostingmultivehicletrackingwithajointobjectdetectionandviewpointestimationsensor
AT saturninomaldonadobascon boostingmultivehicletrackingwithajointobjectdetectionandviewpointestimationsensor