Embedded Descriptor Generation in Faster R-CNN for Multi-Object Tracking

With the rapid growth of computer usage to extract the required knowledge from a huge amount of information, such as a video file, significant attention has been brought towards multi-object detection and tracking. Artificial Neural Networks (ANNs) have shown outstanding performance in multi-object...

Full description

Bibliographic Details
Main Authors: Younis Younis, Khalil Alsaif
Format: Article
Language:Arabic
Published: Mosul University 2021-12-01
Series:Al-Rafidain Journal of Computer Sciences and Mathematics
Subjects:
Online Access:https://csmj.mosuljournals.com/article_170013_6beb8ac2b3e9a8484bf055a6eaf992ee.pdf
_version_ 1818330341589385216
author Younis Younis
Khalil Alsaif
author_facet Younis Younis
Khalil Alsaif
author_sort Younis Younis
collection DOAJ
description With the rapid growth of computer usage to extract the required knowledge from a huge amount of information, such as a video file, significant attention has been brought towards multi-object detection and tracking. Artificial Neural Networks (ANNs) have shown outstanding performance in multi-object detection, especially the Faster R-CNN network. In this study, a new method is proposed for multi-object tracking based on descriptors generated by a neural network that is embedded in the Faster R-CNN. This embedding allows the proposed method to directly output a descriptor for each object detected by the Faster R-CNN, based on the features detected by the Faster R-CNN to detect the object. The use of these features allows the proposed method to output accurate values rapidly, as these features are already computed for the detection and have been able to provide outstanding performance in the detection stage. The descriptors that are collected from the proposed method are then clustered into a number of clusters equal to the number of objects detected in the first frame of the video. Then, for further frames, the number of clusters is increased until the distance between the centroid of the newly created cluster and the nearest centroid is less than the average distance among the centroids. Newly added clusters are considered for new objects, whereas older ones are kept in case the object reappears in the video. The proposed method is evaluated using the UA-DETRAC (University at Albany Detection and Tracking) dataset and has been able to achieve 64.8% MOTA and 83.6% MOTP, with a processing speed of 127.3 frames per second.
first_indexed 2024-12-13T13:02:25Z
format Article
id doaj.art-ecf2ac341ec2403c9f82e4d44f7b4a33
institution Directory Open Access Journal
issn 1815-4816
2311-7990
language Arabic
last_indexed 2024-12-13T13:02:25Z
publishDate 2021-12-01
publisher Mosul University
record_format Article
series Al-Rafidain Journal of Computer Sciences and Mathematics
spelling doaj.art-ecf2ac341ec2403c9f82e4d44f7b4a332022-12-21T23:44:57ZaraMosul UniversityAl-Rafidain Journal of Computer Sciences and Mathematics1815-48162311-79902021-12-011529110210.33899/csmj.2021.170013170013Embedded Descriptor Generation in Faster R-CNN for Multi-Object TrackingYounis Younis0Khalil Alsaif1Department of Computer Science College of Education for Pure Science, University of Mosul, Mosul, IraqCollege of Computer Sciences and Mathematics University of MosulWith the rapid growth of computer usage to extract the required knowledge from a huge amount of information, such as a video file, significant attention has been brought towards multi-object detection and tracking. Artificial Neural Networks (ANNs) have shown outstanding performance in multi-object detection, especially the Faster R-CNN network. In this study, a new method is proposed for multi-object tracking based on descriptors generated by a neural network that is embedded in the Faster R-CNN. This embedding allows the proposed method to directly output a descriptor for each object detected by the Faster R-CNN, based on the features detected by the Faster R-CNN to detect the object. The use of these features allows the proposed method to output accurate values rapidly, as these features are already computed for the detection and have been able to provide outstanding performance in the detection stage. The descriptors that are collected from the proposed method are then clustered into a number of clusters equal to the number of objects detected in the first frame of the video. Then, for further frames, the number of clusters is increased until the distance between the centroid of the newly created cluster and the nearest centroid is less than the average distance among the centroids. Newly added clusters are considered for new objects, whereas older ones are kept in case the object reappears in the video. The proposed method is evaluated using the UA-DETRAC (University at Albany Detection and Tracking) dataset and has been able to achieve 64.8% MOTA and 83.6% MOTP, with a processing speed of 127.3 frames per second.https://csmj.mosuljournals.com/article_170013_6beb8ac2b3e9a8484bf055a6eaf992ee.pdfconvolutional neural networksmulti-object detectionmulti-object tracking
spellingShingle Younis Younis
Khalil Alsaif
Embedded Descriptor Generation in Faster R-CNN for Multi-Object Tracking
Al-Rafidain Journal of Computer Sciences and Mathematics
convolutional neural networks
multi-object detection
multi-object tracking
title Embedded Descriptor Generation in Faster R-CNN for Multi-Object Tracking
title_full Embedded Descriptor Generation in Faster R-CNN for Multi-Object Tracking
title_fullStr Embedded Descriptor Generation in Faster R-CNN for Multi-Object Tracking
title_full_unstemmed Embedded Descriptor Generation in Faster R-CNN for Multi-Object Tracking
title_short Embedded Descriptor Generation in Faster R-CNN for Multi-Object Tracking
title_sort embedded descriptor generation in faster r cnn for multi object tracking
topic convolutional neural networks
multi-object detection
multi-object tracking
url https://csmj.mosuljournals.com/article_170013_6beb8ac2b3e9a8484bf055a6eaf992ee.pdf
work_keys_str_mv AT younisyounis embeddeddescriptorgenerationinfasterrcnnformultiobjecttracking
AT khalilalsaif embeddeddescriptorgenerationinfasterrcnnformultiobjecttracking