Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence)

This report motive is to produce a robust real-time CCTV monitoring system aid to assist security guards, using only input video footage an object detection machine learning model and a tracking algorithm. The objective of this report is to investigate and determine which object tracking machin...

Full description

Bibliographic Details
Main Author: Shee Thoo, Sean Jun Hao
Other Authors: Yap Kim Hui
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/157722
_version_ 1826128106665017344
author Shee Thoo, Sean Jun Hao
author2 Yap Kim Hui
author_facet Yap Kim Hui
Shee Thoo, Sean Jun Hao
author_sort Shee Thoo, Sean Jun Hao
collection NTU
description This report motive is to produce a robust real-time CCTV monitoring system aid to assist security guards, using only input video footage an object detection machine learning model and a tracking algorithm. The objective of this report is to investigate and determine which object tracking machine learning model and tracking algorithm is best for the task. The report presents the precision, recall, and F1 score of different versions of the state-of-the art object detection model family, You Only Look Once (YOLO), and the precision, swapping of track identification rates, new track identification rates, and speed of the state of-the-art multiple object-tracking algorithm family, Simple Online and Realtime Tracking (SORT). Different object detection machine learning models were trained and validated on 60,796 images of people in different angles, background, field of view and different scenarios. Once the best object detector was determined it was combined with different tracking algorithms and benchmarked on a labelled testing dataset consisting of 697 images/frames. The results of the research shows, in terms of detection YOLOv5-XLarge had the best detection performance while YOLOv5-Nano had the fastest speed. However, for a real-time CCTV monitoring system a balance between the detection performance and speed is key hence YOLOv5-Small is the most suitable for the task. In terms of tracking, Deep SORT with osnet_x0_5 has the best average precision while SORT has the best speed. While SORT has the best speed, Deep SORT with osnet_x0_5 speed is comparable to SORT but with a much larger precision value and consistency of tracking. Hence Deep SORT with osnet_x0_5 would be chosen as the tracking algorithm. Hence to produce a CCTV monitoring system to aid security guards, object detection from YOLOv5-Small Machine Learning Model would be combined with tracking using Deep SORT with osnet_x0_5 tracking algorithm.
first_indexed 2024-10-01T07:19:30Z
format Final Year Project (FYP)
id ntu-10356/157722
institution Nanyang Technological University
language English
last_indexed 2024-10-01T07:19:30Z
publishDate 2022
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1577222023-07-07T19:07:00Z Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence) Shee Thoo, Sean Jun Hao Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Engineering::Electrical and electronic engineering This report motive is to produce a robust real-time CCTV monitoring system aid to assist security guards, using only input video footage an object detection machine learning model and a tracking algorithm. The objective of this report is to investigate and determine which object tracking machine learning model and tracking algorithm is best for the task. The report presents the precision, recall, and F1 score of different versions of the state-of-the art object detection model family, You Only Look Once (YOLO), and the precision, swapping of track identification rates, new track identification rates, and speed of the state of-the-art multiple object-tracking algorithm family, Simple Online and Realtime Tracking (SORT). Different object detection machine learning models were trained and validated on 60,796 images of people in different angles, background, field of view and different scenarios. Once the best object detector was determined it was combined with different tracking algorithms and benchmarked on a labelled testing dataset consisting of 697 images/frames. The results of the research shows, in terms of detection YOLOv5-XLarge had the best detection performance while YOLOv5-Nano had the fastest speed. However, for a real-time CCTV monitoring system a balance between the detection performance and speed is key hence YOLOv5-Small is the most suitable for the task. In terms of tracking, Deep SORT with osnet_x0_5 has the best average precision while SORT has the best speed. While SORT has the best speed, Deep SORT with osnet_x0_5 speed is comparable to SORT but with a much larger precision value and consistency of tracking. Hence Deep SORT with osnet_x0_5 would be chosen as the tracking algorithm. Hence to produce a CCTV monitoring system to aid security guards, object detection from YOLOv5-Small Machine Learning Model would be combined with tracking using Deep SORT with osnet_x0_5 tracking algorithm. Bachelor of Engineering (Electrical and Electronic Engineering) 2022-05-18T06:56:35Z 2022-05-18T06:56:35Z 2022 Final Year Project (FYP) Shee Thoo, S. J. H. (2022). Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence). Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/157722 https://hdl.handle.net/10356/157722 en application/pdf Nanyang Technological University
spellingShingle Engineering::Electrical and electronic engineering
Shee Thoo, Sean Jun Hao
Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence)
title Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence)
title_full Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence)
title_fullStr Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence)
title_full_unstemmed Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence)
title_short Visual recognition using artificial intelligence (person detection and tracking using artificial intelligence)
title_sort visual recognition using artificial intelligence person detection and tracking using artificial intelligence
topic Engineering::Electrical and electronic engineering
url https://hdl.handle.net/10356/157722
work_keys_str_mv AT sheethooseanjunhao visualrecognitionusingartificialintelligencepersondetectionandtrackingusingartificialintelligence