Detecting and tracking objects in RGB-D video

Detecting and tracking objects in videos and images is a rapidly growing field of research. Identifying, recognising, detecting and tracking objects such as humans, cars, obstacles etc. has many applications. There are a large number of methods to perform these tasks. They vary in performan...

Full description

Bibliographic Details
Main Author: Chakrabarty Siddhanta
Other Authors: Chan Kap Luk
Format: Thesis
Language:English
Published: 2015
Subjects:
Online Access:http://hdl.handle.net/10356/64815
_version_ 1811688059281866752
author Chakrabarty Siddhanta
author2 Chan Kap Luk
author_facet Chan Kap Luk
Chakrabarty Siddhanta
author_sort Chakrabarty Siddhanta
collection NTU
description Detecting and tracking objects in videos and images is a rapidly growing field of research. Identifying, recognising, detecting and tracking objects such as humans, cars, obstacles etc. has many applications. There are a large number of methods to perform these tasks. They vary in performance, quality of results, type of results, types of raw data and so on. This project aims to detect and track objects, exclusively from depth video. Depth video is a video sequence captured by a Kinect camera with the pixel index values of each frame being the distance of the real point represented by that pixel from the camera. Detection is performed using a morphological segmentation technique called watershed transform. The detection parameters chosen are derived from the objects of interest in the test dataset. Two methods, edge-based detection and region-based detection, are used for pre-processing and the result with the largest detected area is selected. The two methods complement each other in many cases, making the use of both necessary. The objects of interest in the dataset are human beings. Thus various types of situations have been used to test the efficiency of the algorithm, such as crowded areas, noncrowded areas, single object, multiple objects, occluded objects and non-occluded objects. Challenges arise when multiple objects move across a scene. This problem has been addressed with a method for tracking multiple objects with, theoretically, no limit to the maximum number of objects. However, occlusion deteriorates algorithm performance. Test results are presented and compared to existing methods. While the accuracy and efficiency of the proposed system is moderately high, its implementation is simple, reducing processing time greatly.
first_indexed 2024-10-01T05:26:11Z
format Thesis
id ntu-10356/64815
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:26:11Z
publishDate 2015
record_format dspace
spelling ntu-10356/648152023-07-04T15:46:35Z Detecting and tracking objects in RGB-D video Chakrabarty Siddhanta Chan Kap Luk School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Detecting and tracking objects in videos and images is a rapidly growing field of research. Identifying, recognising, detecting and tracking objects such as humans, cars, obstacles etc. has many applications. There are a large number of methods to perform these tasks. They vary in performance, quality of results, type of results, types of raw data and so on. This project aims to detect and track objects, exclusively from depth video. Depth video is a video sequence captured by a Kinect camera with the pixel index values of each frame being the distance of the real point represented by that pixel from the camera. Detection is performed using a morphological segmentation technique called watershed transform. The detection parameters chosen are derived from the objects of interest in the test dataset. Two methods, edge-based detection and region-based detection, are used for pre-processing and the result with the largest detected area is selected. The two methods complement each other in many cases, making the use of both necessary. The objects of interest in the dataset are human beings. Thus various types of situations have been used to test the efficiency of the algorithm, such as crowded areas, noncrowded areas, single object, multiple objects, occluded objects and non-occluded objects. Challenges arise when multiple objects move across a scene. This problem has been addressed with a method for tracking multiple objects with, theoretically, no limit to the maximum number of objects. However, occlusion deteriorates algorithm performance. Test results are presented and compared to existing methods. While the accuracy and efficiency of the proposed system is moderately high, its implementation is simple, reducing processing time greatly. Master of Science (Signal Processing) 2015-06-04T07:13:43Z 2015-06-04T07:13:43Z 2014 2014 Thesis http://hdl.handle.net/10356/64815 en 60 p. application/pdf
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Chakrabarty Siddhanta
Detecting and tracking objects in RGB-D video
title Detecting and tracking objects in RGB-D video
title_full Detecting and tracking objects in RGB-D video
title_fullStr Detecting and tracking objects in RGB-D video
title_full_unstemmed Detecting and tracking objects in RGB-D video
title_short Detecting and tracking objects in RGB-D video
title_sort detecting and tracking objects in rgb d video
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
url http://hdl.handle.net/10356/64815
work_keys_str_mv AT chakrabartysiddhanta detectingandtrackingobjectsinrgbdvideo