Detecting and tracking objects in RGB-D video

Detecting and tracking objects in videos and images is a rapidly growing field of research. Identifying, recognising, detecting and tracking objects such as humans, cars, obstacles etc. has many applications. There are a large number of methods to perform these tasks. They vary in performan...

Full description

Bibliographic Details
Main Author:	Chakrabarty Siddhanta
Other Authors:	Chan Kap Luk
Format:	Thesis
Language:	English
Published:	2015
Subjects:	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Online Access:	http://hdl.handle.net/10356/64815

_version_	1826121036552208384
author	Chakrabarty Siddhanta
author2	Chan Kap Luk
author_facet	Chan Kap Luk Chakrabarty Siddhanta
author_sort	Chakrabarty Siddhanta
collection	NTU
description	Detecting and tracking objects in videos and images is a rapidly growing field of research. Identifying, recognising, detecting and tracking objects such as humans, cars, obstacles etc. has many applications. There are a large number of methods to perform these tasks. They vary in performance, quality of results, type of results, types of raw data and so on. This project aims to detect and track objects, exclusively from depth video. Depth video is a video sequence captured by a Kinect camera with the pixel index values of each frame being the distance of the real point represented by that pixel from the camera. Detection is performed using a morphological segmentation technique called watershed transform. The detection parameters chosen are derived from the objects of interest in the test dataset. Two methods, edge-based detection and region-based detection, are used for pre-processing and the result with the largest detected area is selected. The two methods complement each other in many cases, making the use of both necessary. The objects of interest in the dataset are human beings. Thus various types of situations have been used to test the efficiency of the algorithm, such as crowded areas, noncrowded areas, single object, multiple objects, occluded objects and non-occluded objects. Challenges arise when multiple objects move across a scene. This problem has been addressed with a method for tracking multiple objects with, theoretically, no limit to the maximum number of objects. However, occlusion deteriorates algorithm performance. Test results are presented and compared to existing methods. While the accuracy and efficiency of the proposed system is moderately high, its implementation is simple, reducing processing time greatly.
first_indexed	2024-10-01T05:26:11Z
format	Thesis
id	ntu-10356/64815
institution	Nanyang Technological University
language	English
last_indexed	2024-10-01T05:26:11Z
publishDate	2015
record_format	dspace
spelling	ntu-10356/648152023-07-04T15:46:35Z Detecting and tracking objects in RGB-D video Chakrabarty Siddhanta Chan Kap Luk School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Detecting and tracking objects in videos and images is a rapidly growing field of research. Identifying, recognising, detecting and tracking objects such as humans, cars, obstacles etc. has many applications. There are a large number of methods to perform these tasks. They vary in performance, quality of results, type of results, types of raw data and so on. This project aims to detect and track objects, exclusively from depth video. Depth video is a video sequence captured by a Kinect camera with the pixel index values of each frame being the distance of the real point represented by that pixel from the camera. Detection is performed using a morphological segmentation technique called watershed transform. The detection parameters chosen are derived from the objects of interest in the test dataset. Two methods, edge-based detection and region-based detection, are used for pre-processing and the result with the largest detected area is selected. The two methods complement each other in many cases, making the use of both necessary. The objects of interest in the dataset are human beings. Thus various types of situations have been used to test the efficiency of the algorithm, such as crowded areas, noncrowded areas, single object, multiple objects, occluded objects and non-occluded objects. Challenges arise when multiple objects move across a scene. This problem has been addressed with a method for tracking multiple objects with, theoretically, no limit to the maximum number of objects. However, occlusion deteriorates algorithm performance. Test results are presented and compared to existing methods. While the accuracy and efficiency of the proposed system is moderately high, its implementation is simple, reducing processing time greatly. Master of Science (Signal Processing) 2015-06-04T07:13:43Z 2015-06-04T07:13:43Z 2014 2014 Thesis http://hdl.handle.net/10356/64815 en 60 p. application/pdf
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Chakrabarty Siddhanta Detecting and tracking objects in RGB-D video
title	Detecting and tracking objects in RGB-D video
title_full	Detecting and tracking objects in RGB-D video
title_fullStr	Detecting and tracking objects in RGB-D video
title_full_unstemmed	Detecting and tracking objects in RGB-D video
title_short	Detecting and tracking objects in RGB-D video
title_sort	detecting and tracking objects in rgb d video
topic	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
url	http://hdl.handle.net/10356/64815
work_keys_str_mv	AT chakrabartysiddhanta detectingandtrackingobjectsinrgbdvideo

Detecting and tracking objects in RGB-D video

Similar Items