Deep learning based people detection using 3D point cloud

With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the...

Full description

Bibliographic Details
Main Author:	Tan, Kye Min
Other Authors:	Teoh Eam Khwang
Format:	Final Year Project (FYP)
Language:	English
Published:	2018
Subjects:	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Online Access:	http://hdl.handle.net/10356/74956

_version_	1811678881884667904
author	Tan, Kye Min
author2	Teoh Eam Khwang
author_facet	Teoh Eam Khwang Tan, Kye Min
author_sort	Tan, Kye Min
collection	NTU
description	With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the algorithms responsible for detection, tracking and guidance must be robust. Deep learning is a recent field of artificial intelligence which potentially provides such features. By harnessing large amounts of computational power and datasets, deep learning systems can achieve significantly better performance in computer vision tasks such as classification and detection compared to previous methods. The usage of 3D point cloud data allows spatial information to be obtained while overcoming adverse conditions such as poor illumination and complex texture information. This project combines the advantages of deep learning methods and 3D point cloud data to perform people detection, which is a task required of mobile service robots. Depth images from the Microsoft Kinect sensors are converted into 3D point cloud form before being used to train an advanced network known as DenseNet for the task of detecting the presence of people. DenseNet was chosen due to its very deep architecture which allows high performance while its dense connections mitigate the risk of the model overfitting on the limited data available. By training DenseNet on the Darknet framework, it is qualitatively shown that DenseNet can perform better than networks like You Only Look Once (YOLO) on new data while being sufficiently fast to process images in real time.
first_indexed	2024-10-01T03:00:19Z
format	Final Year Project (FYP)
id	ntu-10356/74956
institution	Nanyang Technological University
language	English
last_indexed	2024-10-01T03:00:19Z
publishDate	2018
record_format	dspace
spelling	ntu-10356/749562023-07-07T17:19:26Z Deep learning based people detection using 3D point cloud Tan, Kye Min Teoh Eam Khwang School of Electrical and Electronic Engineering A*STAR Institute for Infocomm Research DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the algorithms responsible for detection, tracking and guidance must be robust. Deep learning is a recent field of artificial intelligence which potentially provides such features. By harnessing large amounts of computational power and datasets, deep learning systems can achieve significantly better performance in computer vision tasks such as classification and detection compared to previous methods. The usage of 3D point cloud data allows spatial information to be obtained while overcoming adverse conditions such as poor illumination and complex texture information. This project combines the advantages of deep learning methods and 3D point cloud data to perform people detection, which is a task required of mobile service robots. Depth images from the Microsoft Kinect sensors are converted into 3D point cloud form before being used to train an advanced network known as DenseNet for the task of detecting the presence of people. DenseNet was chosen due to its very deep architecture which allows high performance while its dense connections mitigate the risk of the model overfitting on the limited data available. By training DenseNet on the Darknet framework, it is qualitatively shown that DenseNet can perform better than networks like You Only Look Once (YOLO) on new data while being sufficiently fast to process images in real time. Bachelor of Engineering 2018-05-25T05:02:26Z 2018-05-25T05:02:26Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74956 en Nanyang Technological University 96 p. application/pdf
spellingShingle	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Tan, Kye Min Deep learning based people detection using 3D point cloud
title	Deep learning based people detection using 3D point cloud
title_full	Deep learning based people detection using 3D point cloud
title_fullStr	Deep learning based people detection using 3D point cloud
title_full_unstemmed	Deep learning based people detection using 3D point cloud
title_short	Deep learning based people detection using 3D point cloud
title_sort	deep learning based people detection using 3d point cloud
topic	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
url	http://hdl.handle.net/10356/74956
work_keys_str_mv	AT tankyemin deeplearningbasedpeopledetectionusing3dpointcloud

Deep learning based people detection using 3D point cloud

Similar Items