Deep learning based people detection using 3D point cloud

With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the...

Full description

Bibliographic Details
Main Author: Tan, Kye Min
Other Authors: Teoh Eam Khwang
Format: Final Year Project (FYP)
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/74956
_version_ 1811678881884667904
author Tan, Kye Min
author2 Teoh Eam Khwang
author_facet Teoh Eam Khwang
Tan, Kye Min
author_sort Tan, Kye Min
collection NTU
description With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the algorithms responsible for detection, tracking and guidance must be robust. Deep learning is a recent field of artificial intelligence which potentially provides such features. By harnessing large amounts of computational power and datasets, deep learning systems can achieve significantly better performance in computer vision tasks such as classification and detection compared to previous methods. The usage of 3D point cloud data allows spatial information to be obtained while overcoming adverse conditions such as poor illumination and complex texture information. This project combines the advantages of deep learning methods and 3D point cloud data to perform people detection, which is a task required of mobile service robots. Depth images from the Microsoft Kinect sensors are converted into 3D point cloud form before being used to train an advanced network known as DenseNet for the task of detecting the presence of people. DenseNet was chosen due to its very deep architecture which allows high performance while its dense connections mitigate the risk of the model overfitting on the limited data available. By training DenseNet on the Darknet framework, it is qualitatively shown that DenseNet can perform better than networks like You Only Look Once (YOLO) on new data while being sufficiently fast to process images in real time.
first_indexed 2024-10-01T03:00:19Z
format Final Year Project (FYP)
id ntu-10356/74956
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:00:19Z
publishDate 2018
record_format dspace
spelling ntu-10356/749562023-07-07T17:19:26Z Deep learning based people detection using 3D point cloud Tan, Kye Min Teoh Eam Khwang School of Electrical and Electronic Engineering A*STAR Institute for Infocomm Research DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the algorithms responsible for detection, tracking and guidance must be robust. Deep learning is a recent field of artificial intelligence which potentially provides such features. By harnessing large amounts of computational power and datasets, deep learning systems can achieve significantly better performance in computer vision tasks such as classification and detection compared to previous methods. The usage of 3D point cloud data allows spatial information to be obtained while overcoming adverse conditions such as poor illumination and complex texture information. This project combines the advantages of deep learning methods and 3D point cloud data to perform people detection, which is a task required of mobile service robots. Depth images from the Microsoft Kinect sensors are converted into 3D point cloud form before being used to train an advanced network known as DenseNet for the task of detecting the presence of people. DenseNet was chosen due to its very deep architecture which allows high performance while its dense connections mitigate the risk of the model overfitting on the limited data available. By training DenseNet on the Darknet framework, it is qualitatively shown that DenseNet can perform better than networks like You Only Look Once (YOLO) on new data while being sufficiently fast to process images in real time. Bachelor of Engineering 2018-05-25T05:02:26Z 2018-05-25T05:02:26Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74956 en Nanyang Technological University 96 p. application/pdf
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Tan, Kye Min
Deep learning based people detection using 3D point cloud
title Deep learning based people detection using 3D point cloud
title_full Deep learning based people detection using 3D point cloud
title_fullStr Deep learning based people detection using 3D point cloud
title_full_unstemmed Deep learning based people detection using 3D point cloud
title_short Deep learning based people detection using 3D point cloud
title_sort deep learning based people detection using 3d point cloud
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
url http://hdl.handle.net/10356/74956
work_keys_str_mv AT tankyemin deeplearningbasedpeopledetectionusing3dpointcloud