Hybrid SLAM and object recognition on an embedded platform
Simultaneous Localization and Mapping (SLAM) is a key localization technique for systems such as autonomous agents. Visual SLAM is a subset of SLAM that analyzes visual information captured by cameras, enabling the estimation of the camera's position and orientation while simultaneously constructing a map of the environment.
Main Author: | Low, Timothy Zhi Hao |
---|---|
Other Authors: | Lam Siew Kei |
Format: | Final Year Project (FYP) |
Language: | English |
Published: | Nanyang Technological University, 2023 |
Subjects: | Engineering::Computer science and engineering |
Online Access: | https://hdl.handle.net/10356/171979 |
_version_ | 1824454551971823616 |
---|---|
author | Low, Timothy Zhi Hao |
author2 | Lam Siew Kei |
author_facet | Lam Siew Kei Low, Timothy Zhi Hao |
author_sort | Low, Timothy Zhi Hao |
collection | NTU |
description | Simultaneous Localization and Mapping (SLAM) is a key localization technique for systems
such as autonomous agents. Visual SLAM is a subset of SLAM that analyzes visual
information captured by cameras, enabling the estimation of the camera's position and
orientation while simultaneously constructing a map of the environment.
Conventional visual SLAM mainly produces sparse point clouds, which fail to capture a
comprehensive representation of the environment. The resulting maps therefore lack semantic
information about the environment, which limits scene understanding.
Autonomous agents that utilize visual SLAM are also commonly deployed on embedded
systems. The objective of this project is therefore to implement a hybrid SLAM system on an
embedded platform that combines a conventional SLAM algorithm with semantic
segmentation to construct dense semantic maps.
The hybrid SLAM system is implemented and tested on the NVIDIA Jetson Xavier NX
embedded system together with the ZED 2 stereo camera for input. ORB-SLAM3, a well-established
visual SLAM algorithm, was chosen for estimating the camera poses. The SCALE semantic
segmentation network is used to perform semantic segmentation on the images provided by
the ZED 2 camera. Kimera-Semantics then processes the camera poses, semantic images,
and depth images to generate a dense semantic map. (A simplified sketch of this fusion
pipeline appears after the record below.) |
first_indexed | 2025-02-19T03:24:07Z |
format | Final Year Project (FYP) |
id | ntu-10356/171979 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2025-02-19T03:24:07Z |
publishDate | 2023 |
publisher | Nanyang Technological University |
record_format | dspace |
spelling | ntu-10356/1719792023-11-24T15:38:04Z Hybrid SLAM and object recognition on an embedded platform Low, Timothy Zhi Hao Lam Siew Kei School of Computer Science and Engineering ASSKLam@ntu.edu.sg Engineering::Computer science and engineering Simultaneous Localization and Mapping (SLAM) is a key localization technique for systems such as autonomous agents. Visual SLAM is a subset of SLAM that analyzes visual information captured by cameras, enabling the estimation of the camera's position and orientation while simultaneously constructing a map of the environment. Conventional visual SLAM mainly produces sparse point clouds, which fail to capture a comprehensive representation of the environment. The resulting maps therefore lack semantic information about the environment, which limits scene understanding. Autonomous agents that utilize visual SLAM are also commonly deployed on embedded systems. The objective of this project is therefore to implement a hybrid SLAM system on an embedded platform that combines a conventional SLAM algorithm with semantic segmentation to construct dense semantic maps. The hybrid SLAM system is implemented and tested on the NVIDIA Jetson Xavier NX embedded system together with the ZED 2 stereo camera for input. ORB-SLAM3, a well-established visual SLAM algorithm, was chosen for estimating the camera poses. The SCALE semantic segmentation network is used to perform semantic segmentation on the images provided by the ZED 2 camera. Kimera-Semantics then processes the camera poses, semantic images, and depth images to generate a dense semantic map. Bachelor of Engineering (Computer Science) 2023-11-20T02:54:14Z 2023-11-20T02:54:14Z 2023 Final Year Project (FYP) Low, T. Z. H. (2023). Hybrid SLAM and object recognition on an embedded platform. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/171979 https://hdl.handle.net/10356/171979 en SCSE22-0772 application/pdf Nanyang Technological University |
spellingShingle | Engineering::Computer science and engineering Low, Timothy Zhi Hao Hybrid SLAM and object recognition on an embedded platform |
title | Hybrid SLAM and object recognition on an embedded platform |
title_full | Hybrid SLAM and object recognition on an embedded platform |
title_fullStr | Hybrid SLAM and object recognition on an embedded platform |
title_full_unstemmed | Hybrid SLAM and object recognition on an embedded platform |
title_short | Hybrid SLAM and object recognition on an embedded platform |
title_sort | hybrid slam and object recognition on an embedded platform |
topic | Engineering::Computer science and engineering |
url | https://hdl.handle.net/10356/171979 |
work_keys_str_mv | AT lowtimothyzhihao hybridslamandobjectrecognitiononanembeddedplatform |
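
The abstract describes a pipeline in which camera poses (from ORB-SLAM3), depth images (from the ZED 2), and per-pixel semantic labels (from the segmentation network) are fused into a dense semantic map by Kimera-Semantics. The following is a minimal NumPy sketch of that fusion idea only; the voxel size, intrinsics, and helper names (`back_project`, `integrate_frame`) are illustrative assumptions and do not reproduce the project's actual implementation.

```python
# Minimal sketch: fuse pose + depth + semantic labels into a labelled voxel map.
# All parameter values and helper names are assumptions for illustration.

import numpy as np
from collections import defaultdict

VOXEL_SIZE = 0.05  # assumed map resolution in metres per voxel


def back_project(depth, fx, fy, cx, cy):
    """Convert a depth image (H x W, metres) into camera-frame 3D points."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.stack([x, y, depth], axis=-1).reshape(-1, 3)


def integrate_frame(voxel_labels, T_wc, depth, labels, intrinsics):
    """Fuse one frame's depth and semantic labels into the voxel map.

    voxel_labels : dict voxel index -> per-class vote counts
    T_wc         : 4x4 camera-to-world pose from the SLAM front end
    depth        : H x W depth image in metres
    labels       : H x W integer class labels from the segmentation network
    """
    fx, fy, cx, cy = intrinsics
    pts_cam = back_project(depth, fx, fy, cx, cy)
    valid = np.isfinite(pts_cam[:, 2]) & (pts_cam[:, 2] > 0)
    pts_cam = pts_cam[valid]
    lbl = labels.reshape(-1)[valid]

    # Transform the points into the world frame using the estimated pose.
    pts_world = (T_wc[:3, :3] @ pts_cam.T).T + T_wc[:3, 3]

    # Vote each point's class label into the voxel it falls in; a voxel's
    # semantic label is then the class with the most votes.
    idx = np.floor(pts_world / VOXEL_SIZE).astype(np.int64)
    for voxel, c in zip(map(tuple, idx), lbl):
        voxel_labels[voxel][int(c)] += 1


if __name__ == "__main__":
    # Synthetic placeholder inputs; in the real system these come from the
    # stereo camera, the SLAM tracker, and the segmentation network.
    rng = np.random.default_rng(0)
    voxel_labels = defaultdict(lambda: defaultdict(int))
    intrinsics = (350.0, 350.0, 64.0, 36.0)       # assumed values for 128x72 images
    for _ in range(3):
        T_wc = np.eye(4)                          # identity pose for the demo
        depth = rng.uniform(0.5, 5.0, (72, 128))  # metres
        labels = rng.integers(0, 20, (72, 128))   # 20 assumed classes
        integrate_frame(voxel_labels, T_wc, depth, labels, intrinsics)
    print(f"map now holds {len(voxel_labels)} labelled voxels")
```

In the actual project, this integration step is handled by Kimera-Semantics, which performs TSDF-based volumetric fusion rather than the simple per-voxel majority vote shown here; the sketch only illustrates how segmentation labels can be attached to geometry reconstructed from poses and depth.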