Improved Visual SLAM Using Semantic Segmentation and Layout Estimation

The technological advances in computational systems have enabled very complex computer vision and machine learning approaches to perform efficiently and accurately. These new approaches can be considered a new set of tools to reshape the visual SLAM solutions. We present an investigation of the late...

Full description

Bibliographic Details
Main Authors: Ahmed Mahmoud, Mohamed Atia
Format: Article
Language:English
Published: MDPI AG 2022-09-01
Series:Robotics
Subjects:
Online Access:https://www.mdpi.com/2218-6581/11/5/91
_version_ 1797470045448699904
author Ahmed Mahmoud
Mohamed Atia
author_facet Ahmed Mahmoud
Mohamed Atia
author_sort Ahmed Mahmoud
collection DOAJ
description The technological advances in computational systems have enabled very complex computer vision and machine learning approaches to perform efficiently and accurately. These new approaches can be considered a new set of tools to reshape the visual SLAM solutions. We present an investigation of the latest neuroscientific research that explains how the human brain can accurately navigate and map unknown environments. The accuracy suggests that human navigation is not affected by traditional visual odometry drifts resulting from tracking visual features. It utilises the geometrical structures of the surrounding objects within the navigated space. The identified objects and space geometrical shapes anchor the estimated space representation and mitigate the overall drift. Inspired by the human brain’s navigation techniques, this paper presents our efforts to incorporate two machine learning techniques into a VSLAM solution: semantic segmentation and layout estimation to imitate human abilities to map new environments. The proposed system benefits from the geometrical relations between the corner points of the cuboid environments to improve the accuracy of trajectory estimation. Moreover, the implemented SLAM solution semantically groups the map points and then tracks each group independently to limit the system drift. The implemented solution yielded higher trajectory accuracy and immunity to large pure rotations.
first_indexed 2024-03-09T19:32:13Z
format Article
id doaj.art-ec32250656f846bab09ef8ca9977b6a8
institution Directory Open Access Journal
issn 2218-6581
language English
last_indexed 2024-03-09T19:32:13Z
publishDate 2022-09-01
publisher MDPI AG
record_format Article
series Robotics
spelling doaj.art-ec32250656f846bab09ef8ca9977b6a82023-11-24T02:23:26ZengMDPI AGRobotics2218-65812022-09-011159110.3390/robotics11050091Improved Visual SLAM Using Semantic Segmentation and Layout EstimationAhmed Mahmoud0Mohamed Atia1Department of Systems and Computer Engineering, Carleton University, Ottawa, ON K1S 5B6, CanadaDepartment of Systems and Computer Engineering, Carleton University, Ottawa, ON K1S 5B6, CanadaThe technological advances in computational systems have enabled very complex computer vision and machine learning approaches to perform efficiently and accurately. These new approaches can be considered a new set of tools to reshape the visual SLAM solutions. We present an investigation of the latest neuroscientific research that explains how the human brain can accurately navigate and map unknown environments. The accuracy suggests that human navigation is not affected by traditional visual odometry drifts resulting from tracking visual features. It utilises the geometrical structures of the surrounding objects within the navigated space. The identified objects and space geometrical shapes anchor the estimated space representation and mitigate the overall drift. Inspired by the human brain’s navigation techniques, this paper presents our efforts to incorporate two machine learning techniques into a VSLAM solution: semantic segmentation and layout estimation to imitate human abilities to map new environments. The proposed system benefits from the geometrical relations between the corner points of the cuboid environments to improve the accuracy of trajectory estimation. Moreover, the implemented SLAM solution semantically groups the map points and then tracks each group independently to limit the system drift. The implemented solution yielded higher trajectory accuracy and immunity to large pure rotations.https://www.mdpi.com/2218-6581/11/5/91visual SLAMlayout estimationsemantic SLAMvisual navigation
spellingShingle Ahmed Mahmoud
Mohamed Atia
Improved Visual SLAM Using Semantic Segmentation and Layout Estimation
Robotics
visual SLAM
layout estimation
semantic SLAM
visual navigation
title Improved Visual SLAM Using Semantic Segmentation and Layout Estimation
title_full Improved Visual SLAM Using Semantic Segmentation and Layout Estimation
title_fullStr Improved Visual SLAM Using Semantic Segmentation and Layout Estimation
title_full_unstemmed Improved Visual SLAM Using Semantic Segmentation and Layout Estimation
title_short Improved Visual SLAM Using Semantic Segmentation and Layout Estimation
title_sort improved visual slam using semantic segmentation and layout estimation
topic visual SLAM
layout estimation
semantic SLAM
visual navigation
url https://www.mdpi.com/2218-6581/11/5/91
work_keys_str_mv AT ahmedmahmoud improvedvisualslamusingsemanticsegmentationandlayoutestimation
AT mohamedatia improvedvisualslamusingsemanticsegmentationandlayoutestimation