Improved Visual SLAM Using Semantic Segmentation and Layout Estimation
The technological advances in computational systems have enabled very complex computer vision and machine learning approaches to perform efficiently and accurately. These new approaches can be considered a new set of tools to reshape the visual SLAM solutions. We present an investigation of the late...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-09-01
|
Series: | Robotics |
Subjects: | |
Online Access: | https://www.mdpi.com/2218-6581/11/5/91 |
_version_ | 1797470045448699904 |
---|---|
author | Ahmed Mahmoud Mohamed Atia |
author_facet | Ahmed Mahmoud Mohamed Atia |
author_sort | Ahmed Mahmoud |
collection | DOAJ |
description | The technological advances in computational systems have enabled very complex computer vision and machine learning approaches to perform efficiently and accurately. These new approaches can be considered a new set of tools to reshape the visual SLAM solutions. We present an investigation of the latest neuroscientific research that explains how the human brain can accurately navigate and map unknown environments. The accuracy suggests that human navigation is not affected by traditional visual odometry drifts resulting from tracking visual features. It utilises the geometrical structures of the surrounding objects within the navigated space. The identified objects and space geometrical shapes anchor the estimated space representation and mitigate the overall drift. Inspired by the human brain’s navigation techniques, this paper presents our efforts to incorporate two machine learning techniques into a VSLAM solution: semantic segmentation and layout estimation to imitate human abilities to map new environments. The proposed system benefits from the geometrical relations between the corner points of the cuboid environments to improve the accuracy of trajectory estimation. Moreover, the implemented SLAM solution semantically groups the map points and then tracks each group independently to limit the system drift. The implemented solution yielded higher trajectory accuracy and immunity to large pure rotations. |
first_indexed | 2024-03-09T19:32:13Z |
format | Article |
id | doaj.art-ec32250656f846bab09ef8ca9977b6a8 |
institution | Directory Open Access Journal |
issn | 2218-6581 |
language | English |
last_indexed | 2024-03-09T19:32:13Z |
publishDate | 2022-09-01 |
publisher | MDPI AG |
record_format | Article |
series | Robotics |
spelling | doaj.art-ec32250656f846bab09ef8ca9977b6a82023-11-24T02:23:26ZengMDPI AGRobotics2218-65812022-09-011159110.3390/robotics11050091Improved Visual SLAM Using Semantic Segmentation and Layout EstimationAhmed Mahmoud0Mohamed Atia1Department of Systems and Computer Engineering, Carleton University, Ottawa, ON K1S 5B6, CanadaDepartment of Systems and Computer Engineering, Carleton University, Ottawa, ON K1S 5B6, CanadaThe technological advances in computational systems have enabled very complex computer vision and machine learning approaches to perform efficiently and accurately. These new approaches can be considered a new set of tools to reshape the visual SLAM solutions. We present an investigation of the latest neuroscientific research that explains how the human brain can accurately navigate and map unknown environments. The accuracy suggests that human navigation is not affected by traditional visual odometry drifts resulting from tracking visual features. It utilises the geometrical structures of the surrounding objects within the navigated space. The identified objects and space geometrical shapes anchor the estimated space representation and mitigate the overall drift. Inspired by the human brain’s navigation techniques, this paper presents our efforts to incorporate two machine learning techniques into a VSLAM solution: semantic segmentation and layout estimation to imitate human abilities to map new environments. The proposed system benefits from the geometrical relations between the corner points of the cuboid environments to improve the accuracy of trajectory estimation. Moreover, the implemented SLAM solution semantically groups the map points and then tracks each group independently to limit the system drift. The implemented solution yielded higher trajectory accuracy and immunity to large pure rotations.https://www.mdpi.com/2218-6581/11/5/91visual SLAMlayout estimationsemantic SLAMvisual navigation |
spellingShingle | Ahmed Mahmoud Mohamed Atia Improved Visual SLAM Using Semantic Segmentation and Layout Estimation Robotics visual SLAM layout estimation semantic SLAM visual navigation |
title | Improved Visual SLAM Using Semantic Segmentation and Layout Estimation |
title_full | Improved Visual SLAM Using Semantic Segmentation and Layout Estimation |
title_fullStr | Improved Visual SLAM Using Semantic Segmentation and Layout Estimation |
title_full_unstemmed | Improved Visual SLAM Using Semantic Segmentation and Layout Estimation |
title_short | Improved Visual SLAM Using Semantic Segmentation and Layout Estimation |
title_sort | improved visual slam using semantic segmentation and layout estimation |
topic | visual SLAM layout estimation semantic SLAM visual navigation |
url | https://www.mdpi.com/2218-6581/11/5/91 |
work_keys_str_mv | AT ahmedmahmoud improvedvisualslamusingsemanticsegmentationandlayoutestimation AT mohamedatia improvedvisualslamusingsemanticsegmentationandlayoutestimation |