RGB-D Object SLAM Using Quadrics for Indoor Environments
Indoor service robots need to build an object-centric semantic map to understand and execute human instructions. Conventional visual simultaneous localization and mapping (SLAM) systems build a map using geometric features such as points, lines, and planes as landmarks. However, they lack a semantic...
Main Authors: | Ziwei Liao, Wei Wang, Xianyu Qi, Xiaoyu Zhang |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2020-09-01 |
Series: | Sensors |
Subjects: | semantic SLAM; object SLAM; quadrics; RGB-D; mobile robots; data association |
Online Access: | https://www.mdpi.com/1424-8220/20/18/5150 |
_version_ | 1797554131295010816 |
---|---|
author | Ziwei Liao, Wei Wang, Xianyu Qi, Xiaoyu Zhang |
author_facet | Ziwei Liao, Wei Wang, Xianyu Qi, Xiaoyu Zhang |
author_sort | Ziwei Liao |
collection | DOAJ |
description | Indoor service robots need to build an object-centric semantic map to understand and execute human instructions. Conventional visual simultaneous localization and mapping (SLAM) systems build a map using geometric features such as points, lines, and planes as landmarks, but they lack a semantic understanding of the environment. This paper proposes an object-level semantic SLAM algorithm based on RGB-D data, which uses a quadric surface as an object model to compactly represent an object's position, orientation, and shape. It proposes and derives two types of RGB-D camera-quadric observation models: a complete model and a partial model. The complete model combines object detection and point cloud data to estimate a complete ellipsoid from a single RGB-D frame. The partial model is activated when depth data is severely missing because of illumination or occlusion; it uses bounding boxes from object detection to constrain objects. Compared with state-of-the-art quadric SLAM algorithms that use a monocular observation model, the RGB-D observation model needs fewer observations and smaller viewing-angle changes, which helps improve accuracy and robustness. The paper introduces a nonparametric pose graph to solve data association in the back end and applies it, as a novel contribution, to the quadric surface model. The algorithm was evaluated on two public datasets and an author-collected mobile robot dataset in a home-like environment, and showed clear improvements in localization accuracy and mapping quality compared with two state-of-the-art object SLAM algorithms. |
first_indexed | 2024-03-10T16:27:24Z |
format | Article |
id | doaj.art-68f773f8936743c38522d97d4544b318 |
institution | Directory Open Access Journal |
issn | 1424-8220 |
language | English |
last_indexed | 2024-03-10T16:27:24Z |
publishDate | 2020-09-01 |
publisher | MDPI AG |
record_format | Article |
series | Sensors |
spelling | doaj.art-68f773f8936743c38522d97d4544b318; 2023-11-20T13:08:29Z; eng; MDPI AG; Sensors; ISSN 1424-8220; 2020-09-01; vol. 20, no. 18, article 5150; DOI 10.3390/s20185150; RGB-D Object SLAM Using Quadrics for Indoor Environments; Ziwei Liao, Wei Wang, Xianyu Qi, Xiaoyu Zhang (all: Robotics Institute, Beihang University, Beijing 100191, China); https://www.mdpi.com/1424-8220/20/18/5150; semantic SLAM; object SLAM; quadrics; RGB-D; mobile robots; data association |
spellingShingle | Ziwei Liao; Wei Wang; Xianyu Qi; Xiaoyu Zhang; RGB-D Object SLAM Using Quadrics for Indoor Environments; Sensors; semantic SLAM; object SLAM; quadrics; RGB-D; mobile robots; data association |
title | RGB-D Object SLAM Using Quadrics for Indoor Environments |
title_full | RGB-D Object SLAM Using Quadrics for Indoor Environments |
title_fullStr | RGB-D Object SLAM Using Quadrics for Indoor Environments |
title_full_unstemmed | RGB-D Object SLAM Using Quadrics for Indoor Environments |
title_short | RGB-D Object SLAM Using Quadrics for Indoor Environments |
title_sort | rgb d object slam using quadrics for indoor environments |
topic | semantic SLAM; object SLAM; quadrics; RGB-D; mobile robots; data association |
url | https://www.mdpi.com/1424-8220/20/18/5150 |
work_keys_str_mv | AT ziweiliao rgbdobjectslamusingquadricsforindoorenvironments AT weiwang rgbdobjectslamusingquadricsforindoorenvironments AT xianyuqi rgbdobjectslamusingquadricsforindoorenvironments AT xiaoyuzhang rgbdobjectslamusingquadricsforindoorenvironments |
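
The description above says that a quadric compactly encodes an object's position, orientation, and shape, and that bounding boxes from object detection can constrain it. The record does not give the paper's equations, so the following is only a minimal sketch of the dual-quadric formulation commonly used in quadric SLAM work, not the authors' implementation. The function names (`dual_quadric`, `project_bbox`), the yaw-only rotation, and the example camera matrix are all assumptions made for illustration.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(row) for row in zip(*a)]


def dual_quadric(center, yaw, semi_axes):
    """Build the 4x4 dual quadric Q* = Z * diag(a^2, b^2, c^2, -1) * Z^T.

    Assumption: orientation is reduced to a yaw angle about the z-axis;
    a full implementation would use a general 3D rotation.
    """
    cos_y, sin_y = math.cos(yaw), math.sin(yaw)
    tx, ty, tz = center
    a, b, c = semi_axes
    # Homogeneous pose of the ellipsoid (rotation + translation).
    pose = [[cos_y, -sin_y, 0.0, tx],
            [sin_y, cos_y, 0.0, ty],
            [0.0, 0.0, 1.0, tz],
            [0.0, 0.0, 0.0, 1.0]]
    shape = [[a * a, 0.0, 0.0, 0.0],
             [0.0, b * b, 0.0, 0.0],
             [0.0, 0.0, c * c, 0.0],
             [0.0, 0.0, 0.0, -1.0]]
    return matmul(matmul(pose, shape), transpose(pose))


def project_bbox(quadric, projection):
    """Project a dual quadric with a 3x4 camera matrix P and return the
    axis-aligned box (u_min, v_min, u_max, v_max) tangent to the conic.

    The dual conic is C* = P Q* P^T. A vertical line u = const is tangent
    when C11 - 2*u*C13 + u^2*C33 = 0, and likewise for v with C22, C23.
    """
    conic = matmul(matmul(projection, quadric), transpose(projection))
    c33 = conic[2][2]
    if c33 == 0.0:
        raise ValueError("degenerate projection: C33 is zero")

    def extent(i):
        disc = conic[i][2] ** 2 - conic[i][i] * c33
        if disc < 0.0:
            raise ValueError("no real tangent lines (camera inside object?)")
        root = math.sqrt(disc)
        lo = (conic[i][2] - root) / c33
        hi = (conic[i][2] + root) / c33
        return min(lo, hi), max(lo, hi)

    u_min, u_max = extent(0)
    v_min, v_max = extent(1)
    return u_min, v_min, u_max, v_max


# Example: a sphere of radius 3 placed 5 units in front of a camera at the
# origin, with an assumed focal length of 400 px and principal point (320, 240).
Q = dual_quadric((0.0, 0.0, 5.0), 0.0, (3.0, 3.0, 3.0))
P = [[400.0, 0.0, 320.0, 0.0],
     [0.0, 400.0, 240.0, 0.0],
     [0.0, 0.0, 1.0, 0.0]]
print(project_bbox(Q, P))
```

In a bounding-box observation model of this kind, the difference between the predicted box and the detector's box would serve as the residual for the object landmark. How the paper's partial model weights or truncates that residual is not stated in this record.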