RGB-D Object SLAM Using Quadrics for Indoor Environments

Indoor service robots need to build an object-centric semantic map to understand and execute human instructions. Conventional visual simultaneous localization and mapping (SLAM) systems build a map using geometric features such as points, lines, and planes as landmarks. However, they lack a semantic...


Bibliographic Details
Main Authors: Ziwei Liao, Wei Wang, Xianyu Qi, Xiaoyu Zhang
Format: Article
Language: English
Published: MDPI AG 2020-09-01
Series: Sensors
Subjects: semantic SLAM; object SLAM; quadrics; RGB-D; mobile robots; data association
Online Access: https://www.mdpi.com/1424-8220/20/18/5150
_version_ 1797554131295010816
author Ziwei Liao
Wei Wang
Xianyu Qi
Xiaoyu Zhang
author_facet Ziwei Liao
Wei Wang
Xianyu Qi
Xiaoyu Zhang
author_sort Ziwei Liao
collection DOAJ
description Indoor service robots need to build an object-centric semantic map to understand and execute human instructions. Conventional visual simultaneous localization and mapping (SLAM) systems build a map using geometric features such as points, lines, and planes as landmarks. However, they lack a semantic understanding of the environment. This paper proposes an object-level semantic SLAM algorithm based on RGB-D data, which uses a quadric surface as an object model to compactly represent the object’s position, orientation, and shape. The paper proposes and derives two types of RGB-D camera-quadric observation models: a complete model and a partial model. The complete model combines object detection and point cloud data to estimate a complete ellipsoid in a single RGB-D frame. The partial model is activated when depth data is severely missing because of illumination or occlusion, and uses bounding boxes from object detection to constrain objects. Compared with state-of-the-art quadric SLAM algorithms that use a monocular observation model, the RGB-D observation model requires fewer observations and smaller viewing-angle changes, which helps improve accuracy and robustness. The paper introduces a nonparametric pose graph to solve data association in the back end and, as a novel step, applies it to the quadric surface model. We thoroughly evaluated the algorithm on two public datasets and an author-collected mobile robot dataset in a home-like environment. We obtained clear improvements in localization accuracy and mapping quality compared with two state-of-the-art object SLAM algorithms.
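The description says a quadric compactly encodes an object's position, orientation, and shape, and that bounding boxes from object detection can constrain it. The sketch below illustrates that idea using the standard dual-quadric formulation from projective geometry. It is not necessarily the authors' exact observation model. The function names (`dual_quadric`, `project_to_bbox`), the camera intrinsics, and the sphere example are illustrative assumptions, and the code assumes the ellipsoid lies entirely in front of the camera.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Transpose a matrix given as nested lists."""
    return [list(col) for col in zip(*m)]


def dual_quadric(rotation, center, semi_axes):
    """Build the 4x4 dual quadric Q* = Z diag(a^2, b^2, c^2, -1) Z^T.

    rotation  -- 3x3 rotation of the ellipsoid (orientation)
    center    -- (x, y, z) of the ellipsoid (position)
    semi_axes -- (a, b, c) semi-axis lengths (shape)
    """
    # Z is the homogeneous pose of the ellipsoid.
    z = [list(rotation[i]) + [center[i]] for i in range(3)]
    z.append([0.0, 0.0, 0.0, 1.0])
    a, b, c = semi_axes
    d = [
        [a * a, 0.0, 0.0, 0.0],
        [0.0, b * b, 0.0, 0.0],
        [0.0, 0.0, c * c, 0.0],
        [0.0, 0.0, 0.0, -1.0],
    ]
    return matmul(matmul(z, d), transpose(z))


def project_to_bbox(q_dual, projection):
    """Project a dual quadric with a 3x4 camera matrix P and return the
    axis-aligned box (u_min, v_min, u_max, v_max) around the image ellipse.

    The dual conic is C* = P Q* P^T. A vertical line u = const is tangent
    to the conic when C*[0][0] - 2 u C*[0][2] + u^2 C*[2][2] = 0, and the
    same holds for horizontal lines with index 1.
    """
    c = matmul(matmul(projection, q_dual), transpose(projection))

    def extent(i):
        # Assumes a positive discriminant (ellipsoid in front of the camera).
        root = math.sqrt(c[i][2] ** 2 - c[i][i] * c[2][2])
        return sorted(((c[i][2] - root) / c[2][2], (c[i][2] + root) / c[2][2]))

    (u_min, u_max), (v_min, v_max) = extent(0), extent(1)
    return u_min, v_min, u_max, v_max


# Hypothetical example: unit sphere 5 m in front of a camera at the origin.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
sphere = dual_quadric(identity, (0.0, 0.0, 5.0), (1.0, 1.0, 1.0))
camera = [[500.0, 0.0, 320.0, 0.0], [0.0, 500.0, 240.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
print(project_to_bbox(sphere, camera))
```

In a bounding-box constraint of this kind, the predicted box would be compared with the detector's box to form a residual. How the paper weights or combines that residual with depth data is not stated in this record.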
first_indexed 2024-03-10T16:27:24Z
format Article
id doaj.art-68f773f8936743c38522d97d4544b318
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T16:27:24Z
publishDate 2020-09-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-68f773f8936743c38522d97d4544b318
2023-11-20T13:08:29Z
eng
MDPI AG
Sensors
1424-8220
2020-09-01
20
18
5150
10.3390/s20185150
RGB-D Object SLAM Using Quadrics for Indoor Environments
Ziwei Liao, Robotics Institute, Beihang University, Beijing 100191, China
Wei Wang, Robotics Institute, Beihang University, Beijing 100191, China
Xianyu Qi, Robotics Institute, Beihang University, Beijing 100191, China
Xiaoyu Zhang, Robotics Institute, Beihang University, Beijing 100191, China
Indoor service robots need to build an object-centric semantic map to understand and execute human instructions. Conventional visual simultaneous localization and mapping (SLAM) systems build a map using geometric features such as points, lines, and planes as landmarks. However, they lack a semantic understanding of the environment. This paper proposes an object-level semantic SLAM algorithm based on RGB-D data, which uses a quadric surface as an object model to compactly represent the object’s position, orientation, and shape. The paper proposes and derives two types of RGB-D camera-quadric observation models: a complete model and a partial model. The complete model combines object detection and point cloud data to estimate a complete ellipsoid in a single RGB-D frame. The partial model is activated when depth data is severely missing because of illumination or occlusion, and uses bounding boxes from object detection to constrain objects. Compared with state-of-the-art quadric SLAM algorithms that use a monocular observation model, the RGB-D observation model requires fewer observations and smaller viewing-angle changes, which helps improve accuracy and robustness. The paper introduces a nonparametric pose graph to solve data association in the back end and, as a novel step, applies it to the quadric surface model. We thoroughly evaluated the algorithm on two public datasets and an author-collected mobile robot dataset in a home-like environment. We obtained clear improvements in localization accuracy and mapping quality compared with two state-of-the-art object SLAM algorithms.
https://www.mdpi.com/1424-8220/20/18/5150
semantic SLAM
object SLAM
quadrics
RGB-D
mobile robots
data association
title RGB-D Object SLAM Using Quadrics for Indoor Environments
title_sort rgb d object slam using quadrics for indoor environments
topic semantic SLAM
object SLAM
quadrics
RGB-D
mobile robots
data association
url https://www.mdpi.com/1424-8220/20/18/5150