Hydra: A Spatial Perception Engine for Constructing and Optimizing 3D Scene Graphs in Real-time
3D scene graphs have emerged as a powerful map representation for robotics. Scene graphs encode spatial and semantic concepts at multiple levels of abstraction as nodes in a graph, and use edges to represent relationships. Such representations offer an efficient way to model diverse environments, an...
Autor principal: | |
---|---|
Otros Autores: | |
Formato: | Tesis |
Publicado: |
Massachusetts Institute of Technology
2022
|
Acceso en línea: | https://hdl.handle.net/1721.1/145096 https://orcid.org/0000-0002-1201-7032 |
_version_ | 1826212393538027520 |
---|---|
author | Hughes, Nathan |
author2 | Carlone, Luca |
author_facet | Carlone, Luca Hughes, Nathan |
author_sort | Hughes, Nathan |
collection | MIT |
description | 3D scene graphs have emerged as a powerful map representation for robotics. Scene graphs encode spatial and semantic concepts at multiple levels of abstraction as nodes in a graph, and use edges to represent relationships. Such representations offer an efficient way to model diverse environments, and can be used as an aid for planning tasks requiring semantic knowledge. However, current approaches that construct scene graphs lack the ability to operate in real-time on robots. This thesis addresses this research gap, investigating how to build scene graphs from sensor data.
We first introduce the concept of a 3D scene graph, and then detail our contributions to Kimera, the first work to build a hierarchical 3D scene graph directly from visual-inertial sensor data in post-processing. We discuss experiments that explore the runtime performance of Kimera, and produce scene graphs for real-life environments.
We then introduce Hydra, a real-time spatial perception system that can construct 3D scene graphs incrementally from visual-inertial sensor data, overcoming several limitations of Kimera. We also propose an approach for enhancing traditional vision-based loop closure detection using scene graphs, and detail the first method for correcting a scene graph in response to odometric drift. We provide an extensive evaluation of Hydra, including comparing the produced scene graphs to the scene graphs produced by Kimera. We discuss the suitability of Hydra for usage on mobile robots; towards this we show a scene graph produced by data collected on a real robot and evaluate the runtime performance of Hydra on an embedded processor.
Finally, we turn to applications of Hydra to planning and navigation tasks in robotics. We detail how 3D scene graphs can be used as observations in a reinforcement learning framework. Experiments evaluating this framework show that the hierarchy of the scene graph improves the effectiveness of the learned policy. We then discuss the role that Hydra could play in deploying learned policies to real robots. |
first_indexed | 2024-09-23T15:21:00Z |
format | Thesis |
id | mit-1721.1/145096 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T15:21:00Z |
publishDate | 2022 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1450962022-08-30T03:10:13Z Hydra: A Spatial Perception Engine for Constructing and Optimizing 3D Scene Graphs in Real-time Hughes, Nathan Carlone, Luca Massachusetts Institute of Technology. Department of Aeronautics and Astronautics 3D scene graphs have emerged as a powerful map representation for robotics. Scene graphs encode spatial and semantic concepts at multiple levels of abstraction as nodes in a graph, and use edges to represent relationships. Such representations offer an efficient way to model diverse environments, and can be used as an aid for planning tasks requiring semantic knowledge. However, current approaches that construct scene graphs lack the ability to operate in real-time on robots. This thesis addresses this research gap, investigating how to build scene graphs from sensor data. We first introduce the concept of a 3D scene graph, and then detail our contributions to Kimera, the first work to build a hierarchical 3D scene graph directly from visual-inertial sensor data in post-processing. We discuss experiments that explore the runtime performance of Kimera, and produce scene graphs for real-life environments. We then introduce Hydra, a real-time spatial perception system that can construct 3D scene graphs incrementally from visual-inertial sensor data, overcoming several limitations of Kimera. We also propose an approach for enhancing traditional vision-based loop closure detection using scene graphs, and detail the first method for correcting a scene graph in response to odometric drift. We provide an extensive evaluation of Hydra, including comparing the produced scene graphs to the scene graphs produced by Kimera. We discuss the suitability of Hydra for usage on mobile robots; towards this we show a scene graph produced by data collected on a real robot and evaluate the runtime performance of Hydra on an embedded processor. Finally, we turn to applications of Hydra to planning and navigation tasks in robotics. We detail how 3D scene graphs can be used as observations in a reinforcement learning framework. Experiments evaluating this framework show that the hierarchy of the scene graph improves the effectiveness of the learned policy. We then discuss the role that Hydra could play in deploying learned policies to real robots. S.M. 2022-08-29T16:32:37Z 2022-08-29T16:32:37Z 2022-05 2022-06-09T16:14:26.517Z Thesis https://hdl.handle.net/1721.1/145096 https://orcid.org/0000-0002-1201-7032 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology |
spellingShingle | Hughes, Nathan Hydra: A Spatial Perception Engine for Constructing and Optimizing 3D Scene Graphs in Real-time |
title | Hydra: A Spatial Perception Engine for Constructing and Optimizing 3D Scene Graphs in Real-time |
title_full | Hydra: A Spatial Perception Engine for Constructing and Optimizing 3D Scene Graphs in Real-time |
title_fullStr | Hydra: A Spatial Perception Engine for Constructing and Optimizing 3D Scene Graphs in Real-time |
title_full_unstemmed | Hydra: A Spatial Perception Engine for Constructing and Optimizing 3D Scene Graphs in Real-time |
title_short | Hydra: A Spatial Perception Engine for Constructing and Optimizing 3D Scene Graphs in Real-time |
title_sort | hydra a spatial perception engine for constructing and optimizing 3d scene graphs in real time |
url | https://hdl.handle.net/1721.1/145096 https://orcid.org/0000-0002-1201-7032 |
work_keys_str_mv | AT hughesnathan hydraaspatialperceptionengineforconstructingandoptimizing3dscenegraphsinrealtime |