InfiniTAM v3: a framework for large-scale 3D reconstruction with loop closure

Volumetric models have become a popular representation for 3D scenes in recent years. One breakthrough leading to their popularity was KinectFusion, which focuses on 3D reconstruction using RGB-D sensors. However, monocular SLAM has since also been tackled with very similar approaches. Representing...

Full description

Bibliographic Details
Main Authors:	Prisacariu, VA, Kähler, O, Golodetz, S, Sapienza, M, Cavallari, T, Torr, P, Murray, DW
Format:	Internet publication
Language:	English
Published:	ArXiv 2017

_version_	1811139696304062464
author	Prisacariu, VA Kähler, O Golodetz, S Sapienza, M Cavallari, T Torr, P Murray, DW
author_facet	Prisacariu, VA Kähler, O Golodetz, S Sapienza, M Cavallari, T Torr, P Murray, DW
author_sort	Prisacariu, VA
collection	OXFORD
description	Volumetric models have become a popular representation for 3D scenes in recent years. One breakthrough leading to their popularity was KinectFusion, which focuses on 3D reconstruction using RGB-D sensors. However, monocular SLAM has since also been tackled with very similar approaches. Representing the reconstruction volumetrically as a TSDF leads to most of the simplicity and efficiency that can be achieved with GPU implementations of these systems. However, this representation is memory-intensive and limits applicability to small-scale reconstructions. Several avenues have been explored to overcome this. With the aim of summarizing them and providing for a fast, flexible 3D reconstruction pipeline, we propose a new, unifying framework called InfiniTAM. The idea is that steps like camera tracking, scene representation and integration of new data can easily be replaced and adapted to the user's needs. This report describes the technical implementation details of InfiniTAM v3, the third version of our InfiniTAM system. We have added various new features, as well as making numerous enhancements to the low-level code that significantly improve our camera tracking performance. The new features that we expect to be of most interest are (i) a robust camera tracking module; (ii) an implementation of Glocker et al.'s keyframe-based random ferns camera relocaliser; (iii) a novel approach to globally-consistent TSDF-based reconstruction, based on dividing the scene into rigid submaps and optimising the relative poses between them; and (iv) an implementation of Keller et al.'s surfel-based reconstruction approach.
first_indexed	2024-09-25T04:10:11Z
format	Internet publication
id	oxford-uuid:36cf7111-17cd-4c09-a71d-450253d02f64
institution	University of Oxford
language	English
last_indexed	2024-09-25T04:10:11Z
publishDate	2017
publisher	ArXiv
record_format	dspace
spelling	oxford-uuid:36cf7111-17cd-4c09-a71d-450253d02f642024-06-20T15:47:34ZInfiniTAM v3: a framework for large-scale 3D reconstruction with loop closureInternet publicationhttp://purl.org/coar/resource_type/c_7ad9uuid:36cf7111-17cd-4c09-a71d-450253d02f64EnglishSymplectic ElementsArXiv2017Prisacariu, VAKähler, OGolodetz, SSapienza, MCavallari, TTorr, PMurray, DWVolumetric models have become a popular representation for 3D scenes in recent years. One breakthrough leading to their popularity was KinectFusion, which focuses on 3D reconstruction using RGB-D sensors. However, monocular SLAM has since also been tackled with very similar approaches. Representing the reconstruction volumetrically as a TSDF leads to most of the simplicity and efficiency that can be achieved with GPU implementations of these systems. However, this representation is memory-intensive and limits applicability to small-scale reconstructions. Several avenues have been explored to overcome this. With the aim of summarizing them and providing for a fast, flexible 3D reconstruction pipeline, we propose a new, unifying framework called InfiniTAM. The idea is that steps like camera tracking, scene representation and integration of new data can easily be replaced and adapted to the user's needs. This report describes the technical implementation details of InfiniTAM v3, the third version of our InfiniTAM system. We have added various new features, as well as making numerous enhancements to the low-level code that significantly improve our camera tracking performance. The new features that we expect to be of most interest are (i) a robust camera tracking module; (ii) an implementation of Glocker et al.'s keyframe-based random ferns camera relocaliser; (iii) a novel approach to globally-consistent TSDF-based reconstruction, based on dividing the scene into rigid submaps and optimising the relative poses between them; and (iv) an implementation of Keller et al.'s surfel-based reconstruction approach.
spellingShingle	Prisacariu, VA Kähler, O Golodetz, S Sapienza, M Cavallari, T Torr, P Murray, DW InfiniTAM v3: a framework for large-scale 3D reconstruction with loop closure
title	InfiniTAM v3: a framework for large-scale 3D reconstruction with loop closure
title_full	InfiniTAM v3: a framework for large-scale 3D reconstruction with loop closure
title_fullStr	InfiniTAM v3: a framework for large-scale 3D reconstruction with loop closure
title_full_unstemmed	InfiniTAM v3: a framework for large-scale 3D reconstruction with loop closure
title_short	InfiniTAM v3: a framework for large-scale 3D reconstruction with loop closure
title_sort	infinitam v3 a framework for large scale 3d reconstruction with loop closure
work_keys_str_mv	AT prisacariuva infinitamv3aframeworkforlargescale3dreconstructionwithloopclosure AT kahlero infinitamv3aframeworkforlargescale3dreconstructionwithloopclosure AT golodetzs infinitamv3aframeworkforlargescale3dreconstructionwithloopclosure AT sapienzam infinitamv3aframeworkforlargescale3dreconstructionwithloopclosure AT cavallarit infinitamv3aframeworkforlargescale3dreconstructionwithloopclosure AT torrp infinitamv3aframeworkforlargescale3dreconstructionwithloopclosure AT murraydw infinitamv3aframeworkforlargescale3dreconstructionwithloopclosure

InfiniTAM v3: a framework for large-scale 3D reconstruction with loop closure

Similar Items