Map-relative pose regression for visual re-localization

Pose regression networks predict the camera pose of a query image relative to a known environment. Within this family of methods, absolute pose regression (APR) has recently shown promising accuracy in the range of a few centimeters in position error. APR networks encode the scene geometry implicitl...

Full description

Bibliographic Details
Main Authors: Chen, S, Cavallari, T, Prisacariu, VA, Brachmann, E
Format: Internet publication
Language:English
Published: 2024
_version_ 1811139495301480448
author Chen, S
Cavallari, T
Prisacariu, VA
Brachmann, E
author_facet Chen, S
Cavallari, T
Prisacariu, VA
Brachmann, E
author_sort Chen, S
collection OXFORD
description Pose regression networks predict the camera pose of a query image relative to a known environment. Within this family of methods, absolute pose regression (APR) has recently shown promising accuracy in the range of a few centimeters in position error. APR networks encode the scene geometry implicitly in their weights. To achieve high accuracy, they require vast amounts of training data that, realistically, can only be created using novel view synthesis in a days-long process. This process has to be repeated for each new scene again and again. We present a new approach to pose regression, map-relative pose regression (marepo), that satisfies the data hunger of the pose regression network in a scene-agnostic fashion. We condition the pose regressor on a scene-specific map representation such that its pose predictions are relative to the scene map. This allows us to train the pose regressor across hundreds of scenes to learn the generic relation between a scene-specific map representation and the camera pose. Our map-relative pose regressor can be applied to new map representations immediately or after mere minutes of fine-tuning for the highest accuracy. Our approach outperforms previous pose regression methods by far on two public datasets, indoor and outdoor. Code is available: https://nianticlabs.github.io/marepo/
first_indexed 2024-09-25T04:07:00Z
format Internet publication
id oxford-uuid:685cfb83-21d0-429c-9d58-905f1f7597ef
institution University of Oxford
language English
last_indexed 2024-09-25T04:07:00Z
publishDate 2024
record_format dspace
spelling oxford-uuid:685cfb83-21d0-429c-9d58-905f1f7597ef2024-06-04T11:18:56ZMap-relative pose regression for visual re-localizationInternet publicationhttp://purl.org/coar/resource_type/c_7ad9uuid:685cfb83-21d0-429c-9d58-905f1f7597efEnglishSymplectic Elements2024Chen, SCavallari, TPrisacariu, VABrachmann, EPose regression networks predict the camera pose of a query image relative to a known environment. Within this family of methods, absolute pose regression (APR) has recently shown promising accuracy in the range of a few centimeters in position error. APR networks encode the scene geometry implicitly in their weights. To achieve high accuracy, they require vast amounts of training data that, realistically, can only be created using novel view synthesis in a days-long process. This process has to be repeated for each new scene again and again. We present a new approach to pose regression, map-relative pose regression (marepo), that satisfies the data hunger of the pose regression network in a scene-agnostic fashion. We condition the pose regressor on a scene-specific map representation such that its pose predictions are relative to the scene map. This allows us to train the pose regressor across hundreds of scenes to learn the generic relation between a scene-specific map representation and the camera pose. Our map-relative pose regressor can be applied to new map representations immediately or after mere minutes of fine-tuning for the highest accuracy. Our approach outperforms previous pose regression methods by far on two public datasets, indoor and outdoor. Code is available: https://nianticlabs.github.io/marepo/
spellingShingle Chen, S
Cavallari, T
Prisacariu, VA
Brachmann, E
Map-relative pose regression for visual re-localization
title Map-relative pose regression for visual re-localization
title_full Map-relative pose regression for visual re-localization
title_fullStr Map-relative pose regression for visual re-localization
title_full_unstemmed Map-relative pose regression for visual re-localization
title_short Map-relative pose regression for visual re-localization
title_sort map relative pose regression for visual re localization
work_keys_str_mv AT chens maprelativeposeregressionforvisualrelocalization
AT cavallarit maprelativeposeregressionforvisualrelocalization
AT prisacariuva maprelativeposeregressionforvisualrelocalization
AT brachmanne maprelativeposeregressionforvisualrelocalization