Let's take this online: adapting scene coordinate regression network predictions for online RGB-D camera relocalisation

Many applications require a camera to be relocalised online, without expensive offline training on the target scene. Whilst both keyframe and sparse keypoint matching methods can be used online, the former often fail away from the training trajectory, and the latter can struggle in textureless regio...

Full description

Bibliographic Details
Main Authors: Cavallari, T, Bertinetto, L, Mukhoti, J, Torr, P, Golodetz, S
Format: Conference item
Published: Institute of Electrical and Electronics Engineers 2019
_version_ 1797067778069364736
author Cavallari, T
Bertinetto, L
Mukhoti, J
Torr, P
Golodetz, S
author_facet Cavallari, T
Bertinetto, L
Mukhoti, J
Torr, P
Golodetz, S
author_sort Cavallari, T
collection OXFORD
description Many applications require a camera to be relocalised online, without expensive offline training on the target scene. Whilst both keyframe and sparse keypoint matching methods can be used online, the former often fail away from the training trajectory, and the latter can struggle in textureless regions. By contrast, scene coordinate regression (SCoRe) methods generalise to novel poses and can leverage dense correspondences to improve robustness, and recent work has shown how to adapt SCoRe forests between scenes, allowing their state-of-the-art performance to be leveraged online. However, because they use features hand-crafted for indoor use, they do not generalise well to harder outdoor scenes. Whilst replacing the forest with a neural network and learning suitable features for outdoor use is possible, the techniques used to adapt forests between scenes are unfortunately harder to transfer to a network context. In this paper, we address this by proposing a novel way of leveraging a network trained on one scene to predict points in another scene. Our approach replaces the appearance clustering performed by the branching structure of a regression forest with a two-step process that first uses the network to predict points in the original scene, and then uses these predicted points to look up clusters of points from the new scene. We show experimentally that our online approach achieves state-of-the-art performance on both the 7-Scenes and Cambridge Landmarks datasets, whilst running in under 300ms, making it highly effective in live scenarios.
first_indexed 2024-03-06T22:01:12Z
format Conference item
id oxford-uuid:4ea540c3-057f-4642-9388-891494343890
institution University of Oxford
last_indexed 2024-03-06T22:01:12Z
publishDate 2019
publisher Institute of Electrical and Electronics Engineers
record_format dspace
spelling oxford-uuid:4ea540c3-057f-4642-9388-8914943438902022-03-26T16:02:21ZLet's take this online: adapting scene coordinate regression network predictions for online RGB-D camera relocalisationConference itemhttp://purl.org/coar/resource_type/c_5794uuid:4ea540c3-057f-4642-9388-891494343890Symplectic Elements at OxfordInstitute of Electrical and Electronics Engineers2019Cavallari, TBertinetto, LMukhoti, JTorr, PGolodetz, SMany applications require a camera to be relocalised online, without expensive offline training on the target scene. Whilst both keyframe and sparse keypoint matching methods can be used online, the former often fail away from the training trajectory, and the latter can struggle in textureless regions. By contrast, scene coordinate regression (SCoRe) methods generalise to novel poses and can leverage dense correspondences to improve robustness, and recent work has shown how to adapt SCoRe forests between scenes, allowing their state-of-the-art performance to be leveraged online. However, because they use features hand-crafted for indoor use, they do not generalise well to harder outdoor scenes. Whilst replacing the forest with a neural network and learning suitable features for outdoor use is possible, the techniques used to adapt forests between scenes are unfortunately harder to transfer to a network context. In this paper, we address this by proposing a novel way of leveraging a network trained on one scene to predict points in another scene. Our approach replaces the appearance clustering performed by the branching structure of a regression forest with a two-step process that first uses the network to predict points in the original scene, and then uses these predicted points to look up clusters of points from the new scene. We show experimentally that our online approach achieves state-of-the-art performance on both the 7-Scenes and Cambridge Landmarks datasets, whilst running in under 300ms, making it highly effective in live scenarios.
spellingShingle Cavallari, T
Bertinetto, L
Mukhoti, J
Torr, P
Golodetz, S
Let's take this online: adapting scene coordinate regression network predictions for online RGB-D camera relocalisation
title Let's take this online: adapting scene coordinate regression network predictions for online RGB-D camera relocalisation
title_full Let's take this online: adapting scene coordinate regression network predictions for online RGB-D camera relocalisation
title_fullStr Let's take this online: adapting scene coordinate regression network predictions for online RGB-D camera relocalisation
title_full_unstemmed Let's take this online: adapting scene coordinate regression network predictions for online RGB-D camera relocalisation
title_short Let's take this online: adapting scene coordinate regression network predictions for online RGB-D camera relocalisation
title_sort let s take this online adapting scene coordinate regression network predictions for online rgb d camera relocalisation
work_keys_str_mv AT cavallarit letstakethisonlineadaptingscenecoordinateregressionnetworkpredictionsforonlinergbdcamerarelocalisation
AT bertinettol letstakethisonlineadaptingscenecoordinateregressionnetworkpredictionsforonlinergbdcamerarelocalisation
AT mukhotij letstakethisonlineadaptingscenecoordinateregressionnetworkpredictionsforonlinergbdcamerarelocalisation
AT torrp letstakethisonlineadaptingscenecoordinateregressionnetworkpredictionsforonlinergbdcamerarelocalisation
AT golodetzs letstakethisonlineadaptingscenecoordinateregressionnetworkpredictionsforonlinergbdcamerarelocalisation