Supervising the new with the old: Learning SFM from SFM

Recent work has demonstrated that it is possible to learn deep neural networks for monocular depth and ego-motion estimation from unlabelled video sequences, an interesting theoretical development with numerous advantages in applications. In this paper, we propose a number of improvements to these a...

Full description

Bibliographic Details
Main Authors: Klodt, M, Vedaldi, A
Format: Conference item
Published: Springer 2018
_version_ 1797065736277983232
author Klodt, M
Vedaldi, A
author_facet Klodt, M
Vedaldi, A
author_sort Klodt, M
collection OXFORD
description Recent work has demonstrated that it is possible to learn deep neural networks for monocular depth and ego-motion estimation from unlabelled video sequences, an interesting theoretical development with numerous advantages in applications. In this paper, we propose a number of improvements to these approaches. First, since such self-supervised approaches are based on the brightness constancy assumption, which is valid only for a subset of pixels, we propose a probabilistic learning formulation where the network predicts distributions over variables rather than specific values. As these distributions are conditioned on the observed image, the network can learn which scene and object types are likely to violate the model assumptions, resulting in more robust learning. We also propose to build on dozens of years of experience in developing handcrafted structure-from-motion (SFM) algorithms. We do so by using an off-the-shelf SFM system to generate a supervisory signal for the deep neural network. While this signal is also noisy, we show that our probabilistic formulation can learn and account for the defects of SFM, helping to integrate different sources of information and boosting the overall performance of the network.
first_indexed 2024-03-06T21:32:50Z
format Conference item
id oxford-uuid:45401dc3-44cb-4e74-a589-16937f5c3450
institution University of Oxford
last_indexed 2024-03-06T21:32:50Z
publishDate 2018
publisher Springer
record_format dspace
spelling oxford-uuid:45401dc3-44cb-4e74-a589-16937f5c34502022-03-26T15:06:41ZSupervising the new with the old: Learning SFM from SFMConference itemhttp://purl.org/coar/resource_type/c_5794uuid:45401dc3-44cb-4e74-a589-16937f5c3450Symplectic Elements at OxfordSpringer2018Klodt, MVedaldi, ARecent work has demonstrated that it is possible to learn deep neural networks for monocular depth and ego-motion estimation from unlabelled video sequences, an interesting theoretical development with numerous advantages in applications. In this paper, we propose a number of improvements to these approaches. First, since such self-supervised approaches are based on the brightness constancy assumption, which is valid only for a subset of pixels, we propose a probabilistic learning formulation where the network predicts distributions over variables rather than specific values. As these distributions are conditioned on the observed image, the network can learn which scene and object types are likely to violate the model assumptions, resulting in more robust learning. We also propose to build on dozens of years of experience in developing handcrafted structure-from-motion (SFM) algorithms. We do so by using an off-the-shelf SFM system to generate a supervisory signal for the deep neural network. While this signal is also noisy, we show that our probabilistic formulation can learn and account for the defects of SFM, helping to integrate different sources of information and boosting the overall performance of the network.
spellingShingle Klodt, M
Vedaldi, A
Supervising the new with the old: Learning SFM from SFM
title Supervising the new with the old: Learning SFM from SFM
title_full Supervising the new with the old: Learning SFM from SFM
title_fullStr Supervising the new with the old: Learning SFM from SFM
title_full_unstemmed Supervising the new with the old: Learning SFM from SFM
title_short Supervising the new with the old: Learning SFM from SFM
title_sort supervising the new with the old learning sfm from sfm
work_keys_str_mv AT klodtm supervisingthenewwiththeoldlearningsfmfromsfm
AT vedaldia supervisingthenewwiththeoldlearningsfmfromsfm