Robust camera pose estimation by viewpoint classification using deep learning

Abstract Camera pose estimation with respect to target scenes is an important technology for superimposing virtual information in augmented reality (AR). However, it is difficult to estimate the camera pose for all possible view angles because feature descriptors such as SIFT are not completely inva...

Full description

Bibliographic Details
Main Authors: Yoshikatsu Nakajima, Hideo Saito
Format: Article
Language:English
Published: SpringerOpen 2016-12-01
Series:Computational Visual Media
Subjects:
Online Access:http://link.springer.com/article/10.1007/s41095-016-0067-z
_version_ 1811322347942051840
author Yoshikatsu Nakajima
Hideo Saito
author_facet Yoshikatsu Nakajima
Hideo Saito
author_sort Yoshikatsu Nakajima
collection DOAJ
description Abstract Camera pose estimation with respect to target scenes is an important technology for superimposing virtual information in augmented reality (AR). However, it is difficult to estimate the camera pose for all possible view angles because feature descriptors such as SIFT are not completely invariant from every perspective. We propose a novel method of robust camera pose estimation using multiple feature descriptor databases generated for each partitioned viewpoint, in which the feature descriptor of each keypoint is almost invariant. Our method estimates the viewpoint class for each input image using deep learning based on a set of training images prepared for each viewpoint class. We give two ways to prepare these images for deep learning and generating databases. In the first method, images are generated using a projection matrix to ensure robust learning in a range of environments with changing backgrounds. The second method uses real images to learn a given environment around a planar pattern. Our evaluation results confirm that our approach increases the number of correct matches and the accuracy of camera pose estimation compared to the conventional method.
first_indexed 2024-04-13T13:33:43Z
format Article
id doaj.art-5cc151e3a53440c29ceebe0473ced07c
institution Directory Open Access Journal
issn 2096-0433
2096-0662
language English
last_indexed 2024-04-13T13:33:43Z
publishDate 2016-12-01
publisher SpringerOpen
record_format Article
series Computational Visual Media
spelling doaj.art-5cc151e3a53440c29ceebe0473ced07c2022-12-22T02:44:51ZengSpringerOpenComputational Visual Media2096-04332096-06622016-12-013218919810.1007/s41095-016-0067-zRobust camera pose estimation by viewpoint classification using deep learningYoshikatsu Nakajima0Hideo Saito1Department of Science and Technology, Keio UniversityDepartment of Science and Technology, Keio UniversityAbstract Camera pose estimation with respect to target scenes is an important technology for superimposing virtual information in augmented reality (AR). However, it is difficult to estimate the camera pose for all possible view angles because feature descriptors such as SIFT are not completely invariant from every perspective. We propose a novel method of robust camera pose estimation using multiple feature descriptor databases generated for each partitioned viewpoint, in which the feature descriptor of each keypoint is almost invariant. Our method estimates the viewpoint class for each input image using deep learning based on a set of training images prepared for each viewpoint class. We give two ways to prepare these images for deep learning and generating databases. In the first method, images are generated using a projection matrix to ensure robust learning in a range of environments with changing backgrounds. The second method uses real images to learn a given environment around a planar pattern. Our evaluation results confirm that our approach increases the number of correct matches and the accuracy of camera pose estimation compared to the conventional method.http://link.springer.com/article/10.1007/s41095-016-0067-zpose estimationaugmented reality (AR)deep learningconvolutional neural network
spellingShingle Yoshikatsu Nakajima
Hideo Saito
Robust camera pose estimation by viewpoint classification using deep learning
Computational Visual Media
pose estimation
augmented reality (AR)
deep learning
convolutional neural network
title Robust camera pose estimation by viewpoint classification using deep learning
title_full Robust camera pose estimation by viewpoint classification using deep learning
title_fullStr Robust camera pose estimation by viewpoint classification using deep learning
title_full_unstemmed Robust camera pose estimation by viewpoint classification using deep learning
title_short Robust camera pose estimation by viewpoint classification using deep learning
title_sort robust camera pose estimation by viewpoint classification using deep learning
topic pose estimation
augmented reality (AR)
deep learning
convolutional neural network
url http://link.springer.com/article/10.1007/s41095-016-0067-z
work_keys_str_mv AT yoshikatsunakajima robustcameraposeestimationbyviewpointclassificationusingdeeplearning
AT hideosaito robustcameraposeestimationbyviewpointclassificationusingdeeplearning