Wide-area egomotion from omnidirectional video and coarse 3D structure

Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2007.

Bibliographic Details
Main Author:	Koch, Olivier (Olivier A.)
Other Authors:	Seth Teller.
Format:	Thesis
Language:	eng
Published:	Massachusetts Institute of Technology 2007
Subjects:	Electrical Engineering and Computer Science.
Online Access:	http://hdl.handle.net/1721.1/38668

_version_	1826199271216513024
author	Koch, Olivier (Olivier A.)
author2	Seth Teller.
author_facet	Seth Teller. Koch, Olivier (Olivier A.)
author_sort	Koch, Olivier (Olivier A.)
collection	MIT
description	Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2007.
first_indexed	2024-09-23T11:17:07Z
format	Thesis
id	mit-1721.1/38668
institution	Massachusetts Institute of Technology
language	eng
last_indexed	2024-09-23T11:17:07Z
publishDate	2007
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/386682019-04-12T13:27:45Z Wide-area egomotion from omnidirectional video and coarse 3D structure Koch, Olivier (Olivier A.) Seth Teller. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2007. Includes bibliographical references (p. 85-89). This thesis describes a method for real-time vision-based localization in human-made environments. Given a coarse model of the structure (walls, floors, ceilings, doors and windows) and a video sequence, the system computes the camera pose (translation and rotation) in model coordinates with an accuracy of a few centimeters in translation and a few degrees in rotation. The system has several novel aspects: it performs 6-DOF localization; it handles visually cluttered and dynamic environments; it scales well over regions extending through several buildings; and it runs over several hours without losing lock. We demonstrate that the localization problem can be split into two distinct problems: an initialization phase and a maintenance phase. In the initialization phase, the system determines the camera pose with no other information than a search region provided by the user (building, floor, area, room). This step is computationally intensive and is run only once, at startup. We present a probabilistic method to address the initialization problem using a RANSAC framework. In the maintenance phase, the system keeps track of the camera pose from frame to frame without any user interaction. (cont.) This phase is computationally light-weight to allow a high processing frame rate and is coupled with a feedback loop that helps reacquire "lock" when lock has been lost. We demonstrate a simple, robust geometric tracking algorithm based on correspondences between 3D model lines and 2D image edges. We present navigation results on several real datasets across the MIT campus with cluttered, dynamic environments. The first dataset consists of a five-minute robotic exploration across the Robotics, Vision and Sensor Network Lab. The second dataset consists of a two-minute hand-held, 3D motion in the same lab space. The third dataset consists of a 26-minute exploration across MIT buildings 26 and 36. by Olivier Koch. S.M. 2007-08-29T20:41:50Z 2007-08-29T20:41:50Z 2007 2007 Thesis http://hdl.handle.net/1721.1/38668 163582267 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 110 p. application/pdf Massachusetts Institute of Technology
spellingShingle	Electrical Engineering and Computer Science. Koch, Olivier (Olivier A.) Wide-area egomotion from omnidirectional video and coarse 3D structure
title	Wide-area egomotion from omnidirectional video and coarse 3D structure
title_full	Wide-area egomotion from omnidirectional video and coarse 3D structure
title_fullStr	Wide-area egomotion from omnidirectional video and coarse 3D structure
title_full_unstemmed	Wide-area egomotion from omnidirectional video and coarse 3D structure
title_short	Wide-area egomotion from omnidirectional video and coarse 3D structure
title_sort	wide area egomotion from omnidirectional video and coarse 3d structure
topic	Electrical Engineering and Computer Science.
url	http://hdl.handle.net/1721.1/38668
work_keys_str_mv	AT kocholivieroliviera wideareaegomotionfromomnidirectionalvideoandcoarse3dstructure

Wide-area egomotion from omnidirectional video and coarse 3D structure

Similar Items