Coresets for visual summarization with applications to loop closure

Bibliographic Details
Main Authors: Volkov, Mikhail; Rosman, Guy; Feldman, Dan; Fisher III, John W.; Rus, Daniela L.
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language: en_US
Published: Institute of Electrical and Electronics Engineers (IEEE) 2016
Online Access: http://hdl.handle.net/1721.1/101027
https://orcid.org/0000-0003-4844-3495
https://orcid.org/0000-0001-5473-3566
https://orcid.org/0000-0001-9632-754X
https://orcid.org/0000-0002-9334-1706
Description
Summary: In continuously operating robotic systems, efficient representation of the previously seen camera feed is crucial. Using a highly efficient compression coreset method, we formulate a new method for hierarchical retrieval of frames from large video streams collected online by a moving robot. We demonstrate how to utilize the resulting structure for efficient loop closure by a novel sampling approach that is adaptive to the structure of the video. The same structure also allows us to create a highly effective search tool for large-scale videos, which we demonstrate in this paper. We show the efficiency of the proposed approaches for retrieval and loop closure on standard datasets and on a large-scale video from a mobile camera.