Matching sets of features for efficient retrieval and recognition

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.

Bibliographic Details
Main Author: Grauman, Kristen Lorraine, 1979-
Other Authors: Trevor Darrell.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2007
Subjects:
Online Access:http://hdl.handle.net/1721.1/38296
_version_ 1811081842940444672
author Grauman, Kristen Lorraine, 1979-
author2 Trevor Darrell.
author_facet Trevor Darrell.
Grauman, Kristen Lorraine, 1979-
author_sort Grauman, Kristen Lorraine, 1979-
collection MIT
description Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.
first_indexed 2024-09-23T11:53:21Z
format Thesis
id mit-1721.1/38296
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T11:53:21Z
publishDate 2007
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/382962019-04-10T12:50:15Z Matching sets of features for efficient retrieval and recognition Grauman, Kristen Lorraine, 1979- Trevor Darrell. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. Includes bibliographical references (p. 145-153). In numerous domains it is useful to represent a single example by the collection of local features or parts that comprise it. In computer vision in particular, local image features are a powerful way to describe images of objects and scenes. Their stability under variable image conditions is critical for success in a wide range of recognition and retrieval applications. However, many conventional similarity measures and machine learning algorithms assume vector inputs. Comparing and learning from images represented by sets of local features is therefore challenging, since each set may vary in cardinality and its elements lack a meaningful ordering. In this thesis I present computationally efficient techniques to handle comparisons, learning, and indexing with examples represented by sets of features. The primary goal of this research is to design and demonstrate algorithms that can effectively accommodate this useful representation in a way that scales with both the representation size as well as the number of images available for indexing or learning. I introduce the pyramid match algorithm, which efficiently forms an implicit partial matching between two sets of feature vectors. (cont.) The matching has a linear time complexity, naturally forms a Mercer kernel, and is robust to clutter or outlier features, a critical advantage for handling images with variable backgrounds, occlusions, and viewpoint changes. I provide bounds on the expected error relative to the optimal partial matching. For very large databases, even extremely efficient pairwise comparisons may not offer adequately responsive query times. I show how to perform sub-linear time retrievals under the matching measure with randomized hashing techniques, even when input sets have varying numbers of features. My results are focused on several important vision tasks, including applications to content-based image retrieval, discriminative classification for object recognition, kernel regression, and unsupervised learning of categories. I show how the dramatic increase in performance enables accurate and flexible image comparisons to be made on large-scale data sets, and removes the need to artificially limit the number of local descriptions used per image when learning visual categories. by Kristen Lorraine Grauman. Ph.D. 2007-08-03T18:26:44Z 2007-08-03T18:26:44Z 2006 2006 Thesis http://hdl.handle.net/1721.1/38296 153915528 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 153 p. application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Grauman, Kristen Lorraine, 1979-
Matching sets of features for efficient retrieval and recognition
title Matching sets of features for efficient retrieval and recognition
title_full Matching sets of features for efficient retrieval and recognition
title_fullStr Matching sets of features for efficient retrieval and recognition
title_full_unstemmed Matching sets of features for efficient retrieval and recognition
title_short Matching sets of features for efficient retrieval and recognition
title_sort matching sets of features for efficient retrieval and recognition
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/38296
work_keys_str_mv AT graumankristenlorraine1979 matchingsetsoffeaturesforefficientretrievalandrecognition