Efficient Object Recognition and Image Retrieval for Large-Scale Applications

Algorithms for recognition and retrieval tasks generally call for both speed and accuracy. When scaling up to very large applications, however, we encounter additional significant requirements: adaptability and scalability. In many real-world systems, large numbers of images are constantly added to...

Bibliographic Details
Main Author:	Lee, John J.
Other Authors:	Trevor Darrell
Published:	2008
Online Access:	http://hdl.handle.net/1721.1/41519

_version_	1826188298326900736
author	Lee, John J.
author2	Trevor Darrell
author_facet	Trevor Darrell Lee, John J.
author_sort	Lee, John J.
collection	MIT
description	Algorithms for recognition and retrieval tasks generally call for both speed and accuracy. When scaling up to very large applications, however, we encounter additional significant requirements: adaptability and scalability. In many real-world systems, large numbers of images are constantly added to the database, requiring the algorithm to quickly tune itself to recent trends so it can serve queries more effectively. Moreover, the systems need to be able to meet the demands of simultaneous queries from many users. In this thesis, I describe two new algorithms intended to meet these requirements and give an extensive experimental evaluation for both. The first algorithm constructs an adaptive vocabulary forest, which is an efficient image-database model that grows and shrinks as needed while adapting its structure to tune itself to recent trends. The second algorithm is a method for efficiently performing classification tasks by comparing query images to only a fixed number of training examples, regardless of the size of the image database. These two methods can be combined to create a fast, adaptable, and scalable vision system suitable for large-scale applications. I also introduce LIBPMK, a fast implementation of common computer vision processing pipelines such as that of the pyramid match kernel. This implementation was used to build several successful interactive applications as well as batch experiments for research settings. This implementation, in addition to the two new algorithms introduced by this thesis, is a step toward meeting the speed, adaptability, and scalability requirements of practical large-scale vision systems.
first_indexed	2024-09-23T07:57:32Z
id	mit-1721.1/41519
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T07:57:32Z
publishDate	2008
record_format	dspace
spelling	mit-1721.1/41519 2019-04-09T15:45:18Z Efficient Object Recognition and Image Retrieval for Large-Scale Applications Lee, John J. Trevor Darrell Vision 2008-05-06T23:00:12Z 2008-05-06T23:00:12Z 2008-05-06 MIT-CSAIL-TR-2008-026 http://hdl.handle.net/1721.1/41519 Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory 93 p. application/pdf application/postscript
spellingShingle	Lee, John J. Efficient Object Recognition and Image Retrieval for Large-Scale Applications
title	Efficient Object Recognition and Image Retrieval for Large-Scale Applications
title_sort	efficient object recognition and image retrieval for large scale applications
url	http://hdl.handle.net/1721.1/41519
work_keys_str_mv	AT leejohnj efficientobjectrecognitionandimageretrievalforlargescaleapplications
