A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection

Recently, several powerful image features have been proposed whichcan be described as spatial histograms of oriented energy. Forinstance, the HoG, HMAX C1, SIFT, and Shape Context feature allrepresent an input image using with a discrete set of bins whichaccumulate evidence for oriented structures o...

Full description

Bibliographic Details
Main Author: Bileschi, Stanley M
Other Authors: Tomaso Poggio
Published: 2008
Subjects:
Online Access:http://hdl.handle.net/1721.1/41093
_version_ 1811092850580914176
author Bileschi, Stanley M
author2 Tomaso Poggio
author_facet Tomaso Poggio
Bileschi, Stanley M
author_sort Bileschi, Stanley M
collection MIT
description Recently, several powerful image features have been proposed whichcan be described as spatial histograms of oriented energy. Forinstance, the HoG, HMAX C1, SIFT, and Shape Context feature allrepresent an input image using with a discrete set of bins whichaccumulate evidence for oriented structures over a spatial regionand a range of orientations. In this work, we generalize thesetechniques to allow for a foveated input image, rather than arectilinear raster. It will be shown that improved object detectionaccuracy can be achieved via inputting a spectrum of imagemeasurements, from sharp, fine-scale image sampling within a smallspatial region within the target to coarse-scale sampling of a widefield of view around the target. Several alternative featuregeneration algorithms are proposed and tested which suitably makeuse of foveated image inputs. In the experiments we show thatfeatures generated from the foveated input format produce detectorsof greater accuracy, as measured for four object types from commonlyavailable data-sets. Finally, a flexible algorithm for generatingfeatures is described and tested which is independent of inputtopology and uses ICA to learn appropriate filters.
first_indexed 2024-09-23T15:30:21Z
id mit-1721.1/41093
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T15:30:21Z
publishDate 2008
record_format dspace
spelling mit-1721.1/410932019-04-10T23:12:16Z A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection Bileschi, Stanley M Tomaso Poggio Center for Biological and Computational Learning (CBCL) Object Detection, ICA, Multi-Scale, Image Features Recently, several powerful image features have been proposed whichcan be described as spatial histograms of oriented energy. Forinstance, the HoG, HMAX C1, SIFT, and Shape Context feature allrepresent an input image using with a discrete set of bins whichaccumulate evidence for oriented structures over a spatial regionand a range of orientations. In this work, we generalize thesetechniques to allow for a foveated input image, rather than arectilinear raster. It will be shown that improved object detectionaccuracy can be achieved via inputting a spectrum of imagemeasurements, from sharp, fine-scale image sampling within a smallspatial region within the target to coarse-scale sampling of a widefield of view around the target. Several alternative featuregeneration algorithms are proposed and tested which suitably makeuse of foveated image inputs. In the experiments we show thatfeatures generated from the foveated input format produce detectorsof greater accuracy, as measured for four object types from commonlyavailable data-sets. Finally, a flexible algorithm for generatingfeatures is described and tested which is independent of inputtopology and uses ICA to learn appropriate filters. 2008-04-09T20:15:10Z 2008-04-09T20:15:10Z 2008-04-09 MIT-CSAIL-TR-2008-019 CBCL-271 http://hdl.handle.net/1721.1/41093 Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory 8 p. application/pdf application/postscript
spellingShingle Object Detection, ICA, Multi-Scale, Image Features
Bileschi, Stanley M
A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection
title A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection
title_full A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection
title_fullStr A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection
title_full_unstemmed A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection
title_short A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection
title_sort multi scale generalization of the hog and hmax image descriptors for object detection
topic Object Detection, ICA, Multi-Scale, Image Features
url http://hdl.handle.net/1721.1/41093
work_keys_str_mv AT bileschistanleym amultiscalegeneralizationofthehogandhmaximagedescriptorsforobjectdetection
AT bileschistanleym multiscalegeneralizationofthehogandhmaximagedescriptorsforobjectdetection