Attentive processing improves object recognition

The human visual system can recognize several thousand object categories irrespective of their position and size. This combination of selectivity and invariance is built up gradually across several stages of visual processing. However, the recognition of multiple objects in cluttered visual scenes p...

Bibliographic Details
Main Authors: Chikkerur, Sharat; Poggio, Tomaso; Serre, Thomas
Other Authors: Tomaso Poggio
Published: 2009
Online Access: http://hdl.handle.net/1721.1/49415
author Chikkerur, Sharat
Poggio, Tomaso
Serre, Thomas
author2 Tomaso Poggio
collection MIT
description The human visual system can recognize several thousand object categories irrespective of their position and size. This combination of selectivity and invariance is built up gradually across several stages of visual processing. However, the recognition of multiple objects in cluttered visual scenes presents a difficult problem for human as well as machine vision systems. The human visual system has evolved to perform two stages of visual processing: a pre-attentive parallel processing stage, in which the entire visual field is processed at once, and a slow serial attentive processing stage, in which a region of interest in an input image is selected for "specialized" analysis by an attentional spotlight. We argue that this strategy evolved to overcome the limitation of purely feedforward processing in the presence of clutter and crowding. Using a Bayesian model of attention along with a hierarchical model of feedforward recognition on a data set of real-world images, we show that this two-stage attentive processing can improve recognition in cluttered and crowded conditions.
first_indexed 2024-09-23T15:17:19Z
id mit-1721.1/49415
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T15:17:19Z
publishDate 2009
record_format dspace
spelling mit-1721.1/49415 2019-04-12T23:25:14Z Center for Biological and Computational Learning (CBCL) 2009-10-06T22:45:05Z 2009-10-06T22:45:05Z 2009-10-02 http://hdl.handle.net/1721.1/49415 CBCL-279 MIT-CSAIL-TR-2009-046 12 p. application/pdf application/postscript
title Attentive processing improves object recognition
url http://hdl.handle.net/1721.1/49415