Spatial Reference Frames for Object Recognition: Tuning for Rotations in Depth

The inferior temporal cortex (IT) of monkeys is thought to play an essential role in visual object recognition. Inferotemporal neurons are known to respond to complex visual stimuli, including patterns like faces, hands, or other body parts. What is the role of such neurons in object recogniti...

Full description

Bibliographic Details
Main Authors: Logothetis, N.K., Pauls, J., Poggio, T.
Language:en_US
Published: 2004
Online Access:http://hdl.handle.net/1721.1/6637
_version_ 1811074874650656768
author Logothetis, N.K.
Pauls, J.
Poggio, T.
author_facet Logothetis, N.K.
Pauls, J.
Poggio, T.
author_sort Logothetis, N.K.
collection MIT
description The inferior temporal cortex (IT) of monkeys is thought to play an essential role in visual object recognition. Inferotemporal neurons are known to respond to complex visual stimuli, including patterns like faces, hands, or other body parts. What is the role of such neurons in object recognition? The present study examines this question in combined psychophysical and electrophysiological experiments, in which monkeys learned to classify and recognize novel visual 3D objects. A population of neurons in IT were found to respond selectively to such objects that the monkeys had recently learned to recognize. A large majority of these cells discharged maximally for one view of the object, while their response fell off gradually as the object was rotated away from the neuron"s preferred view. Most neurons exhibited orientation-dependent responses also during view-plane rotations. Some neurons were found tuned around two views of the same object, while a very small number of cells responded in a view- invariant manner. For five different objects that were extensively used during the training of the animals, and for which behavioral performance became view-independent, multiple cells were found that were tuned around different views of the same object. No selective responses were ever encountered for views that the animal systematically failed to recognize. The results of our experiments suggest that neurons in this area can develop a complex receptive field organization as a consequence of extensive training in the discrimination and recognition of objects. Simple geometric features did not appear to account for the neurons" selective responses. These findings support the idea that a population of neurons -- each tuned to a different object aspect, and each showing a certain degree of invariance to image transformations -- may, as an assembly, encode complex 3D objects. In such a system, several neurons may be active for any given vantage point, with a single unit acting like a blurred template for a limited neighborhood of a single view.
first_indexed 2024-09-23T09:56:44Z
id mit-1721.1/6637
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T09:56:44Z
publishDate 2004
record_format dspace
spelling mit-1721.1/66372019-04-12T08:31:42Z Spatial Reference Frames for Object Recognition: Tuning for Rotations in Depth Logothetis, N.K. Pauls, J. Poggio, T. The inferior temporal cortex (IT) of monkeys is thought to play an essential role in visual object recognition. Inferotemporal neurons are known to respond to complex visual stimuli, including patterns like faces, hands, or other body parts. What is the role of such neurons in object recognition? The present study examines this question in combined psychophysical and electrophysiological experiments, in which monkeys learned to classify and recognize novel visual 3D objects. A population of neurons in IT were found to respond selectively to such objects that the monkeys had recently learned to recognize. A large majority of these cells discharged maximally for one view of the object, while their response fell off gradually as the object was rotated away from the neuron"s preferred view. Most neurons exhibited orientation-dependent responses also during view-plane rotations. Some neurons were found tuned around two views of the same object, while a very small number of cells responded in a view- invariant manner. For five different objects that were extensively used during the training of the animals, and for which behavioral performance became view-independent, multiple cells were found that were tuned around different views of the same object. No selective responses were ever encountered for views that the animal systematically failed to recognize. The results of our experiments suggest that neurons in this area can develop a complex receptive field organization as a consequence of extensive training in the discrimination and recognition of objects. Simple geometric features did not appear to account for the neurons" selective responses. These findings support the idea that a population of neurons -- each tuned to a different object aspect, and each showing a certain degree of invariance to image transformations -- may, as an assembly, encode complex 3D objects. In such a system, several neurons may be active for any given vantage point, with a single unit acting like a blurred template for a limited neighborhood of a single view. 2004-10-08T20:36:01Z 2004-10-08T20:36:01Z 1995-03-01 AIM-1533 http://hdl.handle.net/1721.1/6637 en_US AIM-1533 1535116 bytes 1799567 bytes application/postscript application/pdf application/postscript application/pdf
spellingShingle Logothetis, N.K.
Pauls, J.
Poggio, T.
Spatial Reference Frames for Object Recognition: Tuning for Rotations in Depth
title Spatial Reference Frames for Object Recognition: Tuning for Rotations in Depth
title_full Spatial Reference Frames for Object Recognition: Tuning for Rotations in Depth
title_fullStr Spatial Reference Frames for Object Recognition: Tuning for Rotations in Depth
title_full_unstemmed Spatial Reference Frames for Object Recognition: Tuning for Rotations in Depth
title_short Spatial Reference Frames for Object Recognition: Tuning for Rotations in Depth
title_sort spatial reference frames for object recognition tuning for rotations in depth
url http://hdl.handle.net/1721.1/6637
work_keys_str_mv AT logothetisnk spatialreferenceframesforobjectrecognitiontuningforrotationsindepth
AT paulsj spatialreferenceframesforobjectrecognitiontuningforrotationsindepth
AT poggiot spatialreferenceframesforobjectrecognitiontuningforrotationsindepth