Learning transform invariant object recognition in the visual system with multiple stimuli present during training.

Over successive stages, the visual system develops neurons that respond with view, size and position invariance to objects or faces. A number of computational models have been developed to explain how transform-invariant cells could develop in the visual system. However, a major limitation of comput...

Full description

Bibliographic Details
Main Authors: Stringer, S, Rolls, E
Format: Journal article
Language:English
Published: 2008
_version_ 1797057230660435968
author Stringer, S
Rolls, E
author_facet Stringer, S
Rolls, E
author_sort Stringer, S
collection OXFORD
description Over successive stages, the visual system develops neurons that respond with view, size and position invariance to objects or faces. A number of computational models have been developed to explain how transform-invariant cells could develop in the visual system. However, a major limitation of computer modelling studies to date has been that the visual stimuli are typically presented one at a time to the network during training. In this paper, we investigate how vision models may self-organize when multiple stimuli are presented together within each visual image during training. We show that as the number of independent stimuli grows large enough, standard competitive neural networks can suddenly switch from learning representations of the multi-stimulus input patterns to representing the individual stimuli. Furthermore, the competitive networks can learn transform (e.g. position or view) invariant representations of the individual stimuli if the network is presented with input patterns containing multiple transforming stimuli during training. Finally, we extend these results to a multi-layer hierarchical network model (VisNet) of the ventral visual system. The network is trained on input images containing multiple rotating 3D objects. We show that the network is able to develop view-invariant representations of the individual objects.
first_indexed 2024-03-06T19:33:20Z
format Journal article
id oxford-uuid:1e3457cd-7649-4e58-b12f-620a502c611e
institution University of Oxford
language English
last_indexed 2024-03-06T19:33:20Z
publishDate 2008
record_format dspace
spelling oxford-uuid:1e3457cd-7649-4e58-b12f-620a502c611e2022-03-26T11:15:04ZLearning transform invariant object recognition in the visual system with multiple stimuli present during training.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:1e3457cd-7649-4e58-b12f-620a502c611eEnglishSymplectic Elements at Oxford2008Stringer, SRolls, EOver successive stages, the visual system develops neurons that respond with view, size and position invariance to objects or faces. A number of computational models have been developed to explain how transform-invariant cells could develop in the visual system. However, a major limitation of computer modelling studies to date has been that the visual stimuli are typically presented one at a time to the network during training. In this paper, we investigate how vision models may self-organize when multiple stimuli are presented together within each visual image during training. We show that as the number of independent stimuli grows large enough, standard competitive neural networks can suddenly switch from learning representations of the multi-stimulus input patterns to representing the individual stimuli. Furthermore, the competitive networks can learn transform (e.g. position or view) invariant representations of the individual stimuli if the network is presented with input patterns containing multiple transforming stimuli during training. Finally, we extend these results to a multi-layer hierarchical network model (VisNet) of the ventral visual system. The network is trained on input images containing multiple rotating 3D objects. We show that the network is able to develop view-invariant representations of the individual objects.
spellingShingle Stringer, S
Rolls, E
Learning transform invariant object recognition in the visual system with multiple stimuli present during training.
title Learning transform invariant object recognition in the visual system with multiple stimuli present during training.
title_full Learning transform invariant object recognition in the visual system with multiple stimuli present during training.
title_fullStr Learning transform invariant object recognition in the visual system with multiple stimuli present during training.
title_full_unstemmed Learning transform invariant object recognition in the visual system with multiple stimuli present during training.
title_short Learning transform invariant object recognition in the visual system with multiple stimuli present during training.
title_sort learning transform invariant object recognition in the visual system with multiple stimuli present during training
work_keys_str_mv AT stringers learningtransforminvariantobjectrecognitioninthevisualsystemwithmultiplestimulipresentduringtraining
AT rollse learningtransforminvariantobjectrecognitioninthevisualsystemwithmultiplestimulipresentduringtraining