Finding and recognising objects in natural scenes: complementary computations in the dorsal and ventral visual systems

Searching for and recognising objects in complex natural scenes is implemented by multiple saccades until the eyes reach within the reduced receptive field sizes of inferior temporal cortex (IT) neurons. We analyse and model how the dorsal and ventral visual streams both contribute to this. Saliency...

Full description

Bibliographic Details
Main Authors: Edmund eRolls, Tristan James Webb
Format: Article
Language:English
Published: Frontiers Media S.A. 2014-08-01
Series:Frontiers in Computational Neuroscience
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fncom.2014.00085/full
_version_ 1819093820211462144
author Edmund eRolls
Tristan James Webb
author_facet Edmund eRolls
Tristan James Webb
author_sort Edmund eRolls
collection DOAJ
description Searching for and recognising objects in complex natural scenes is implemented by multiple saccades until the eyes reach within the reduced receptive field sizes of inferior temporal cortex (IT) neurons. We analyse and model how the dorsal and ventral visual streams both contribute to this. Saliency detection in the dorsal visual system including area LIP is modelled by graph-based visual saliency, and allows the eyes to fixate potential objects within several degrees. Visual information at the fixated location subtending approximately 9 degrees corresponding to the receptive fields of IT neurons is then passed through a four layer hierarchical model of the ventral cortical visual system, VisNet. We show that VisNet can be trained using a synaptic modification rule with a short-term memory trace of recent neuronal activity to capture both the required view and translation invariances to allow in the model approximately 90% correct object recognition for 4 objects shown in any view across a range of 135 degrees anywhere in a scene.The model was able to generalize correctly within the four trained views and the 25 trained translations.This approach analyses the principles by which complementary computations in the dorsal and ventral visual cortical streams enable objects to be located and recognised in complex natural scenes.
first_indexed 2024-12-21T23:17:35Z
format Article
id doaj.art-9839065897434e5cab7c676d02fd76e4
institution Directory Open Access Journal
issn 1662-5188
language English
last_indexed 2024-12-21T23:17:35Z
publishDate 2014-08-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Computational Neuroscience
spelling doaj.art-9839065897434e5cab7c676d02fd76e42022-12-21T18:46:53ZengFrontiers Media S.A.Frontiers in Computational Neuroscience1662-51882014-08-01810.3389/fncom.2014.00085102339Finding and recognising objects in natural scenes: complementary computations in the dorsal and ventral visual systemsEdmund eRolls0Tristan James Webb1Oxford Centre for Computational NeuroscienceUniversity of WarwickSearching for and recognising objects in complex natural scenes is implemented by multiple saccades until the eyes reach within the reduced receptive field sizes of inferior temporal cortex (IT) neurons. We analyse and model how the dorsal and ventral visual streams both contribute to this. Saliency detection in the dorsal visual system including area LIP is modelled by graph-based visual saliency, and allows the eyes to fixate potential objects within several degrees. Visual information at the fixated location subtending approximately 9 degrees corresponding to the receptive fields of IT neurons is then passed through a four layer hierarchical model of the ventral cortical visual system, VisNet. We show that VisNet can be trained using a synaptic modification rule with a short-term memory trace of recent neuronal activity to capture both the required view and translation invariances to allow in the model approximately 90% correct object recognition for 4 objects shown in any view across a range of 135 degrees anywhere in a scene.The model was able to generalize correctly within the four trained views and the 25 trained translations.This approach analyses the principles by which complementary computations in the dorsal and ventral visual cortical streams enable objects to be located and recognised in complex natural scenes.http://journal.frontiersin.org/Journal/10.3389/fncom.2014.00085/fullobject recognitioninvarianceinferior temporal visual cortexVisNetsaliencytrace learning rule
spellingShingle Edmund eRolls
Tristan James Webb
Finding and recognising objects in natural scenes: complementary computations in the dorsal and ventral visual systems
Frontiers in Computational Neuroscience
object recognition
invariance
inferior temporal visual cortex
VisNet
saliency
trace learning rule
title Finding and recognising objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_full Finding and recognising objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_fullStr Finding and recognising objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_full_unstemmed Finding and recognising objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_short Finding and recognising objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_sort finding and recognising objects in natural scenes complementary computations in the dorsal and ventral visual systems
topic object recognition
invariance
inferior temporal visual cortex
VisNet
saliency
trace learning rule
url http://journal.frontiersin.org/Journal/10.3389/fncom.2014.00085/full
work_keys_str_mv AT edmunderolls findingandrecognisingobjectsinnaturalscenescomplementarycomputationsinthedorsalandventralvisualsystems
AT tristanjameswebb findingandrecognisingobjectsinnaturalscenescomplementarycomputationsinthedorsalandventralvisualsystems