Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream?

Neural computations along the ventral visual stream, -- which culminates in the inferior temporal (IT) cortex -- enable humans and monkeys to recognize objects quickly. Primate IT is organized topographically: nearby neurons have similar response properties. Yet the best models of the ventral visual...

Full description

Bibliographic Details
Main Authors: Jozwik, Kamila Maria, Lee, Hyo-Dong, Kanwisher, Nancy, DiCarlo, James
Other Authors: McGovern Institute for Brain Research at MIT
Format: Article
Language:English
Published: Cognitive Computational Neuroscience 2021
Online Access:https://hdl.handle.net/1721.1/130332
_version_ 1826211175907459072
author Jozwik, Kamila Maria
Lee, Hyo-Dong
Kanwisher, Nancy
DiCarlo, James
author2 McGovern Institute for Brain Research at MIT
author_facet McGovern Institute for Brain Research at MIT
Jozwik, Kamila Maria
Lee, Hyo-Dong
Kanwisher, Nancy
DiCarlo, James
author_sort Jozwik, Kamila Maria
collection MIT
description Neural computations along the ventral visual stream, -- which culminates in the inferior temporal (IT) cortex -- enable humans and monkeys to recognize objects quickly. Primate IT is organized topographically: nearby neurons have similar response properties. Yet the best models of the ventral visual stream - deep artificial neural networks (ANNs) – have “IT” layers that lack topography. We built Topographic Deep ANNs (TDANNs) by incorporating a proxy wiring cost alongside the standard ImageNet categorization cost in the two “IT-like” layers of AlexNet (Lee et al., 2018), by specifying that “neurons” that have similar response properties should be physically close to each other. This cost both induced topographic structure and altered tuning characteristics of model IT neurons. We presented 2560 naturalistic images to monkeys and to ANNs. We found that, relative to the base (nontopographic) model, the “neurons” in the “IT” layer of some of the TDANN models matched actual IT neurons slightly better, and the dimensionality of the TDANN “IT” neural population was much closer to that of the measured monkey IT neural population. We also found that, while TDANNs did not show a statistically significant better match to human object discrimination behavior, detailed analysis suggests a trend in that direction. Taken together, TDANNs may better capture properties of IT cortex and wiring costs might be the cause of topographic organization in primate IT.
first_indexed 2024-09-23T15:01:47Z
format Article
id mit-1721.1/130332
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T15:01:47Z
publishDate 2021
publisher Cognitive Computational Neuroscience
record_format dspace
spelling mit-1721.1/1303322022-10-02T00:07:05Z Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream? Jozwik, Kamila Maria Lee, Hyo-Dong Kanwisher, Nancy DiCarlo, James McGovern Institute for Brain Research at MIT Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Neural computations along the ventral visual stream, -- which culminates in the inferior temporal (IT) cortex -- enable humans and monkeys to recognize objects quickly. Primate IT is organized topographically: nearby neurons have similar response properties. Yet the best models of the ventral visual stream - deep artificial neural networks (ANNs) – have “IT” layers that lack topography. We built Topographic Deep ANNs (TDANNs) by incorporating a proxy wiring cost alongside the standard ImageNet categorization cost in the two “IT-like” layers of AlexNet (Lee et al., 2018), by specifying that “neurons” that have similar response properties should be physically close to each other. This cost both induced topographic structure and altered tuning characteristics of model IT neurons. We presented 2560 naturalistic images to monkeys and to ANNs. We found that, relative to the base (nontopographic) model, the “neurons” in the “IT” layer of some of the TDANN models matched actual IT neurons slightly better, and the dimensionality of the TDANN “IT” neural population was much closer to that of the measured monkey IT neural population. We also found that, while TDANNs did not show a statistically significant better match to human object discrimination behavior, detailed analysis suggests a trend in that direction. Taken together, TDANNs may better capture properties of IT cortex and wiring costs might be the cause of topographic organization in primate IT. Wellcome Trust (Award 206521/Z/17/Z) National Institutes of Health (Grant DP1HD091947) Simons Foundation (Grants SCGB 325500, 542965) NSF (Award CCF-1231216) 2021-04-01T15:13:07Z 2021-04-01T15:13:07Z 2019-12 2019-09 2021-03-29T16:15:20Z Article http://purl.org/eprint/type/ConferencePaper https://hdl.handle.net/1721.1/130332 Jozwik, Kamila Maria et al. "Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream?" 2019 Conference on Cognitive Computational Neuroscience, September 2019, Berlin, Germany, Cognitive Computational Neuroscience, December 2019. en http://dx.doi.org/10.32470/ccn.2019.1019-0 2019 Conference on Cognitive Computational Neuroscience Creative Commons Attribution 3.0 unported license https://creativecommons.org/licenses/by/3.0/ application/pdf Cognitive Computational Neuroscience Cognitive Computational Neuroscience
spellingShingle Jozwik, Kamila Maria
Lee, Hyo-Dong
Kanwisher, Nancy
DiCarlo, James
Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream?
title Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream?
title_full Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream?
title_fullStr Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream?
title_full_unstemmed Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream?
title_short Are Topographic Deep Convolutional Neural Networks Better Models of the Ventral Visual Stream?
title_sort are topographic deep convolutional neural networks better models of the ventral visual stream
url https://hdl.handle.net/1721.1/130332
work_keys_str_mv AT jozwikkamilamaria aretopographicdeepconvolutionalneuralnetworksbettermodelsoftheventralvisualstream
AT leehyodong aretopographicdeepconvolutionalneuralnetworksbettermodelsoftheventralvisualstream
AT kanwishernancy aretopographicdeepconvolutionalneuralnetworksbettermodelsoftheventralvisualstream
AT dicarlojames aretopographicdeepconvolutionalneuralnetworksbettermodelsoftheventralvisualstream