Visual number sense for real-world scenes shared by deep neural networks and humans

Recently, visual number sense has been identified from deep neural networks (DNNs). However, whether DNNs have the same capacity for real-world scenes, rather than the simple geometric figures that are often tested, is unclear. In this study, we explore the number perception of scenes using AlexNet...

Full description

Bibliographic Details
Main Authors: Wu Wencheng, Yingxi Ge, Zhentao Zuo, Lin Chen, Xu Qin, Liu Zuxiang
Format: Article
Language:English
Published: Elsevier 2023-08-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405844023057250
_version_ 1797732972042911744
author Wu Wencheng
Yingxi Ge
Zhentao Zuo
Lin Chen
Xu Qin
Liu Zuxiang
author_facet Wu Wencheng
Yingxi Ge
Zhentao Zuo
Lin Chen
Xu Qin
Liu Zuxiang
author_sort Wu Wencheng
collection DOAJ
description Recently, visual number sense has been identified from deep neural networks (DNNs). However, whether DNNs have the same capacity for real-world scenes, rather than the simple geometric figures that are often tested, is unclear. In this study, we explore the number perception of scenes using AlexNet and find that numerosity can be represented by the pattern of group activation of the category layer units. The global activation of these units increases with the number of objects in the scene, and the variations in their activation decrease accordingly. By decoding the numerosity from this pattern, we reveal that the embedding coefficient of a scene determines the likelihood of potential objects to contribute to numerical perception. This was demonstrated by the more optimized performance for pictures with relatively high embedding coefficients in both DNNs and humans. This study for the first time shows that a distinct feature in visual environments, revealed by DNNs, can modulate human perception, supported by a group-coding mechanism.
first_indexed 2024-03-12T12:22:14Z
format Article
id doaj.art-0331c471ae1d40ffb4e8c5d28050d76a
institution Directory Open Access Journal
issn 2405-8440
language English
last_indexed 2024-03-12T12:22:14Z
publishDate 2023-08-01
publisher Elsevier
record_format Article
series Heliyon
spelling doaj.art-0331c471ae1d40ffb4e8c5d28050d76a2023-08-30T05:51:36ZengElsevierHeliyon2405-84402023-08-0198e18517Visual number sense for real-world scenes shared by deep neural networks and humansWu Wencheng0Yingxi Ge1Zhentao Zuo2Lin Chen3Xu Qin4Liu Zuxiang5AHU-IAI AI Joint Laboratory, Anhui University, Hefei, 230601, China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, ChinaState Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing, 100101, China; CAS Center for Excellence in Brain Science and Intelligence Technology, China; University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing, 100049, ChinaInstitute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, China; State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing, 100101, China; CAS Center for Excellence in Brain Science and Intelligence Technology, China; University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing, 100049, ChinaInstitute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, China; State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing, 100101, China; CAS Center for Excellence in Brain Science and Intelligence Technology, China; University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing, 100049, ChinaKey Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Hefei, 230601, China; Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University, Hefei, 230601, China; School of Computer Science and Technology, Anhui University, Hefei 230601, China; Corresponding author. School of Computer Science and Technology, Anhui University, Hefei 230601, China.Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, China; State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing, 100101, China; CAS Center for Excellence in Brain Science and Intelligence Technology, China; University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing, 100049, China; Corresponding author. State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing 100101, China.Recently, visual number sense has been identified from deep neural networks (DNNs). However, whether DNNs have the same capacity for real-world scenes, rather than the simple geometric figures that are often tested, is unclear. In this study, we explore the number perception of scenes using AlexNet and find that numerosity can be represented by the pattern of group activation of the category layer units. The global activation of these units increases with the number of objects in the scene, and the variations in their activation decrease accordingly. By decoding the numerosity from this pattern, we reveal that the embedding coefficient of a scene determines the likelihood of potential objects to contribute to numerical perception. This was demonstrated by the more optimized performance for pictures with relatively high embedding coefficients in both DNNs and humans. This study for the first time shows that a distinct feature in visual environments, revealed by DNNs, can modulate human perception, supported by a group-coding mechanism.http://www.sciencedirect.com/science/article/pii/S2405844023057250Number senseDeep neural networkReal-world sceneGroup codingEmbedded representation
spellingShingle Wu Wencheng
Yingxi Ge
Zhentao Zuo
Lin Chen
Xu Qin
Liu Zuxiang
Visual number sense for real-world scenes shared by deep neural networks and humans
Heliyon
Number sense
Deep neural network
Real-world scene
Group coding
Embedded representation
title Visual number sense for real-world scenes shared by deep neural networks and humans
title_full Visual number sense for real-world scenes shared by deep neural networks and humans
title_fullStr Visual number sense for real-world scenes shared by deep neural networks and humans
title_full_unstemmed Visual number sense for real-world scenes shared by deep neural networks and humans
title_short Visual number sense for real-world scenes shared by deep neural networks and humans
title_sort visual number sense for real world scenes shared by deep neural networks and humans
topic Number sense
Deep neural network
Real-world scene
Group coding
Embedded representation
url http://www.sciencedirect.com/science/article/pii/S2405844023057250
work_keys_str_mv AT wuwencheng visualnumbersenseforrealworldscenessharedbydeepneuralnetworksandhumans
AT yingxige visualnumbersenseforrealworldscenessharedbydeepneuralnetworksandhumans
AT zhentaozuo visualnumbersenseforrealworldscenessharedbydeepneuralnetworksandhumans
AT linchen visualnumbersenseforrealworldscenessharedbydeepneuralnetworksandhumans
AT xuqin visualnumbersenseforrealworldscenessharedbydeepneuralnetworksandhumans
AT liuzuxiang visualnumbersenseforrealworldscenessharedbydeepneuralnetworksandhumans