Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons

In order to better understand how the brain perceives faces, it is important to know what objective drives learning in the ventral visual stream. To answer this question, we model neural responses to faces in the macaque inferotemporal (IT) cortex with a deep self-supervised generative model, β-VAE,...

Full description

Bibliographic Details
Main Authors: Higgins, I, Chang, L, Langston, V, Hassabis, D, Summerfield, C, Tsao, D, Botvinick, M
Format: Journal article
Language:English
Published: Springer Nature 2021
_version_ 1797077472235225088
author Higgins, I
Chang, L
Langston, V
Hassabis, D
Summerfield, C
Tsao, D
Botvinick, M
author_facet Higgins, I
Chang, L
Langston, V
Hassabis, D
Summerfield, C
Tsao, D
Botvinick, M
author_sort Higgins, I
collection OXFORD
description In order to better understand how the brain perceives faces, it is important to know what objective drives learning in the ventral visual stream. To answer this question, we model neural responses to faces in the macaque inferotemporal (IT) cortex with a deep self-supervised generative model, β-VAE, which disentangles sensory data into interpretable latent factors, such as gender or age. Our results demonstrate a strong correspondence between the generative factors discovered by β-VAE and those coded by single IT neurons, beyond that found for the baselines, including the handcrafted state-of-the-art model of face perception, the Active Appearance Model, and deep classifiers. Moreover, β-VAE is able to reconstruct novel face images using signals from just a handful of cells. Together our results imply that optimising the disentangling objective leads to representations that closely resemble those in the IT at the single unit level. This points at disentangling as a plausible learning objective for the visual brain.
first_indexed 2024-03-07T00:18:31Z
format Journal article
id oxford-uuid:7bb0019a-9a38-44f8-99e4-ec2ce8a6f4a6
institution University of Oxford
language English
last_indexed 2024-03-07T00:18:31Z
publishDate 2021
publisher Springer Nature
record_format dspace
spelling oxford-uuid:7bb0019a-9a38-44f8-99e4-ec2ce8a6f4a62022-03-26T20:52:15ZUnsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neuronsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:7bb0019a-9a38-44f8-99e4-ec2ce8a6f4a6EnglishSymplectic ElementsSpringer Nature2021Higgins, IChang, LLangston, VHassabis, DSummerfield, CTsao, DBotvinick, MIn order to better understand how the brain perceives faces, it is important to know what objective drives learning in the ventral visual stream. To answer this question, we model neural responses to faces in the macaque inferotemporal (IT) cortex with a deep self-supervised generative model, β-VAE, which disentangles sensory data into interpretable latent factors, such as gender or age. Our results demonstrate a strong correspondence between the generative factors discovered by β-VAE and those coded by single IT neurons, beyond that found for the baselines, including the handcrafted state-of-the-art model of face perception, the Active Appearance Model, and deep classifiers. Moreover, β-VAE is able to reconstruct novel face images using signals from just a handful of cells. Together our results imply that optimising the disentangling objective leads to representations that closely resemble those in the IT at the single unit level. This points at disentangling as a plausible learning objective for the visual brain.
spellingShingle Higgins, I
Chang, L
Langston, V
Hassabis, D
Summerfield, C
Tsao, D
Botvinick, M
Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_full Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_fullStr Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_full_unstemmed Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_short Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_sort unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
work_keys_str_mv AT higginsi unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT changl unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT langstonv unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT hassabisd unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT summerfieldc unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT tsaod unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT botvinickm unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons