Two stages of bandwidth scaling drives efficient neural coding of natural sounds.

Theories of efficient coding propose that the auditory system is optimized for the statistical structure of natural sounds, yet the transformations underlying optimal acoustic representations are not well understood. Using a database of natural sounds including human speech and a physiologically-ins...

Full description

Bibliographic Details
Main Authors:	Fengrong He, Ian H Stevenson, Monty A Escabí
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2023-02-01
Series:	PLoS Computational Biology
Online Access:	https://doi.org/10.1371/journal.pcbi.1010862

_version_	1827966826266492928
author	Fengrong He Ian H Stevenson Monty A Escabí
author_facet	Fengrong He Ian H Stevenson Monty A Escabí
author_sort	Fengrong He
collection	DOAJ
description	Theories of efficient coding propose that the auditory system is optimized for the statistical structure of natural sounds, yet the transformations underlying optimal acoustic representations are not well understood. Using a database of natural sounds including human speech and a physiologically-inspired auditory model, we explore the consequences of peripheral (cochlear) and mid-level (auditory midbrain) filter tuning transformations on the representation of natural sound spectra and modulation statistics. Whereas Fourier-based sound decompositions have constant time-frequency resolution at all frequencies, cochlear and auditory midbrain filters bandwidths increase proportional to the filter center frequency. This form of bandwidth scaling produces a systematic decrease in spectral resolution and increase in temporal resolution with increasing frequency. Here we demonstrate that cochlear bandwidth scaling produces a frequency-dependent gain that counteracts the tendency of natural sound power to decrease with frequency, resulting in a whitened output representation. Similarly, bandwidth scaling in mid-level auditory filters further enhances the representation of natural sounds by producing a whitened modulation power spectrum (MPS) with higher modulation entropy than both the cochlear outputs and the conventional Fourier MPS. These findings suggest that the tuning characteristics of the peripheral and mid-level auditory system together produce a whitened output representation in three dimensions (frequency, temporal and spectral modulation) that reduces redundancies and allows for a more efficient use of neural resources. This hierarchical multi-stage tuning strategy is thus likely optimized to extract available information and may underlies perceptual sensitivity to natural sounds.
first_indexed	2024-04-09T17:55:47Z
format	Article
id	doaj.art-6549f77174024e7a808eb3e021f6535f
institution	Directory Open Access Journal
issn	1553-734X 1553-7358
language	English
last_indexed	2024-04-09T17:55:47Z
publishDate	2023-02-01
publisher	Public Library of Science (PLoS)
record_format	Article
series	PLoS Computational Biology
spelling	doaj.art-6549f77174024e7a808eb3e021f6535f2023-04-15T05:31:28ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582023-02-01192e101086210.1371/journal.pcbi.1010862Two stages of bandwidth scaling drives efficient neural coding of natural sounds.Fengrong HeIan H StevensonMonty A EscabíTheories of efficient coding propose that the auditory system is optimized for the statistical structure of natural sounds, yet the transformations underlying optimal acoustic representations are not well understood. Using a database of natural sounds including human speech and a physiologically-inspired auditory model, we explore the consequences of peripheral (cochlear) and mid-level (auditory midbrain) filter tuning transformations on the representation of natural sound spectra and modulation statistics. Whereas Fourier-based sound decompositions have constant time-frequency resolution at all frequencies, cochlear and auditory midbrain filters bandwidths increase proportional to the filter center frequency. This form of bandwidth scaling produces a systematic decrease in spectral resolution and increase in temporal resolution with increasing frequency. Here we demonstrate that cochlear bandwidth scaling produces a frequency-dependent gain that counteracts the tendency of natural sound power to decrease with frequency, resulting in a whitened output representation. Similarly, bandwidth scaling in mid-level auditory filters further enhances the representation of natural sounds by producing a whitened modulation power spectrum (MPS) with higher modulation entropy than both the cochlear outputs and the conventional Fourier MPS. These findings suggest that the tuning characteristics of the peripheral and mid-level auditory system together produce a whitened output representation in three dimensions (frequency, temporal and spectral modulation) that reduces redundancies and allows for a more efficient use of neural resources. This hierarchical multi-stage tuning strategy is thus likely optimized to extract available information and may underlies perceptual sensitivity to natural sounds.https://doi.org/10.1371/journal.pcbi.1010862
spellingShingle	Fengrong He Ian H Stevenson Monty A Escabí Two stages of bandwidth scaling drives efficient neural coding of natural sounds. PLoS Computational Biology
title	Two stages of bandwidth scaling drives efficient neural coding of natural sounds.
title_full	Two stages of bandwidth scaling drives efficient neural coding of natural sounds.
title_fullStr	Two stages of bandwidth scaling drives efficient neural coding of natural sounds.
title_full_unstemmed	Two stages of bandwidth scaling drives efficient neural coding of natural sounds.
title_short	Two stages of bandwidth scaling drives efficient neural coding of natural sounds.
title_sort	two stages of bandwidth scaling drives efficient neural coding of natural sounds
url	https://doi.org/10.1371/journal.pcbi.1010862
work_keys_str_mv	AT fengronghe twostagesofbandwidthscalingdrivesefficientneuralcodingofnaturalsounds AT ianhstevenson twostagesofbandwidthscalingdrivesefficientneuralcodingofnaturalsounds AT montyaescabi twostagesofbandwidthscalingdrivesefficientneuralcodingofnaturalsounds

Two stages of bandwidth scaling drives efficient neural coding of natural sounds.

Similar Items