The modulation transfer function for speech intelligibility.

We systematically determined which spectrotemporal modulations in speech are necessary for comprehension by human listeners. Speech comprehension has been shown to be robust to spectral and temporal degradations, but the specific relevance of particular degradations is arguable due to the complexity...

Full description

Bibliographic Details
Main Authors: Taffeta M Elliott, Frédéric E Theunissen
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2009-03-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC2639724?pdf=render
_version_ 1811281415616069632
author Taffeta M Elliott
Frédéric E Theunissen
author_facet Taffeta M Elliott
Frédéric E Theunissen
author_sort Taffeta M Elliott
collection DOAJ
description We systematically determined which spectrotemporal modulations in speech are necessary for comprehension by human listeners. Speech comprehension has been shown to be robust to spectral and temporal degradations, but the specific relevance of particular degradations is arguable due to the complexity of the joint spectral and temporal information in the speech signal. We applied a novel modulation filtering technique to recorded sentences to restrict acoustic information quantitatively and to obtain a joint spectrotemporal modulation transfer function for speech comprehension, the speech MTF. For American English, the speech MTF showed the criticality of low modulation frequencies in both time and frequency. Comprehension was significantly impaired when temporal modulations <12 Hz or spectral modulations <4 cycles/kHz were removed. More specifically, the MTF was bandpass in temporal modulations and low-pass in spectral modulations: temporal modulations from 1 to 7 Hz and spectral modulations <1 cycles/kHz were the most important. We evaluated the importance of spectrotemporal modulations for vocal gender identification and found a different region of interest: removing spectral modulations between 3 and 7 cycles/kHz significantly increases gender misidentifications of female speakers. The determination of the speech MTF furnishes an additional method for producing speech signals with reduced bandwidth but high intelligibility. Such compression could be used for audio applications such as file compression or noise removal and for clinical applications such as signal processing for cochlear implants.
first_indexed 2024-04-13T01:33:26Z
format Article
id doaj.art-99da7f12a2db4e87b0271a85c235447a
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-04-13T01:33:26Z
publishDate 2009-03-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-99da7f12a2db4e87b0271a85c235447a2022-12-22T03:08:27ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582009-03-0153e100030210.1371/journal.pcbi.1000302The modulation transfer function for speech intelligibility.Taffeta M ElliottFrédéric E TheunissenWe systematically determined which spectrotemporal modulations in speech are necessary for comprehension by human listeners. Speech comprehension has been shown to be robust to spectral and temporal degradations, but the specific relevance of particular degradations is arguable due to the complexity of the joint spectral and temporal information in the speech signal. We applied a novel modulation filtering technique to recorded sentences to restrict acoustic information quantitatively and to obtain a joint spectrotemporal modulation transfer function for speech comprehension, the speech MTF. For American English, the speech MTF showed the criticality of low modulation frequencies in both time and frequency. Comprehension was significantly impaired when temporal modulations <12 Hz or spectral modulations <4 cycles/kHz were removed. More specifically, the MTF was bandpass in temporal modulations and low-pass in spectral modulations: temporal modulations from 1 to 7 Hz and spectral modulations <1 cycles/kHz were the most important. We evaluated the importance of spectrotemporal modulations for vocal gender identification and found a different region of interest: removing spectral modulations between 3 and 7 cycles/kHz significantly increases gender misidentifications of female speakers. The determination of the speech MTF furnishes an additional method for producing speech signals with reduced bandwidth but high intelligibility. Such compression could be used for audio applications such as file compression or noise removal and for clinical applications such as signal processing for cochlear implants.http://europepmc.org/articles/PMC2639724?pdf=render
spellingShingle Taffeta M Elliott
Frédéric E Theunissen
The modulation transfer function for speech intelligibility.
PLoS Computational Biology
title The modulation transfer function for speech intelligibility.
title_full The modulation transfer function for speech intelligibility.
title_fullStr The modulation transfer function for speech intelligibility.
title_full_unstemmed The modulation transfer function for speech intelligibility.
title_short The modulation transfer function for speech intelligibility.
title_sort modulation transfer function for speech intelligibility
url http://europepmc.org/articles/PMC2639724?pdf=render
work_keys_str_mv AT taffetamelliott themodulationtransferfunctionforspeechintelligibility
AT fredericetheunissen themodulationtransferfunctionforspeechintelligibility
AT taffetamelliott modulationtransferfunctionforspeechintelligibility
AT fredericetheunissen modulationtransferfunctionforspeechintelligibility