Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility

Auditory perception involves cues in the monaural auditory pathways, as well as binaural cues based on interaural differences. So far, auditory models have often focused on either monaural or binaural experiments in isolation. Although binaural models typically build upon stages of (existing) monaur...

Full description

Bibliographic Details
Main Authors: Biberger Thomas, Ewert Stephan D.
Format: Article
Language:English
Published: EDP Sciences 2022-01-01
Series:Acta Acustica
Subjects:
Online Access:https://acta-acustica.edpsciences.org/articles/aacus/full_html/2022/01/aacus210051/aacus210051.html
_version_ 1797713121068974080
author Biberger Thomas
Ewert Stephan D.
author_facet Biberger Thomas
Ewert Stephan D.
author_sort Biberger Thomas
collection DOAJ
description Auditory perception involves cues in the monaural auditory pathways, as well as binaural cues based on interaural differences. So far, auditory models have often focused on either monaural or binaural experiments in isolation. Although binaural models typically build upon stages of (existing) monaural models, only a few attempts have been made to extend a monaural model by a binaural stage using a unified decision stage for monaural and binaural cues. A typical prototype of binaural processing has been the classical equalization-cancelation mechanism, which either involves signal-adaptive delays and provides a single channel output, or can be implemented with tapped delays providing a high-dimensional multichannel output. This contribution extends the (monaural) generalized envelope power spectrum model by a non-adaptive binaural stage with only a few, fixed output channels. The binaural stage resembles features of physiologically motivated hemispheric binaural processing, as simplified signal-processing stages, yielding a 5-channel monaural and binaural matrix feature “decoder” (BMFD). The back end of the existing monaural model is applied to the BMFD output and calculates short-time envelope power and power features. The resulting model accounts for several published psychoacoustic and speech-intelligibility experiments and achieves a prediction performance comparable to existing state-of-the-art models with more complex binaural processing.
first_indexed 2024-03-12T07:31:47Z
format Article
id doaj.art-05f1c1e76472486289957fb2c50e1519
institution Directory Open Access Journal
issn 2681-4617
language English
last_indexed 2024-03-12T07:31:47Z
publishDate 2022-01-01
publisher EDP Sciences
record_format Article
series Acta Acustica
spelling doaj.art-05f1c1e76472486289957fb2c50e15192023-09-02T21:44:01ZengEDP SciencesActa Acustica2681-46172022-01-0162310.1051/aacus/2022018aacus210051Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibilityBiberger Thomas0https://orcid.org/0000-0002-6314-1914Ewert Stephan D.1https://orcid.org/0000-0002-1644-4947Medizinische Physik and Cluster of Excellence Hearing4all, Universität OldenburgMedizinische Physik and Cluster of Excellence Hearing4all, Universität OldenburgAuditory perception involves cues in the monaural auditory pathways, as well as binaural cues based on interaural differences. So far, auditory models have often focused on either monaural or binaural experiments in isolation. Although binaural models typically build upon stages of (existing) monaural models, only a few attempts have been made to extend a monaural model by a binaural stage using a unified decision stage for monaural and binaural cues. A typical prototype of binaural processing has been the classical equalization-cancelation mechanism, which either involves signal-adaptive delays and provides a single channel output, or can be implemented with tapped delays providing a high-dimensional multichannel output. This contribution extends the (monaural) generalized envelope power spectrum model by a non-adaptive binaural stage with only a few, fixed output channels. The binaural stage resembles features of physiologically motivated hemispheric binaural processing, as simplified signal-processing stages, yielding a 5-channel monaural and binaural matrix feature “decoder” (BMFD). The back end of the existing monaural model is applied to the BMFD output and calculates short-time envelope power and power features. The resulting model accounts for several published psychoacoustic and speech-intelligibility experiments and achieves a prediction performance comparable to existing state-of-the-art models with more complex binaural processing.https://acta-acustica.edpsciences.org/articles/aacus/full_html/2022/01/aacus210051/aacus210051.htmlauditory modelingpsychoacoustic maskingbinaural hearingspeech intelligibility
spellingShingle Biberger Thomas
Ewert Stephan D.
Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
Acta Acustica
auditory modeling
psychoacoustic masking
binaural hearing
speech intelligibility
title Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
title_full Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
title_fullStr Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
title_full_unstemmed Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
title_short Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
title_sort towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
topic auditory modeling
psychoacoustic masking
binaural hearing
speech intelligibility
url https://acta-acustica.edpsciences.org/articles/aacus/full_html/2022/01/aacus210051/aacus210051.html
work_keys_str_mv AT bibergerthomas towardsasimplifiedandgeneralizedmonauralandbinauralauditorymodelforpsychoacousticsandspeechintelligibility
AT ewertstephand towardsasimplifiedandgeneralizedmonauralandbinauralauditorymodelforpsychoacousticsandspeechintelligibility