Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
Auditory perception involves cues in the monaural auditory pathways, as well as binaural cues based on interaural differences. So far, auditory models have often focused on either monaural or binaural experiments in isolation. Although binaural models typically build upon stages of (existing) monaur...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
EDP Sciences
2022-01-01
|
Series: | Acta Acustica |
Subjects: | |
Online Access: | https://acta-acustica.edpsciences.org/articles/aacus/full_html/2022/01/aacus210051/aacus210051.html |
_version_ | 1797713121068974080 |
---|---|
author | Biberger Thomas Ewert Stephan D. |
author_facet | Biberger Thomas Ewert Stephan D. |
author_sort | Biberger Thomas |
collection | DOAJ |
description | Auditory perception involves cues in the monaural auditory pathways, as well as binaural cues based on interaural differences. So far, auditory models have often focused on either monaural or binaural experiments in isolation. Although binaural models typically build upon stages of (existing) monaural models, only a few attempts have been made to extend a monaural model by a binaural stage using a unified decision stage for monaural and binaural cues. A typical prototype of binaural processing has been the classical equalization-cancelation mechanism, which either involves signal-adaptive delays and provides a single channel output, or can be implemented with tapped delays providing a high-dimensional multichannel output. This contribution extends the (monaural) generalized envelope power spectrum model by a non-adaptive binaural stage with only a few, fixed output channels. The binaural stage resembles features of physiologically motivated hemispheric binaural processing, as simplified signal-processing stages, yielding a 5-channel monaural and binaural matrix feature “decoder” (BMFD). The back end of the existing monaural model is applied to the BMFD output and calculates short-time envelope power and power features. The resulting model accounts for several published psychoacoustic and speech-intelligibility experiments and achieves a prediction performance comparable to existing state-of-the-art models with more complex binaural processing. |
first_indexed | 2024-03-12T07:31:47Z |
format | Article |
id | doaj.art-05f1c1e76472486289957fb2c50e1519 |
institution | Directory Open Access Journal |
issn | 2681-4617 |
language | English |
last_indexed | 2024-03-12T07:31:47Z |
publishDate | 2022-01-01 |
publisher | EDP Sciences |
record_format | Article |
series | Acta Acustica |
spelling | doaj.art-05f1c1e76472486289957fb2c50e15192023-09-02T21:44:01ZengEDP SciencesActa Acustica2681-46172022-01-0162310.1051/aacus/2022018aacus210051Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibilityBiberger Thomas0https://orcid.org/0000-0002-6314-1914Ewert Stephan D.1https://orcid.org/0000-0002-1644-4947Medizinische Physik and Cluster of Excellence Hearing4all, Universität OldenburgMedizinische Physik and Cluster of Excellence Hearing4all, Universität OldenburgAuditory perception involves cues in the monaural auditory pathways, as well as binaural cues based on interaural differences. So far, auditory models have often focused on either monaural or binaural experiments in isolation. Although binaural models typically build upon stages of (existing) monaural models, only a few attempts have been made to extend a monaural model by a binaural stage using a unified decision stage for monaural and binaural cues. A typical prototype of binaural processing has been the classical equalization-cancelation mechanism, which either involves signal-adaptive delays and provides a single channel output, or can be implemented with tapped delays providing a high-dimensional multichannel output. This contribution extends the (monaural) generalized envelope power spectrum model by a non-adaptive binaural stage with only a few, fixed output channels. The binaural stage resembles features of physiologically motivated hemispheric binaural processing, as simplified signal-processing stages, yielding a 5-channel monaural and binaural matrix feature “decoder” (BMFD). The back end of the existing monaural model is applied to the BMFD output and calculates short-time envelope power and power features. The resulting model accounts for several published psychoacoustic and speech-intelligibility experiments and achieves a prediction performance comparable to existing state-of-the-art models with more complex binaural processing.https://acta-acustica.edpsciences.org/articles/aacus/full_html/2022/01/aacus210051/aacus210051.htmlauditory modelingpsychoacoustic maskingbinaural hearingspeech intelligibility |
spellingShingle | Biberger Thomas Ewert Stephan D. Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility Acta Acustica auditory modeling psychoacoustic masking binaural hearing speech intelligibility |
title | Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility |
title_full | Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility |
title_fullStr | Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility |
title_full_unstemmed | Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility |
title_short | Towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility |
title_sort | towards a simplified and generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility |
topic | auditory modeling psychoacoustic masking binaural hearing speech intelligibility |
url | https://acta-acustica.edpsciences.org/articles/aacus/full_html/2022/01/aacus210051/aacus210051.html |
work_keys_str_mv | AT bibergerthomas towardsasimplifiedandgeneralizedmonauralandbinauralauditorymodelforpsychoacousticsandspeechintelligibility AT ewertstephand towardsasimplifiedandgeneralizedmonauralandbinauralauditorymodelforpsychoacousticsandspeechintelligibility |