Improving Speech Intelligibility Using Ideal Binary Mask
Background: The application of the ideal binary mask (IBM) for speech signal processing provides remarkable intelligibility improvements in both normal-hearing and hearing-impaired listeners. Binary mask widely applies to the time-frequency (T–F) representation of a noisy signal and eliminates units...
Main Authors: | , |
---|---|
Format: | Article |
Language: | fas |
Published: |
Isfahan University of Medical Sciences
2014-01-01
|
Series: | مجله دانشکده پزشکی اصفهان |
Subjects: | |
Online Access: | http://jims.mui.ac.ir/index.php/jims/article/view/3366 |
_version_ | 1797719134581030912 |
---|---|
author | Nader Naseri Saeid Kermani |
author_facet | Nader Naseri Saeid Kermani |
author_sort | Nader Naseri |
collection | DOAJ |
description | Background: The application of the ideal binary mask (IBM) for speech signal processing provides remarkable intelligibility improvements in both normal-hearing and hearing-impaired listeners. Binary mask widely applies to the time-frequency (T–F) representation of a noisy signal and eliminates units of a signal below a signal-to-noise-ratio (SNR) threshold while retains others.
Methods: The factors underlying intelligibility of ideal binary-masked speech were examined and evaluated in the present study. The effects of the local SNR threshold, input SNR level, masker type, and ideal mask-estimator were examined. New estimators including weighted Euclidean and COSH were proposed in which, the human perceptual auditory masking effect and perceptual perception were incorporated.
Findings: High-performance plateau for SNR thresholds ranging from −20 to 5 dB was observed. Findings could be used for hearing-aid and cochlear-implant designs.
Conclusion: Intelligibility of speech was high even at −10 dB SNR for all maskers tested. Performance assessment shows that our proposed estimators can achieve more significant noise estimation as compared to the Wiener estimator. |
first_indexed | 2024-03-12T09:00:21Z |
format | Article |
id | doaj.art-14013d5893e8406b91c1fa97a5dc12c6 |
institution | Directory Open Access Journal |
issn | 1027-7595 1735-854X |
language | fas |
last_indexed | 2024-03-12T09:00:21Z |
publishDate | 2014-01-01 |
publisher | Isfahan University of Medical Sciences |
record_format | Article |
series | مجله دانشکده پزشکی اصفهان |
spelling | doaj.art-14013d5893e8406b91c1fa97a5dc12c62023-09-02T15:41:27ZfasIsfahan University of Medical Sciencesمجله دانشکده پزشکی اصفهان1027-75951735-854X2014-01-0131259178717961453Improving Speech Intelligibility Using Ideal Binary MaskNader Naseri0Saeid Kermani1MSc Student, Department of Medical Physics and Medical Engineering, School of Medicine AND Student Research Committee, Isfahan University of Medical Sciences, Isfahan, IranAssistant Professor, Department of Medical Physics and Medical Engineering, School of Medicine, Isfahan University of Medical Sciences, Isfahan, IranBackground: The application of the ideal binary mask (IBM) for speech signal processing provides remarkable intelligibility improvements in both normal-hearing and hearing-impaired listeners. Binary mask widely applies to the time-frequency (T–F) representation of a noisy signal and eliminates units of a signal below a signal-to-noise-ratio (SNR) threshold while retains others. Methods: The factors underlying intelligibility of ideal binary-masked speech were examined and evaluated in the present study. The effects of the local SNR threshold, input SNR level, masker type, and ideal mask-estimator were examined. New estimators including weighted Euclidean and COSH were proposed in which, the human perceptual auditory masking effect and perceptual perception were incorporated. Findings: High-performance plateau for SNR thresholds ranging from −20 to 5 dB was observed. Findings could be used for hearing-aid and cochlear-implant designs. Conclusion: Intelligibility of speech was high even at −10 dB SNR for all maskers tested. Performance assessment shows that our proposed estimators can achieve more significant noise estimation as compared to the Wiener estimator.http://jims.mui.ac.ir/index.php/jims/article/view/3366Speech enhancementBinary maskingSpeech intelligibility |
spellingShingle | Nader Naseri Saeid Kermani Improving Speech Intelligibility Using Ideal Binary Mask مجله دانشکده پزشکی اصفهان Speech enhancement Binary masking Speech intelligibility |
title | Improving Speech Intelligibility Using Ideal Binary Mask |
title_full | Improving Speech Intelligibility Using Ideal Binary Mask |
title_fullStr | Improving Speech Intelligibility Using Ideal Binary Mask |
title_full_unstemmed | Improving Speech Intelligibility Using Ideal Binary Mask |
title_short | Improving Speech Intelligibility Using Ideal Binary Mask |
title_sort | improving speech intelligibility using ideal binary mask |
topic | Speech enhancement Binary masking Speech intelligibility |
url | http://jims.mui.ac.ir/index.php/jims/article/view/3366 |
work_keys_str_mv | AT nadernaseri improvingspeechintelligibilityusingidealbinarymask AT saeidkermani improvingspeechintelligibilityusingidealbinarymask |