Improving Speech Intelligibility Using Ideal Binary Mask

Background: The application of the ideal binary mask (IBM) for speech signal processing provides remarkable intelligibility improvements in both normal-hearing and hearing-impaired listeners. Binary mask widely applies to the time-frequency (T–F) representation of a noisy signal and eliminates units...

Full description

Bibliographic Details
Main Authors: Nader Naseri, Saeid Kermani
Format: Article
Language:fas
Published: Isfahan University of Medical Sciences 2014-01-01
Series:مجله دانشکده پزشکی اصفهان
Subjects:
Online Access:http://jims.mui.ac.ir/index.php/jims/article/view/3366
_version_ 1797719134581030912
author Nader Naseri
Saeid Kermani
author_facet Nader Naseri
Saeid Kermani
author_sort Nader Naseri
collection DOAJ
description Background: The application of the ideal binary mask (IBM) for speech signal processing provides remarkable intelligibility improvements in both normal-hearing and hearing-impaired listeners. Binary mask widely applies to the time-frequency (T–F) representation of a noisy signal and eliminates units of a signal below a signal-to-noise-ratio (SNR) threshold while retains others. Methods: The factors underlying intelligibility of ideal binary-masked speech were examined and evaluated in the present study. The effects of the local SNR threshold, input SNR level, masker type, and ideal mask-estimator were examined. New estimators including weighted Euclidean and COSH were proposed in which, the human perceptual auditory masking effect and perceptual perception were incorporated. Findings: High-performance plateau for SNR thresholds ranging from −20 to 5 dB was observed. Findings could be used for hearing-aid and cochlear-implant designs. Conclusion: Intelligibility of speech was high even at −10 dB SNR for all maskers tested. Performance assessment shows that our proposed estimators can achieve more significant noise estimation as compared to the Wiener estimator.
first_indexed 2024-03-12T09:00:21Z
format Article
id doaj.art-14013d5893e8406b91c1fa97a5dc12c6
institution Directory Open Access Journal
issn 1027-7595
1735-854X
language fas
last_indexed 2024-03-12T09:00:21Z
publishDate 2014-01-01
publisher Isfahan University of Medical Sciences
record_format Article
series مجله دانشکده پزشکی اصفهان
spelling doaj.art-14013d5893e8406b91c1fa97a5dc12c62023-09-02T15:41:27ZfasIsfahan University of Medical Sciencesمجله دانشکده پزشکی اصفهان1027-75951735-854X2014-01-0131259178717961453Improving Speech Intelligibility Using Ideal Binary MaskNader Naseri0Saeid Kermani1MSc Student, Department of Medical Physics and Medical Engineering, School of Medicine AND Student Research Committee, Isfahan University of Medical Sciences, Isfahan, IranAssistant Professor, Department of Medical Physics and Medical Engineering, School of Medicine, Isfahan University of Medical Sciences, Isfahan, IranBackground: The application of the ideal binary mask (IBM) for speech signal processing provides remarkable intelligibility improvements in both normal-hearing and hearing-impaired listeners. Binary mask widely applies to the time-frequency (T–F) representation of a noisy signal and eliminates units of a signal below a signal-to-noise-ratio (SNR) threshold while retains others. Methods: The factors underlying intelligibility of ideal binary-masked speech were examined and evaluated in the present study. The effects of the local SNR threshold, input SNR level, masker type, and ideal mask-estimator were examined. New estimators including weighted Euclidean and COSH were proposed in which, the human perceptual auditory masking effect and perceptual perception were incorporated. Findings: High-performance plateau for SNR thresholds ranging from −20 to 5 dB was observed. Findings could be used for hearing-aid and cochlear-implant designs. Conclusion: Intelligibility of speech was high even at −10 dB SNR for all maskers tested. Performance assessment shows that our proposed estimators can achieve more significant noise estimation as compared to the Wiener estimator.http://jims.mui.ac.ir/index.php/jims/article/view/3366Speech enhancementBinary maskingSpeech intelligibility
spellingShingle Nader Naseri
Saeid Kermani
Improving Speech Intelligibility Using Ideal Binary Mask
مجله دانشکده پزشکی اصفهان
Speech enhancement
Binary masking
Speech intelligibility
title Improving Speech Intelligibility Using Ideal Binary Mask
title_full Improving Speech Intelligibility Using Ideal Binary Mask
title_fullStr Improving Speech Intelligibility Using Ideal Binary Mask
title_full_unstemmed Improving Speech Intelligibility Using Ideal Binary Mask
title_short Improving Speech Intelligibility Using Ideal Binary Mask
title_sort improving speech intelligibility using ideal binary mask
topic Speech enhancement
Binary masking
Speech intelligibility
url http://jims.mui.ac.ir/index.php/jims/article/view/3366
work_keys_str_mv AT nadernaseri improvingspeechintelligibilityusingidealbinarymask
AT saeidkermani improvingspeechintelligibilityusingidealbinarymask