Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian Prior

The most well-known conventional speech enhancement algorithms introduce unwanted artifact noise and speech distortion to the enhanced signal. Reducing the effects of such issues require more robust linear and non-linear estimators. This paper proposes new optimum linear and non-linear Laplacian dis...

Full description

Bibliographic Details
Main Authors: Basheera M. Mahmmod, Abd Rahman Ramli, Sadiq H. Abdulhussian, Syed Abdul Rahman Al-Haddad, Wissam A. Jassim
Format: Article
Language:English
Published: IEEE 2017-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/7914629/
_version_ 1818645006104133632
author Basheera M. Mahmmod
Abd Rahman Ramli
Sadiq H. Abdulhussian
Syed Abdul Rahman Al-Haddad
Wissam A. Jassim
author_facet Basheera M. Mahmmod
Abd Rahman Ramli
Sadiq H. Abdulhussian
Syed Abdul Rahman Al-Haddad
Wissam A. Jassim
author_sort Basheera M. Mahmmod
collection DOAJ
description The most well-known conventional speech enhancement algorithms introduce unwanted artifact noise and speech distortion to the enhanced signal. Reducing the effects of such issues require more robust linear and non-linear estimators. This paper proposes new optimum linear and non-linear Laplacian distribution-based estimators. The proposed estimators are derived based on a minimum mean squared error (MMSE) sense to minimize the distortion in different conditions of the underlying speech. Thus, artifact noise is reduced without compromising the noise reduction process. The analytical solutions of the Laplacian distribution-based estimators, linear bilateral Laplacian gain estimator (LBLG), and non-linear bilateral Laplacian gain estimator (NBLG), are presented. The proposed estimators are implemented in three steps. First, the observation signal is decorrelated through a real transform domain to obtain its transform coefficients. Second, the proposed estimators are applied to estimate the clean speech signal from the noisy signal in the decorrelated domain. Finally, the inverse of the real transform is applied to obtain the original speech signal in the time domain. Two conditions in these estimators account for interference events between the speech signal and noise coefficients in the decorrelated domain. Moreover, a mathematical aspect of mean square error of LBLG is evaluated, which presents a significant improvement over other methods. Furthermore, a comprehensive description of the whole variations of the LBLG and NBLG gains characteristics is presented. A comparative evaluation is performed with effective quality metrics, segmental signal-to-noise ratio and perceptual evaluation of speech quality, to demonstrate the advantage and effectiveness of the proposed estimators. The performance of the proposed estimators outperformed other methods, which are the traditional MMSE approach, perceptually motivated Bayesian estimator, dual gain Wiener estimator, and dual MMSE estimator in terms of different objective measurements.
first_indexed 2024-12-17T00:23:52Z
format Article
id doaj.art-6e30ebff4427435a93b22bbfc36ab596
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-17T00:23:52Z
publishDate 2017-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-6e30ebff4427435a93b22bbfc36ab5962022-12-21T22:10:30ZengIEEEIEEE Access2169-35362017-01-0159866988110.1109/ACCESS.2017.26997827914629Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian PriorBasheera M. Mahmmod0Abd Rahman Ramli1Sadiq H. Abdulhussian2https://orcid.org/0000-0002-6439-0082Syed Abdul Rahman Al-Haddad3Wissam A. Jassim4Department of Computer and Communication Systems Engineering, Universiti Putra Malaysia, Selangor, MalaysiaDepartment of Computer and Communication Systems Engineering, Universiti Putra Malaysia, Selangor, MalaysiaDepartment of Computer and Communication Systems Engineering, Universiti Putra Malaysia, Selangor, MalaysiaDepartment of Computer and Communication Systems Engineering, Universiti Putra Malaysia, Selangor, MalaysiaADAPT Center, School of Engineering, Trinity College Dublin, University of Dublin, Dublin, IrelandThe most well-known conventional speech enhancement algorithms introduce unwanted artifact noise and speech distortion to the enhanced signal. Reducing the effects of such issues require more robust linear and non-linear estimators. This paper proposes new optimum linear and non-linear Laplacian distribution-based estimators. The proposed estimators are derived based on a minimum mean squared error (MMSE) sense to minimize the distortion in different conditions of the underlying speech. Thus, artifact noise is reduced without compromising the noise reduction process. The analytical solutions of the Laplacian distribution-based estimators, linear bilateral Laplacian gain estimator (LBLG), and non-linear bilateral Laplacian gain estimator (NBLG), are presented. The proposed estimators are implemented in three steps. First, the observation signal is decorrelated through a real transform domain to obtain its transform coefficients. Second, the proposed estimators are applied to estimate the clean speech signal from the noisy signal in the decorrelated domain. Finally, the inverse of the real transform is applied to obtain the original speech signal in the time domain. Two conditions in these estimators account for interference events between the speech signal and noise coefficients in the decorrelated domain. Moreover, a mathematical aspect of mean square error of LBLG is evaluated, which presents a significant improvement over other methods. Furthermore, a comprehensive description of the whole variations of the LBLG and NBLG gains characteristics is presented. A comparative evaluation is performed with effective quality metrics, segmental signal-to-noise ratio and perceptual evaluation of speech quality, to demonstrate the advantage and effectiveness of the proposed estimators. The performance of the proposed estimators outperformed other methods, which are the traditional MMSE approach, perceptually motivated Bayesian estimator, dual gain Wiener estimator, and dual MMSE estimator in terms of different objective measurements.https://ieeexplore.ieee.org/document/7914629/Speech enhancementLaplacian distributionminimum mean square error estimatorspeech modelnoise model
spellingShingle Basheera M. Mahmmod
Abd Rahman Ramli
Sadiq H. Abdulhussian
Syed Abdul Rahman Al-Haddad
Wissam A. Jassim
Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian Prior
IEEE Access
Speech enhancement
Laplacian distribution
minimum mean square error estimator
speech model
noise model
title Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian Prior
title_full Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian Prior
title_fullStr Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian Prior
title_full_unstemmed Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian Prior
title_short Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian Prior
title_sort low distortion mmse speech enhancement estimator based on laplacian prior
topic Speech enhancement
Laplacian distribution
minimum mean square error estimator
speech model
noise model
url https://ieeexplore.ieee.org/document/7914629/
work_keys_str_mv AT basheerammahmmod lowdistortionmmsespeechenhancementestimatorbasedonlaplacianprior
AT abdrahmanramli lowdistortionmmsespeechenhancementestimatorbasedonlaplacianprior
AT sadiqhabdulhussian lowdistortionmmsespeechenhancementestimatorbasedonlaplacianprior
AT syedabdulrahmanalhaddad lowdistortionmmsespeechenhancementestimatorbasedonlaplacianprior
AT wissamajassim lowdistortionmmsespeechenhancementestimatorbasedonlaplacianprior