An adaptive a priori SNR estimator for perceptual speech enhancement

Abstract In this paper, an adaptive averaging a priori SNR estimation employing critical band processing is proposed. The proposed method modifies the current decision-directed a priori SNR estimation to achieve faster tracking when SNR changes. The decision-directed estimator (DD) employs a fixed w...

Full description

Bibliographic Details
Main Authors: Lara Nahma, Pei Chee Yong, Hai Huyen Dam, Sven Nordholm
Format: Article
Language:English
Published: SpringerOpen 2019-06-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13636-019-0150-3
_version_ 1819045849843367936
author Lara Nahma
Pei Chee Yong
Hai Huyen Dam
Sven Nordholm
author_facet Lara Nahma
Pei Chee Yong
Hai Huyen Dam
Sven Nordholm
author_sort Lara Nahma
collection DOAJ
description Abstract In this paper, an adaptive averaging a priori SNR estimation employing critical band processing is proposed. The proposed method modifies the current decision-directed a priori SNR estimation to achieve faster tracking when SNR changes. The decision-directed estimator (DD) employs a fixed weighting with the value close to one, which makes it slow in following the onsets of speech utterances. The proposed SNR estimator provides a means to solve this issue by employing an adaptive weighting factor. This allows an improved tracking of onset changes in the speech signal. As a consequence, it results in better preservation of speech components. This adaptive technique ensures that the weighting between the modified decision-directed a priori estimate and the maximum likelihood a priori estimate is a function of the speech absence probability. The estimate of the speech absence probability is modeled by a sigmoid function. Furthermore, a critical band mapping for the short-time Fourier transform analysis-synthesis system is utilized in the speech enhancement to achieve less musical noise. In addition, to evaluate the ability of the a priori SNR estimation method in preserving speech components, we proposed a modified objective measurement known as modified hamming distance. Evaluations are performed by utilizing both objective and subjective measurements. The experimental results show that the proposed method improves the speech quality under different noise conditions. Moreover, it maintains the advantage of the DD approach in eliminating the musical noise under different SNR conditions. The objective results are supported by subjective listening tests using 10 subjects (5 males and 5 females).
first_indexed 2024-12-21T10:35:07Z
format Article
id doaj.art-9c97cdedb02d4f269181e828e0292573
institution Directory Open Access Journal
issn 1687-4722
language English
last_indexed 2024-12-21T10:35:07Z
publishDate 2019-06-01
publisher SpringerOpen
record_format Article
series EURASIP Journal on Audio, Speech, and Music Processing
spelling doaj.art-9c97cdedb02d4f269181e828e02925732022-12-21T19:07:05ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47222019-06-012019112010.1186/s13636-019-0150-3An adaptive a priori SNR estimator for perceptual speech enhancementLara Nahma0Pei Chee Yong1Hai Huyen Dam2Sven Nordholm3Department of Electrical Engineering, computing and Mathematical Sciences, Curtin UniversityNuheara Ltd.Department of Electrical Engineering, computing and Mathematical Sciences, Curtin UniversityDepartment of Electrical Engineering, computing and Mathematical Sciences, Curtin UniversityAbstract In this paper, an adaptive averaging a priori SNR estimation employing critical band processing is proposed. The proposed method modifies the current decision-directed a priori SNR estimation to achieve faster tracking when SNR changes. The decision-directed estimator (DD) employs a fixed weighting with the value close to one, which makes it slow in following the onsets of speech utterances. The proposed SNR estimator provides a means to solve this issue by employing an adaptive weighting factor. This allows an improved tracking of onset changes in the speech signal. As a consequence, it results in better preservation of speech components. This adaptive technique ensures that the weighting between the modified decision-directed a priori estimate and the maximum likelihood a priori estimate is a function of the speech absence probability. The estimate of the speech absence probability is modeled by a sigmoid function. Furthermore, a critical band mapping for the short-time Fourier transform analysis-synthesis system is utilized in the speech enhancement to achieve less musical noise. In addition, to evaluate the ability of the a priori SNR estimation method in preserving speech components, we proposed a modified objective measurement known as modified hamming distance. Evaluations are performed by utilizing both objective and subjective measurements. The experimental results show that the proposed method improves the speech quality under different noise conditions. Moreover, it maintains the advantage of the DD approach in eliminating the musical noise under different SNR conditions. The objective results are supported by subjective listening tests using 10 subjects (5 males and 5 females).http://link.springer.com/article/10.1186/s13636-019-0150-3Single-channel speech enhancementA priori SNR estimationDecision-directed approachAdaptive smoothing factorAuditory system
spellingShingle Lara Nahma
Pei Chee Yong
Hai Huyen Dam
Sven Nordholm
An adaptive a priori SNR estimator for perceptual speech enhancement
EURASIP Journal on Audio, Speech, and Music Processing
Single-channel speech enhancement
A priori SNR estimation
Decision-directed approach
Adaptive smoothing factor
Auditory system
title An adaptive a priori SNR estimator for perceptual speech enhancement
title_full An adaptive a priori SNR estimator for perceptual speech enhancement
title_fullStr An adaptive a priori SNR estimator for perceptual speech enhancement
title_full_unstemmed An adaptive a priori SNR estimator for perceptual speech enhancement
title_short An adaptive a priori SNR estimator for perceptual speech enhancement
title_sort adaptive a priori snr estimator for perceptual speech enhancement
topic Single-channel speech enhancement
A priori SNR estimation
Decision-directed approach
Adaptive smoothing factor
Auditory system
url http://link.springer.com/article/10.1186/s13636-019-0150-3
work_keys_str_mv AT laranahma anadaptiveapriorisnrestimatorforperceptualspeechenhancement
AT peicheeyong anadaptiveapriorisnrestimatorforperceptualspeechenhancement
AT haihuyendam anadaptiveapriorisnrestimatorforperceptualspeechenhancement
AT svennordholm anadaptiveapriorisnrestimatorforperceptualspeechenhancement
AT laranahma adaptiveapriorisnrestimatorforperceptualspeechenhancement
AT peicheeyong adaptiveapriorisnrestimatorforperceptualspeechenhancement
AT haihuyendam adaptiveapriorisnrestimatorforperceptualspeechenhancement
AT svennordholm adaptiveapriorisnrestimatorforperceptualspeechenhancement