Two-channel noise reduction and post-processing for speech enhancement

This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the β-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (β-masking in short). Two post-pr...


Bibliographic Details
Main Author: Zhang, Xinxin
Other Authors: Koh Soo Ngee
Format: Thesis
Published: 2008
Subjects: DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Online Access: https://hdl.handle.net/10356/3522
collection NTU
description This thesis focuses on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the β-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (β-masking for short). Two post-processing techniques are proposed to improve the quality of β-masking enhanced speech signals. One uses non-linear high-frequency regeneration, in which lower-band spectral information is used to re-synthesize the upper-band spectral structure. The other re-synthesizes the weak spectral components using the autocorrelations of the strong spectral components. In addition to these single-channel methods, a two-channel speech enhancement method for communication in a car environment is studied. To achieve better performance, the single-channel β-masking technique is incorporated within the two-channel enhancement system. The resulting output speech has low background noise and very low distortion of the speech components, giving satisfactory overall enhancement performance.
first_indexed 2024-10-01T06:03:42Z
format Thesis
id ntu-10356/3522
institution Nanyang Technological University
last_indexed 2024-10-01T06:03:42Z
publishDate 2008
record_format dspace
spelling ntu-10356/3522; 2023-07-04T16:41:57Z; Two-channel noise reduction and post-processing for speech enhancement; Zhang, Xinxin; Koh Soo Ngee; School of Electrical and Electronic Engineering; DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing; MASTER OF ENGINEERING (EEE); 2008-09-17T09:31:33Z; 2008; Thesis; Zhang, X. (2008). Two-channel noise reduction and post-processing for speech enhancement. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/3522; 10.32657/10356/3522; Nanyang Technological University; application/pdf
title Two-channel noise reduction and post-processing for speech enhancement
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
url https://hdl.handle.net/10356/3522