Two-channel noise reduction and post-processing for speech enhancement
This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the β-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (β-masking in short). Two post-pr...
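Main Author: | Zhang, Xinxin |
---|---|
Other Authors: | Koh Soo Ngee |
Format: | Thesis |
Published: | 2008 |
Subjects: | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
Online Access: | https://hdl.handle.net/10356/3522 |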
_version_ | 1826123378197528576 |
---|---|
author | Zhang, Xinxin |
author2 | Koh Soo Ngee |
collection | NTU |
description | This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the β-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (β-masking in short). Two post-processing techniques are proposed to improve the quality of the β-masking enhanced speech signals. One technique involves non-linear high-frequency regeneration, which uses the lower-band spectral information to re-synthesize the upper-band spectral structure. The other technique involves re-synthesis of the weak spectral components using the autocorrelations of the strong spectral components. In addition to the single-channel speech enhancement methods, a two-channel speech enhancement method for communication in a car environment is also studied. To achieve a better performance, the single-channel β-masking speech enhancement technique is incorporated within the two-channel enhancement system. The resulting output speech signals have low background noise and the distortion to the speech components is also very low, thus achieving an overall very satisfactory speech enhancement performance. |
first_indexed | 2024-10-01T06:03:42Z |
format | Thesis |
id | ntu-10356/3522 |
institution | Nanyang Technological University |
last_indexed | 2024-10-01T06:03:42Z |
publishDate | 2008 |
record_format | dspace |
spelling | ntu-10356/35222023-07-04T16:41:57Z Two-channel noise reduction and post-processing for speech enhancement Zhang, Xinxin Koh Soo Ngee School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing This thesis is focused on speech enhancement techniques based on short-time spectral amplitude (STSA) estimation. In particular, it addresses the weakness of the β-order minimum mean-square error (MMSE) estimation method that incorporates auditory masking properties (β-masking in short). Two post-processing techniques are proposed to improve the quality of the β-masking enhanced speech signals. One technique involves non-linear high-frequency regeneration, which uses the lower-band spectral information to re-synthesize the upper-band spectral structure. The other technique involves re-synthesis of the weak spectral components using the autocorrelations of the strong spectral components. In addition to the single-channel speech enhancement methods, a two-channel speech enhancement method for communication in a car environment is also studied. To achieve a better performance, the single-channel β-masking speech enhancement technique is incorporated within the two-channel enhancement system. The resulting output speech signals have low background noise and the distortion to the speech components is also very low, thus achieving an overall very satisfactory speech enhancement performance. MASTER OF ENGINEERING (EEE) 2008-09-17T09:31:33Z 2008-09-17T09:31:33Z 2008 2008 Thesis Zhang, X. (2008). Two-channel noise reduction and post-processing for speech enhancement. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/3522 10.32657/10356/3522 Nanyang Technological University application/pdf |
title | Two-channel noise reduction and post-processing for speech enhancement |
topic | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
url | https://hdl.handle.net/10356/3522 |