Spatial speech processing for multi-party teleconferencing
3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in t...
Main Author: | Phua, Kok Soon |
---|---|
Other Authors: | Gan, Woon Seng |
Format: | Thesis |
Language: | English |
Published: | 2008 |
Subjects: | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
Online Access: | http://hdl.handle.net/10356/13286 |
_version_ | 1826114224484515840 |
---|---|
author | Phua, Kok Soon. |
author2 | Gan, Woon Seng |
author_facet | Gan, Woon Seng Phua, Kok Soon. |
author_sort | Phua, Kok Soon. |
collection | NTU |
description | 3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in teleconferencing. In this model, a monaural speech signal is synthesized into a binaural signal by reproducing the sound localization cues for each ear, so as to create a perceived position for that signal. |
first_indexed | 2024-10-01T03:35:40Z |
format | Thesis |
id | ntu-10356/13286 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2024-10-01T03:35:40Z |
publishDate | 2008 |
record_format | dspace |
spelling | ntu-10356/132862023-07-04T15:52:05Z Spatial speech processing for multi-party teleconferencing Phua, Kok Soon. Gan, Woon Seng School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing 3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in teleconferencing. In this model, a monoaural speech signal is synthesized into a binaural signal by reproducing the sound localization cues for each ear so as to create the perceived position of that signal. Master of Engineering 2008-10-20T07:23:08Z 2008-10-20T07:23:08Z 1999 1999 Thesis http://hdl.handle.net/10356/13286 en 180 p. application/pdf |
spellingShingle | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Phua, Kok Soon. Spatial speech processing for multi-party teleconferencing |
title | Spatial speech processing for multi-party teleconferencing |
topic | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
url | http://hdl.handle.net/10356/13286 |
work_keys_str_mv | AT phuakoksoon spatialspeechprocessingformultipartyteleconferencing |
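
The description above says that a monaural speech signal is turned into a binaural one by reproducing localization cues for each ear. As a rough illustration only, and not the structural model developed in the thesis, the sketch below applies the two simplest cues: an interaural time difference (ITD) from Woodworth's spherical-head approximation, and a frequency-independent interaural level difference (ILD). The head radius, the 8 kHz sample rate, the 6 dB maximum ILD and the names `itd_seconds` and `binauralize` are all assumptions chosen for the example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value
HEAD_RADIUS = 0.0875    # m, assumed average head radius


def itd_seconds(azimuth_deg):
    """Woodworth's spherical-head ITD for a far-field source.

    azimuth_deg is in [-90, 90]; positive means the source is to the right.
    """
    theta = math.radians(azimuth_deg)
    return (HEAD_RADIUS / SPEED_OF_SOUND) * (theta + math.sin(theta))


def binauralize(mono, azimuth_deg, sample_rate=8000, max_ild_db=6.0):
    """Return (left, right) sample lists for a mono signal placed at azimuth_deg.

    The ear farther from the source receives a delayed, attenuated copy.
    The ILD here is a crude broadband gain (an assumption); real ILDs
    depend strongly on frequency.
    """
    if not -90.0 <= azimuth_deg <= 90.0:
        raise ValueError("azimuth_deg must lie in [-90, 90]")

    # ITD rounded to a whole number of samples.
    delay = round(abs(itd_seconds(azimuth_deg)) * sample_rate)

    # Attenuation of the far ear, growing with lateral angle.
    ild_db = max_ild_db * abs(math.sin(math.radians(azimuth_deg)))
    far_gain = 10.0 ** (-ild_db / 20.0)

    # Pad both channels to the same length.
    near = [float(s) for s in mono] + [0.0] * delay
    far = [0.0] * delay + [far_gain * float(s) for s in mono]

    if azimuth_deg >= 0:
        return far, near  # source on the right: left ear is the far ear
    return near, far      # source on the left: right ear is the far ear


if __name__ == "__main__":
    left, right = binauralize([1.0, 0.5, 0.25], azimuth_deg=90.0)
    print("left :", left)
    print("right:", right)
```

In a multi-party call, each talker could be given a different azimuth before the binaural signals are summed, so that listeners hear the voices at separate positions. A fuller model would add frequency-dependent head-shadow and pinna filtering, which this sketch omits.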