Spatial speech processing for multi-party teleconferencing

3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in t...

Full description

Bibliographic Details
Main Author: Phua, Kok Soon.
Other Authors: Gan, Woon Seng
Format: Thesis
Language:English
Published: 2008
Subjects:
Online Access:http://hdl.handle.net/10356/13286
_version_ 1826114224484515840
author Phua, Kok Soon.
author2 Gan, Woon Seng
author_facet Gan, Woon Seng
Phua, Kok Soon.
author_sort Phua, Kok Soon.
collection NTU
description 3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in teleconferencing. In this model, a monoaural speech signal is synthesized into a binaural signal by reproducing the sound localization cues for each ear so as to create the perceived position of that signal.
first_indexed 2024-10-01T03:35:40Z
format Thesis
id ntu-10356/13286
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:35:40Z
publishDate 2008
record_format dspace
spelling ntu-10356/132862023-07-04T15:52:05Z Spatial speech processing for multi-party teleconferencing Phua, Kok Soon. Gan, Woon Seng School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing 3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in teleconferencing. In this model, a monoaural speech signal is synthesized into a binaural signal by reproducing the sound localization cues for each ear so as to create the perceived position of that signal. Master of Engineering 2008-10-20T07:23:08Z 2008-10-20T07:23:08Z 1999 1999 Thesis http://hdl.handle.net/10356/13286 en 180 p. application/pdf
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Phua, Kok Soon.
Spatial speech processing for multi-party teleconferencing
title Spatial speech processing for multi-party teleconferencing
title_full Spatial speech processing for multi-party teleconferencing
title_fullStr Spatial speech processing for multi-party teleconferencing
title_full_unstemmed Spatial speech processing for multi-party teleconferencing
title_short Spatial speech processing for multi-party teleconferencing
title_sort spatial speech processing for multi party teleconferencing
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
url http://hdl.handle.net/10356/13286
work_keys_str_mv AT phuakoksoon spatialspeechprocessingformultipartyteleconferencing