Spatial speech processing for multi-party teleconferencing

3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in t...

Full description

Bibliographic Details
Main Author:	Phua, Kok Soon.
Other Authors:	Gan, Woon Seng
Format:	Thesis
Language:	English
Published:	2008
Subjects:	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Online Access:	http://hdl.handle.net/10356/13286

_version_	1826114224484515840
author	Phua, Kok Soon.
author2	Gan, Woon Seng
author_facet	Gan, Woon Seng Phua, Kok Soon.
author_sort	Phua, Kok Soon.
collection	NTU
description	3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in teleconferencing. In this model, a monoaural speech signal is synthesized into a binaural signal by reproducing the sound localization cues for each ear so as to create the perceived position of that signal.
first_indexed	2024-10-01T03:35:40Z
format	Thesis
id	ntu-10356/13286
institution	Nanyang Technological University
language	English
last_indexed	2024-10-01T03:35:40Z
publishDate	2008
record_format	dspace
spelling	ntu-10356/132862023-07-04T15:52:05Z Spatial speech processing for multi-party teleconferencing Phua, Kok Soon. Gan, Woon Seng School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing 3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in teleconferencing. In this model, a monoaural speech signal is synthesized into a binaural signal by reproducing the sound localization cues for each ear so as to create the perceived position of that signal. Master of Engineering 2008-10-20T07:23:08Z 2008-10-20T07:23:08Z 1999 1999 Thesis http://hdl.handle.net/10356/13286 en 180 p. application/pdf
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Phua, Kok Soon. Spatial speech processing for multi-party teleconferencing
title	Spatial speech processing for multi-party teleconferencing
title_full	Spatial speech processing for multi-party teleconferencing
title_fullStr	Spatial speech processing for multi-party teleconferencing
title_full_unstemmed	Spatial speech processing for multi-party teleconferencing
title_short	Spatial speech processing for multi-party teleconferencing
title_sort	spatial speech processing for multi party teleconferencing
topic	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
url	http://hdl.handle.net/10356/13286
work_keys_str_mv	AT phuakoksoon spatialspeechprocessingformultipartyteleconferencing

Spatial speech processing for multi-party teleconferencing

Similar Items