Voice morphing

71 p.

Bibliographic Details
Main Author: Sowmya Uthirapathi
Other Authors: Foo Say Wei
Format: Thesis
Published: 2011
Subjects:
Online Access:http://hdl.handle.net/10356/46802
_version_ 1826117301435367424
author Sowmya Uthirapathi
author2 Foo Say Wei
author_facet Foo Say Wei
Sowmya Uthirapathi
author_sort Sowmya Uthirapathi
collection NTU
description 71 p.
first_indexed 2024-10-01T04:25:21Z
format Thesis
id ntu-10356/46802
institution Nanyang Technological University
last_indexed 2024-10-01T04:25:21Z
publishDate 2011
record_format dspace
spelling ntu-10356/468022023-07-04T15:46:49Z Voice morphing Sowmya Uthirapathi Foo Say Wei School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing 71 p. Voice morphing is a technique to modify source speaker's speech utterance to sound as if the target speaker has spoken it. There are many voice morphing methods but they all have the basic three components 1. Analysis 2. Transformation 3. Conversion Speech coding is used to analyze the speech signal. The goal of speech coding is to represent the speech in digital form with as little bits as possible. And there is a tradeoff between bit rate and voice quality. Aim of this study is to identify an analysis technique, which can generate the synthetic speech with better quality to meet the objectives of speech coding techniques. In this study CELP coding techniques is identified as the best for the analysis of original speech signal when compared to the LPC 10 decoder. The synthetic speech generated by CELP coding technique has better quality and it is closer to the original speech, whereas the synthetic speech generated from the LPC 10 decoder has less quality, more noise, and also unnatural. This is because of the jitter caused by the voiced excitation. So we can choose CELP coding technique for the analysis of speech signal which is the first stage in Voice Morphing. The prediction parameters obtained from CELP analysis were converted into Line Spectral Frequencies. These LSF are used to represent the spectral envelope of source and target speech. Master of Science (Signal Processing) 2011-12-23T09:56:44Z 2011-12-23T09:56:44Z 2009 2009 Thesis http://hdl.handle.net/10356/46802 Nanyang Technological University application/pdf
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Sowmya Uthirapathi
Voice morphing
title Voice morphing
title_full Voice morphing
title_fullStr Voice morphing
title_full_unstemmed Voice morphing
title_short Voice morphing
title_sort voice morphing
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
url http://hdl.handle.net/10356/46802
work_keys_str_mv AT sowmyauthirapathi voicemorphing