Prosody modeling with soft templates

This paper describes a novel prosody generation model. We intend it to broadly support many linguistic theories and multiple languages, for the model imposes no restriction on accent categories and shapes. This capability is crucial to the next generation of Text-to-Speech systems that will need to...

Full description

Bibliographic details
Main authors:	Kochanski, G; Shih, C
Other authors:	European Association for Signal Processing (EURASIP)
Format:	Journal article
Language:	English
Published:	Elsevier 2003
Subjects:	Linguistics; Computational Linguistics

Abstract:	This paper describes a novel prosody generation model. We intend it to broadly support many linguistic theories and multiple languages, for the model imposes no restriction on accent categories and shapes. This capability is crucial to the next generation of Text-to-Speech systems that will need to synthesize intonation variations for different speech acts, emotions, and styles of speech. The system supports mark-up tags that are mathematically defined and generate f0 deterministically. Underlying the tags is an articulatory model of accent interaction which balances physiological and communication constraints. We specify the model by way of an algorithm for calculating the pitch, and by way of examples. The model allows localized, linguistically reasonable tags, and is suitable for a data-driven fitting process.
Identifier:	oxford-uuid:a453b9e5-010e-42e4-b41b-e5b882ad1bc3
Institution:	University of Oxford
Repository:	Oxford University Research Archive
Organizations listed in the record:	European Association for Signal Processing (EURASIP); International Speech Communication Association