A Romanian Prosody Prediction Module Based on a Functional Intonational Model

Bibliographic Details
Main Authors: Doina Jitca, Vasile Apopei, Otilia Paduraru
Format: Article
Language: English
Published: Publishing House of the Romanian Academy 2012-10-01
Series: Memoirs of the Scientific Sections of the Romanian Academy
Online Access: http://mss.academiaromana-is.ro/mem_sc_st_2012/art%2006%20Jitca.pdf
Description
Summary: This paper presents a prosody prediction module used by the Romanian Text-to-Speech (TtS) system for intonation synthesis. Prosody prediction here refers to generating the surface F0 contour from the F0 patterns assigned to the functional categories of the prosodic units. Before presenting the prediction module, the paper summarizes these functional categories and the partial melodic contour descriptions based on functional labels. The block diagram of the prediction module outlines two main processing steps: phrasing prediction, which builds the utterance tree, and selection of the melodic contours for the groups of that tree. Both steps are illustrated in a case study of Romanian text-to-speech synthesis. The prosody prediction results are discussed and compared with natural F0 contours produced by different speakers.
ISSN: 1224-1407, 2343-7049