Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier

The paper is aimed at determination of formant features (FF) which describe vocal tract characteristics. It comprises analysis of the first three formant positions together with their bandwidths and the formant tilts. Subsequently, the statistical evaluation and comparison of the FF was performed. T...

Full description

Bibliographic Details
Main Authors: J. Pribil, A. Pribilova
Format: Article
Language:English
Published: Spolecnost pro radioelektronicke inzenyrstvi 2013-04-01
Series:Radioengineering
Subjects:
Online Access:http://www.radioeng.cz/fulltexts/2013/13_01_0052_0059.pdf
_version_ 1819238565929811968
author J. Pribil
A. Pribilova
author_facet J. Pribil
A. Pribilova
author_sort J. Pribil
collection DOAJ
description The paper is aimed at determination of formant features (FF) which describe vocal tract characteristics. It comprises analysis of the first three formant positions together with their bandwidths and the formant tilts. Subsequently, the statistical evaluation and comparison of the FF was performed. This experiment was realized with the speech material in the form of sentences of male and female speakers expressing four emotional states (joy, sadness, anger, and a neutral state) in Czech and Slovak languages. The statistical distribution of the analyzed formant frequencies and formant tilts shows good differentiation between neutral and emotional styles for both voices. Contrary to it, the values of the formant 3-dB bandwidths have no correlation with the type of the speaking style or the type of the voice. These spectral parameters together with the values of the other speech characteristics were used in the feature vector for Gaussian mixture models (GMM) emotional speech style classifier that is currently developed. The overall mean classification error rate achieves about 18 %, and the best obtained error rate is 5 % for the sadness style of the female voice. These values are acceptable in this first stage of development of the GMM classifier that should be used for evaluation of the synthetic speech quality after applied voice conversion and emotional speech style transformation.
first_indexed 2024-12-23T13:38:15Z
format Article
id doaj.art-bbbd874315c546d29f716892b2eaa46d
institution Directory Open Access Journal
issn 1210-2512
language English
last_indexed 2024-12-23T13:38:15Z
publishDate 2013-04-01
publisher Spolecnost pro radioelektronicke inzenyrstvi
record_format Article
series Radioengineering
spelling doaj.art-bbbd874315c546d29f716892b2eaa46d2022-12-21T17:44:58ZengSpolecnost pro radioelektronicke inzenyrstviRadioengineering1210-25122013-04-012215259Determination of Formant Features in Czech and Slovak for GMM Emotional Speech ClassifierJ. PribilA. PribilovaThe paper is aimed at determination of formant features (FF) which describe vocal tract characteristics. It comprises analysis of the first three formant positions together with their bandwidths and the formant tilts. Subsequently, the statistical evaluation and comparison of the FF was performed. This experiment was realized with the speech material in the form of sentences of male and female speakers expressing four emotional states (joy, sadness, anger, and a neutral state) in Czech and Slovak languages. The statistical distribution of the analyzed formant frequencies and formant tilts shows good differentiation between neutral and emotional styles for both voices. Contrary to it, the values of the formant 3-dB bandwidths have no correlation with the type of the speaking style or the type of the voice. These spectral parameters together with the values of the other speech characteristics were used in the feature vector for Gaussian mixture models (GMM) emotional speech style classifier that is currently developed. The overall mean classification error rate achieves about 18 %, and the best obtained error rate is 5 % for the sadness style of the female voice. These values are acceptable in this first stage of development of the GMM classifier that should be used for evaluation of the synthetic speech quality after applied voice conversion and emotional speech style transformation.www.radioeng.cz/fulltexts/2013/13_01_0052_0059.pdfFormant features of speechemotional speechstatistical analysis
spellingShingle J. Pribil
A. Pribilova
Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier
Radioengineering
Formant features of speech
emotional speech
statistical analysis
title Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier
title_full Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier
title_fullStr Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier
title_full_unstemmed Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier
title_short Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier
title_sort determination of formant features in czech and slovak for gmm emotional speech classifier
topic Formant features of speech
emotional speech
statistical analysis
url http://www.radioeng.cz/fulltexts/2013/13_01_0052_0059.pdf
work_keys_str_mv AT jpribil determinationofformantfeaturesinczechandslovakforgmmemotionalspeechclassifier
AT apribilova determinationofformantfeaturesinczechandslovakforgmmemotionalspeechclassifier