Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier
The paper aims to determine formant features (FF), which describe vocal tract characteristics. It analyzes the first three formant positions together with their bandwidths and the formant tilts, followed by a statistical evaluation and comparison of the FF. T...
Main Authors: | J. Pribil, A. Pribilova |
---|---|
Format: | Article |
Language: | English |
Published: | Spolecnost pro radioelektronicke inzenyrstvi, 2013-04-01 |
Series: | Radioengineering |
Subjects: | Formant features of speech; emotional speech; statistical analysis |
Online Access: | http://www.radioeng.cz/fulltexts/2013/13_01_0052_0059.pdf |
_version_ | 1819238565929811968 |
---|---|
author | J. Pribil, A. Pribilova |
collection | DOAJ |
description | The paper aims to determine formant features (FF), which describe vocal tract characteristics. It analyzes the first three formant positions together with their bandwidths and the formant tilts, followed by a statistical evaluation and comparison of the FF. The experiment used speech material in the form of sentences spoken by male and female speakers expressing four emotional states (joy, sadness, anger, and a neutral state) in the Czech and Slovak languages. The statistical distributions of the analyzed formant frequencies and formant tilts show good differentiation between neutral and emotional styles for both voices. In contrast, the values of the formant 3-dB bandwidths show no correlation with the type of speaking style or the type of voice. These spectral parameters, together with the values of other speech characteristics, were used in the feature vector for a Gaussian mixture model (GMM) emotional speech style classifier that is currently under development. The overall mean classification error rate is about 18 %, and the best error rate obtained is 5 % for the sadness style of the female voice. These values are acceptable at this first stage of development of the GMM classifier, which is intended for evaluating synthetic speech quality after voice conversion and emotional speech style transformation. |
first_indexed | 2024-12-23T13:38:15Z |
format | Article |
id | doaj.art-bbbd874315c546d29f716892b2eaa46d |
institution | Directory of Open Access Journals |
issn | 1210-2512 |
language | English |
last_indexed | 2024-12-23T13:38:15Z |
publishDate | 2013-04-01 |
publisher | Spolecnost pro radioelektronicke inzenyrstvi |
record_format | Article |
series | Radioengineering |
title | Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier |
topic | Formant features of speech; emotional speech; statistical analysis |
url | http://www.radioeng.cz/fulltexts/2013/13_01_0052_0059.pdf |