High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition

In this paper we study the cross-language speech emotion recognition using high-order Markov random fields, especially the application in Vietnamese speech emotion recognition. First, we extract the basic speech features including pitch frequency, formant frequency and short-term intensity. Based on...

Full description

Bibliographic Details
Main Authors: Zhipeng Jiang, Chengwei Huang
Format: Article
Language:English
Published: Sciendo 2015-11-01
Series:Cybernetics and Information Technologies
Subjects:
Online Access:https://doi.org/10.1515/cait-2015-0054
_version_ 1818393144555732992
author Zhipeng Jiang
Chengwei Huang
author_facet Zhipeng Jiang
Chengwei Huang
author_sort Zhipeng Jiang
collection DOAJ
description In this paper we study the cross-language speech emotion recognition using high-order Markov random fields, especially the application in Vietnamese speech emotion recognition. First, we extract the basic speech features including pitch frequency, formant frequency and short-term intensity. Based on the low level descriptor we further construct the statistic features including maximum, minimum, mean and standard deviation. Second, we adopt the high-order Markov random fields (MRF) to optimize the cross-language speech emotion model. The dimensional restrictions may be modeled by MRF. Third, based on the Vietnamese and Chinese database we analyze the efficiency of our emotion recognition system. We adopt the dimensional emotion model (arousal-valence) to verify the efficiency of MRF configuration method. The experimental results show that the high-order Markov random fields can improve the dimensional emotion recognition in the cross-language experiments, and the configuration method shows promising robustness over different languages.
first_indexed 2024-12-14T05:40:39Z
format Article
id doaj.art-6add256c0e564df3841d33f6f6d28f4b
institution Directory Open Access Journal
issn 1314-4081
language English
last_indexed 2024-12-14T05:40:39Z
publishDate 2015-11-01
publisher Sciendo
record_format Article
series Cybernetics and Information Technologies
spelling doaj.art-6add256c0e564df3841d33f6f6d28f4b2022-12-21T23:15:02ZengSciendoCybernetics and Information Technologies1314-40812015-11-01154505710.1515/cait-2015-0054High-Order Markov Random Fields and Their Applications in Cross-Language Speech RecognitionZhipeng Jiang0Chengwei Huang1School of Electronics and Information Engineering, Jinling Institute of Technology, Nanjing, ChinaCollege of Physics, Optoelectronics and Energy, Soochow University, Suzhou, ChinaIn this paper we study the cross-language speech emotion recognition using high-order Markov random fields, especially the application in Vietnamese speech emotion recognition. First, we extract the basic speech features including pitch frequency, formant frequency and short-term intensity. Based on the low level descriptor we further construct the statistic features including maximum, minimum, mean and standard deviation. Second, we adopt the high-order Markov random fields (MRF) to optimize the cross-language speech emotion model. The dimensional restrictions may be modeled by MRF. Third, based on the Vietnamese and Chinese database we analyze the efficiency of our emotion recognition system. We adopt the dimensional emotion model (arousal-valence) to verify the efficiency of MRF configuration method. The experimental results show that the high-order Markov random fields can improve the dimensional emotion recognition in the cross-language experiments, and the configuration method shows promising robustness over different languages.https://doi.org/10.1515/cait-2015-0054high-order markov random fieldsspeech emotion recognitioncrossdatabase recognitiondimensional emotion model
spellingShingle Zhipeng Jiang
Chengwei Huang
High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition
Cybernetics and Information Technologies
high-order markov random fields
speech emotion recognition
crossdatabase recognition
dimensional emotion model
title High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition
title_full High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition
title_fullStr High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition
title_full_unstemmed High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition
title_short High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition
title_sort high order markov random fields and their applications in cross language speech recognition
topic high-order markov random fields
speech emotion recognition
crossdatabase recognition
dimensional emotion model
url https://doi.org/10.1515/cait-2015-0054
work_keys_str_mv AT zhipengjiang highordermarkovrandomfieldsandtheirapplicationsincrosslanguagespeechrecognition
AT chengweihuang highordermarkovrandomfieldsandtheirapplicationsincrosslanguagespeechrecognition