Speech emotion recognition using wavelet packet reconstruction with attention-based deep recurrent neutral networks

Speech emotion recognition (SER) is a complicated and challenging task in the human-computer interaction because it is difficult to find the best feature set to discriminate the emotional state entirely. We always used the FFT to handle the raw signal in the process of extracting the low-level descr...

Full description

Bibliographic Details
Main Authors: Hao Meng, Tianhao Yan, Hongwei Wei, Xun Ji
Format: Article
Language:English
Published: Polish Academy of Sciences 2021-02-01
Series:Bulletin of the Polish Academy of Sciences: Technical Sciences
Subjects:
Online Access:https://journals.pan.pl/Content/119177/PDF/14_01872_Bpast.No.69(1)_24.02.21_K1_A_TeX.pdf