Speech emotion recognition using wavelet packet reconstruction with attention-based deep recurrent neutral networks
Speech emotion recognition (SER) is a complicated and challenging task in the human-computer interaction because it is difficult to find the best feature set to discriminate the emotional state entirely. We always used the FFT to handle the raw signal in the process of extracting the low-level descr...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Polish Academy of Sciences
2021-02-01
|
Series: | Bulletin of the Polish Academy of Sciences: Technical Sciences |
Subjects: | |
Online Access: | https://journals.pan.pl/Content/119177/PDF/14_01872_Bpast.No.69(1)_24.02.21_K1_A_TeX.pdf |