Speech Emotion Recognition Based on Self-Attention Weight Correction for Acoustic and Text Features
Speech emotion recognition (SER) is essential for understanding a speaker’s intention. Recently, some groups have attempted to improve SER performance using a bidirectional long short-term memory (BLSTM) to extract features from speech sequences and a self-attention mechanism to focus on...
Main Authors: | Jennifer Santoso, Takeshi Yamada, Kenkichi Ishizuka, Taiichi Hashimoto, Shoji Makino |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2022-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9936641/ |
Similar Items
-
Speech recognition and text-to-speech synthesis for student matric number and CPA /
by: 429495 Foo, Chin Ho
Published: (2001) -
Speech recognition and text-to-speech synthesis for student matric number, CPA and GPA /
by: 445693 Ee, Su Fun
Published: (2002) -
Blind speech separation /
by: Makino, S. (Shoji), et al.
Published: (2007) -
Isolated digit recognition system with speech compression using IBM-PC /
by: Sheikh Hussain Shaikh Salleh, 1958-, et al. -
Malay text to speech synthesis system : malay speech synthesis system design and verification /
by: Tan, Tian Swee, 1978-
Published: (2011)