Enhancing Embedded Space with Low–Level Features for Speech Emotion Recognition
This work proposes an approach that builds a feature space by combining the representation obtained through unsupervised learning with manually selected features that describe the prosody of the utterances. In the experiments, we used two time-frequency representations (Mel and CQT spectrograms) and...
Main Authors: | |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG 2025-02-01 |
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/15/5/2598 |