Enhancing Embedded Space with Low–Level Features for Speech Emotion Recognition
This work proposes an approach that uses a feature space by combining the representation obtained in the unsupervised learning process and manually selected features defining the prosody of the utterances. In the experiments, we used two time-frequency representations (Mel and CQT spectrograms) and...
Main Authors: | Lukasz Smietanka, Tomasz Maka |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-02-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/15/5/2598 |
Similar Items
-
Multi-Modal Emotion Recognition Using Speech Features and Text-Embedding
by: Sung-Woo Byun, et al.
Published: (2021-08-01) -
Learning Salient Segments for Speech Emotion Recognition Using Attentive Temporal Pooling
by: Xiaohan Xia, et al.
Published: (2020-01-01) -
Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network
by: Zijun Yang, et al.
Published: (2024-01-01) -
IMPROVED SPEAKER-INDEPENDENT EMOTION RECOGNITION FROM SPEECH USING TWO-STAGE FEATURE REDUCTION
by: Hasrul Mohd Nazid, et al.
Published: (2015-04-01) -
Autoencoder With Emotion Embedding for Speech Emotion Recognition
by: Chenghao Zhang, et al.
Published: (2021-01-01)