Enhancing Embedded Space with Low–Level Features for Speech Emotion Recognition

This work proposes an approach that uses a feature space by combining the representation obtained in the unsupervised learning process and manually selected features defining the prosody of the utterances. In the experiments, we used two time-frequency representations (Mel and CQT spectrograms) and...

Ամբողջական նկարագրություն

Մատենագիտական մանրամասներ
Հիմնական հեղինակներ:	Lukasz Smietanka, Tomasz Maka
Ձևաչափ:	Հոդված
Լեզու:	English
Հրապարակվել է:	MDPI AG 2025-02-01
Շարք:	Applied Sciences
Խորագրեր:	speech emotion recognition deep learning audio features
Առցանց հասանելիություն:	https://www.mdpi.com/2076-3417/15/5/2598

Համացանց

https://www.mdpi.com/2076-3417/15/5/2598

Enhancing Embedded Space with Low–Level Features for Speech Emotion Recognition

Համացանց

Նմանատիպ նյութեր