Enhancing Embedded Space with Low–Level Features for Speech Emotion Recognition

Enhancing Embedded Space with Low–Level Features for Speech Emotion Recognition

This work proposes an approach that uses a feature space by combining the representation obtained in the unsupervised learning process and manually selected features defining the prosody of the utterances. In the experiments, we used two time-frequency representations (Mel and CQT spectrograms) and...

Full description

Bibliographic Details
Main Authors:	Lukasz Smietanka, Tomasz Maka
Format:	Article
Language:	English
Published:	MDPI AG 2025-02-01
Series:	Applied Sciences
Subjects:	speech emotion recognition deep learning audio features
Online Access:	https://www.mdpi.com/2076-3417/15/5/2598

Similar Items

Multi-Modal Emotion Recognition Using Speech Features and Text-Embedding
by: Sung-Woo Byun, et al.
Published: (2021-08-01)

Learning Salient Segments for Speech Emotion Recognition Using Attentive Temporal Pooling
by: Xiaohan Xia, et al.
Published: (2020-01-01)

Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network
by: Zijun Yang, et al.
Published: (2024-01-01)

IMPROVED SPEAKER-INDEPENDENT EMOTION RECOGNITION FROM SPEECH USING TWO-STAGE FEATURE REDUCTION
by: Hasrul Mohd Nazid, et al.
Published: (2015-04-01)

Autoencoder With Emotion Embedding for Speech Emotion Recognition
by: Chenghao Zhang, et al.
Published: (2021-01-01)

Speech emotion recognition using machine learning — A systematic review
by: Samaneh Madanian, et al.
Published: (2023-11-01)

A Parallel-Model Speech Emotion Recognition Network Based on Feature Clustering
by: Li-Min Zhang, et al.
Published: (2023-01-01)

Feature selection enhancement and feature space visualization for speech-based emotion recognition
by: Sofia Kanwal, et al.
Published: (2022-11-01)

Decoupled Feature and Self-Knowledge Distillation for Speech Emotion Recognition
by: Haixiang Yu, et al.
Published: (2025-01-01)

A Neural Network Architecture for Children’s Audio–Visual Emotion Recognition
by: Anton Matveev, et al.
Published: (2023-11-01)

MelTrans: Mel-Spectrogram Relationship-Learning for Speech Emotion Recognition via Transformers
by: Hui Li, et al.
Published: (2024-08-01)

Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features
by: Tursunov Anvarjon, et al.
Published: (2020-09-01)

A Comparison of Machine Learning Algorithms and Feature Sets for Automatic Vocal Emotion Recognition in Speech
by: Cem Doğdu, et al.
Published: (2022-10-01)

Optimizing Speech Emotion Recognition with Machine Learning Based Advanced Audio Cue Analysis
by: Nuwan Pallewela, et al.
Published: (2024-07-01)

Emotion recognition of human speech using deep learning method and MFCC features
by: Sumon Kumar Hazra, et al.
Published: (2022-11-01)

Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition
by: Jun Deng, et al.
Published: (2016-01-01)

Frequency, Time, Representation and Modeling Aspects for Major Speech and Audio Processing Applications
by: Juraj Kacur, et al.
Published: (2022-08-01)

Utterance Level Feature Aggregation with Deep Metric Learning for Speech Emotion Recognition
by: Bogdan Mocanu, et al.
Published: (2021-06-01)

Design of the Speech Emotion Recognition Model
by: Hanping Ke, et al.
Published: (2023-07-01)

Multimodal Approach of Speech Emotion Recognition Using Multi-Level Multi-Head Fusion Attention-Based Recurrent Neural Network
by: Ngoc-Huynh Ho, et al.
Published: (2020-01-01)

Comparative Performance Analysis of Metaheuristic Feature Selection Methods for Speech Emotion Recognition
by: Ozseven Turgut, et al.
Published: (2024-04-01)

Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network
by: Misbah Farooq, et al.
Published: (2020-10-01)

Enhancing Speech Emotion Recognition Using Dual Feature Extraction Encoders
by: Ilkhomjon Pulatov, et al.
Published: (2023-07-01)

A Feature Selection Algorithm Based on Differential Evolution for English Speech Emotion Recognition
by: Liya Yue, et al.
Published: (2023-11-01)

Deep Cross-Corpus Speech Emotion Recognition: Recent Advances and Perspectives
by: Shiqing Zhang, et al.
Published: (2021-11-01)

MSFL: Explainable Multitask-Based Shared Feature Learning for Multilingual Speech Emotion Recognition
by: Yiping Ma, et al.
Published: (2022-12-01)

Emotion Recognition from Chinese Speech for Smart Affective Services Using a Combination of SVM and DBN
by: Lianzhang Zhu, et al.
Published: (2017-07-01)

Data Augmentation and Effective Feature Selection in Generative Adversarial Networks for Speech Emotion Recognition
by: Arash Shilandari, et al.
Published: (2023-03-01)

Speech Emotion Recognition: Humans vs Machines
by: S. Werner, et al.
Published: (2019-12-01)

A Feature Fusion Model with Data Augmentation for Speech Emotion Recognition
by: Zhongwen Tu, et al.
Published: (2023-03-01)

A Comprehensive Review of Speech Emotion Recognition Systems
by: Taiba Majid Wani, et al.
Published: (2021-01-01)

Speech emotion recognition based on genetic algorithm?decision tree fusion of deep and acoustic features
by: Linhui Sun, et al.
Published: (2022-06-01)

A Combined CNN Architecture for Speech Emotion Recognition
by: Rolinson Begazo, et al.
Published: (2024-09-01)

Evaluating Self-Supervised Speech Representations for Speech Emotion Recognition
by: Bagus Tris Atmaja, et al.
Published: (2022-01-01)

Speech Emotion Recognition using Unsupervised Feature Selection Algorithms
by: S. R. Bandela, et al.
Published: (2020-06-01)

Progressive distribution adapted neural networks for cross-corpus speech emotion recognition
by: Yuan Zong, et al.
Published: (2022-09-01)

Lexical Dependent Emotion Detection Using Synthetic Speech Reference
by: Reza Lotfian, et al.
Published: (2019-01-01)

Analysis of Linguistic and Prosodic Features of Bilingual Arabic–English Speakers for Speech Emotion Recognition
by: Lamiaa Abdel-Hamid, et al.
Published: (2020-01-01)

On the Acoustics of Emotion in Audio: What Speech, Music and Sound have in Common
by: Felix eWeninger, et al.
Published: (2013-05-01)

Speech Emotion Recognition Using Deep Learning Transfer Models and Explainable Techniques
by: Tae-Wan Kim, et al.
Published: (2024-02-01)