Deep Visual Attributes vs. Hand-Crafted Audio Features on Multidomain Speech Emotion Recognition
Emotion recognition from speech may play a crucial role in many applications related to human–computer interaction or understanding the affective state of users in certain tasks, where other modalities such as video or physiological parameters are unavailable. In general, a human’s emotions may be r...
Main Authors: | Michalis Papakostas, Evaggelos Spyrou, Theodoros Giannakopoulos, Giorgos Siantikos, Dimitrios Sgouropoulos, Phivos Mylonas, Fillia Makedon |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2017-06-01
|
Series: | Computation |
Subjects: | |
Online Access: | http://www.mdpi.com/2079-3197/5/2/26 |
Similar Items
-
Emotion Recognition from Speech Using the Bag-of-Visual Words on Audio Segment Spectrograms
by: Evaggelos Spyrou, et al.
Published: (2019-02-01) -
Introduction to the Special Issue on Image-Based Information Retrieval from the Web
by: Phivos Mylonas, et al.
Published: (2019-06-01) -
Human Activity Recognition in the Presence of Occlusion
by: Ioannis Vernikos, et al.
Published: (2023-05-01) -
CogBeacon: A Multi-Modal Dataset and Data-Collection Platform for Modeling Cognitive Fatigue
by: Michalis Papakostas, et al.
Published: (2019-06-01) -
Audio-Based Event Detection at Different SNR Settings Using Two-Dimensional Spectrogram Magnitude Representations
by: Ioannis Papadimitriou, et al.
Published: (2020-09-01)