Deep emotion recognition based on audio–visual correlation
Human emotion recognition is studied by means of unimodal channels over the last decade. However, efforts continue to answer tempting questions about how variant modalities can complement each other. This study proposes a multimodal approach using three‐dimensional (3D) convolutional neural networks...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2020-10-01
|
Series: | IET Computer Vision |
Subjects: | |
Online Access: | https://doi.org/10.1049/iet-cvi.2020.0013 |