MHAiR: A Dataset of Audio-Image Representations for Multimodal Human Actions
Audio-image representations for a multimodal human action (MHAiR) dataset contains six different image representations of the audio signals that capture the temporal dynamics of the actions in a very compact and informative way. The dataset was extracted from the audio recordings which were captured...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-01-01
|
Series: | Data |
Subjects: | |
Online Access: | https://www.mdpi.com/2306-5729/9/2/21 |