MHAiR: A Dataset of Audio-Image Representations for Multimodal Human Actions

Audio-image representations for a multimodal human action (MHAiR) dataset contains six different image representations of the audio signals that capture the temporal dynamics of the actions in a very compact and informative way. The dataset was extracted from the audio recordings which were captured...

Full description

Bibliographic Details
Main Authors: Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Data
Subjects:
Online Access:https://www.mdpi.com/2306-5729/9/2/21