Modality attention fusion model with hybrid multi-head self-attention for video understanding

Video question answering (Video-QA) is an intensely studied topic in artificial intelligence and one of the tasks that can be used to evaluate AI abilities. In this paper, we propose a Modality Attention Fusion framework with Hybrid Multi-head Self-attention (MAF-HMS). MAF-HMS focuses on th...

Bibliographic Details
Main Authors: Xuqiang Zhuang, Fang’ai Liu, Jian Hou, Jianhua Hao, Xiaohong Cai
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2022-01-01
Series: PLoS ONE
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536548/?tool=EBI