Multi-Shared Attention with Global and Local Pathways for Video Question Answering

Video question answering is a challenging task of significant importance to visual understanding. However, current visual question answering (VQA) methods mainly focus on a single static image, which is distinct from the sequential visual data we face in the real world. In addition, due to the diver...

Bibliographic Details
Main Authors: WANG Lei-quan, HOU Wen-yan, YUAN Shao-zu, ZHAO Xin, LIN Yao, WU Chun-lei
Format: Article
Language: zho
Published: Editorial office of Computer Science 2021-08-01
Series: Jisuanji kexue
Online Access: http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-8-145.pdf