Multi-Shared Attention with Global and Local Pathways for Video Question Answering
Video question answering is a challenging task of significant importance toward visual understanding.However,current visual question answering (VQA) methods mainly focus on a single static image,which is distinct from the sequential visual data we faced in the real world.In addition,due to the diver...
Main Author: | WANG Lei-quan, HOU Wen-yan, YUAN Shao-zu, ZHAO Xin, LIN Yao, WU Chun-lei |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial office of Computer Science
2021-08-01
|
Series: | Jisuanji kexue |
Subjects: | |
Online Access: | http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-8-145.pdf |
Similar Items
-
TASTA: Text‐Assisted Spatial and Temporal Attention Network for Video Question Answering
by: Tian Wang, et al.
Published: (2023-04-01) -
Co-Attention Network With Question Type for Visual Question Answering
by: Chao Yang, et al.
Published: (2019-01-01) -
Multi-Modality Global Fusion Attention Network for Visual Question Answering
by: Cheng Yang, et al.
Published: (2020-11-01) -
A Video Question Answering Model Based on Knowledge Distillation
by: Zhuang Shao, et al.
Published: (2023-06-01) -
Standard refrigeration and air conditioning : questions and answers/
by: 247465 Elonka, Stephen Michael, et al.
Published: (1973)