Multi-Shared Attention with Global and Local Pathways for Video Question Answering

Video question answering is a challenging task of significant importance toward visual understanding. However, current visual question answering (VQA) methods mainly focus on a single static image, which is distinct from the sequential visual data we face in the real world. In addition, due to the diver...

Bibliographic Details
Main Author:	WANG Lei-quan, HOU Wen-yan, YUAN Shao-zu, ZHAO Xin, LIN Yao, WU Chun-lei
Format:	Article
Language:	zho
Published:	Editorial office of Computer Science 2021-08-01
Series:	Jisuanji kexue
Subjects:	video question answering; shared attention mechanism; global and local pathways
Online Access:	http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-8-145.pdf

Similar Items

TASTA: Text‐Assisted Spatial and Temporal Attention Network for Video Question Answering
by: Tian Wang, et al.
Published: (2023-04-01)

Co-Attention Network With Question Type for Visual Question Answering
by: Chao Yang, et al.
Published: (2019-01-01)

A Video Question Answering Model Based on Knowledge Distillation
by: Zhuang Shao, et al.
Published: (2023-06-01)

Standard refrigeration and air conditioning: questions and answers
by: Elonka, Stephen Michael, et al.
Published: (1973)

Answer Category-Aware Answer Selection for Question Answering
by: Weijing Wu, et al.
Published: (2021-01-01)

Arabic Question Answering Systems: Gap Analysis
by: Mariam M. Biltawi, et al.
Published: (2021-01-01)

Question Difficulty Estimation Based on Attention Model for Question Answering
by: Hyun-Je Song, et al.
Published: (2021-12-01)

A survey on complex factual question answering
by: Lingxi Zhang, et al.
Published: (2023-01-01)

Stumpers!: answers to hundreds of questions that stumped the experts
by: Shapiro, Fred R.
Published: (1998)

1000 questions and answers
by: Farndon, John, et al.
Published: (2014)

A Multi-level Mesh Mutual Attention Model for Visual Question Answering
by: Zhi Lei, et al.
Published: (2022-10-01)

Deep Modular Bilinear Attention Network for Visual Question Answering
by: Feng Yan, et al.
Published: (2022-01-01)

Adaptable Closed-Domain Question Answering Using Contextualized CNN-Attention Models and Question Expansion
by: Mahsa Abazari Kia, et al.
Published: (2022-01-01)

Transformer-Based Neural Network for Answer Selection in Question Answering
by: Taihua Shao, et al.
Published: (2019-01-01)

Knowledge Base Question Answering With Attentive Pooling for Question Representation
by: Run-Ze Wang, et al.
Published: (2019-01-01)

Answers or no answers: studying question answerability in stack overflow
by: Chua, Alton Yeow Kuan, et al.
Published: (2020)

An Image Grid Can Be Worth a Video: Zero-Shot Video Question Answering Using a VLM
by: Wonkyun Kim, et al.
Published: (2024-01-01)

The multi-modal fusion in visual question answering: a review of attention mechanisms
by: Siyu Lu, et al.
Published: (2023-05-01)

Very short answer questions: a viable alternative to multiple choice questions
by: Thomas Puthiaparampil, et al.
Published: (2020-05-01)

Aggregated community question answering
by: Snehasish Banerjee, et al.
Published: (2015)

Hierarchical Attentional Factorization Machines for Expert Recommendation in Community Question Answering
by: Weizhao Tang, et al.
Published: (2020-01-01)

Survey of Multimodal Medical Question Answering
by: Hilmi Demirhan, et al.
Published: (2023-12-01)

1001 questions and answers about your car
by: Schultz, Morton J.
Published: (1973)

A Comprehensive Review and Open Challenges on Visual Question Answering Models
by: Fasi Ahamad Shaik, et al.
Published: (2023-09-01)

Adversarial Learning with Bidirectional Attention for Visual Question Answering
by: Qifeng Li, et al.
Published: (2021-10-01)

Multi-Modal Explicit Sparse Attention Networks for Visual Question Answering
by: Zihan Guo, et al.
Published: (2020-11-01)

Answer Distillation Network With Bi-Text-Image Attention for Medical Visual Question Answering
by: Hongfang Gong, et al.
Published: (2025-01-01)

Spatio-Temporal Graph Convolution Transformer for Video Question Answering
by: Jiahao Tang, et al.
Published: (2024-01-01)

Harnessing the Power of Metadata for Enhanced Question Retrieval in Community Question Answering
by: Shima Ghasemi, et al.
Published: (2024-01-01)

Yes or no, or how to answer a negative question
by: Hana Gruet-Skrabalova
Published: (2016-12-01)

Survey of Multilingual Question Answering
by: LIU Chuang, XIONG De-yi
Published: (2022-01-01)

Collaborative Learning for Answer Selection in Question Answering
by: Taihua Shao, et al.
Published: (2019-01-01)

SBVQA 2.0: Robust End-to-End Speech-Based Visual Question Answering for Open-Ended Questions
by: Faris Alasmary, et al.
Published: (2023-01-01)

Intelligent Question Answering in Restricted Domains Using Deep Learning and Question Pair Matching
by: Lin-Qin Cai, et al.
Published: (2020-01-01)

Uslub: Questions and Answers in the Qur’an
by: Suhaimi Suhaimi
Published: (2023-01-01)

QARR-FSQA: Question-Answer Replacement and Removal Pretraining Framework for Few-Shot Question Answering
by: Siao Wah Tan, et al.
Published: (2024-01-01)

Chinese Medical Question Answer Matching Using End-to-End Character-Level Multi-Scale CNNs
by: Sheng Zhang, et al.
Published: (2017-07-01)

Tweet question classification for enhancing Tweet Question Answering System
by: Chindukuri Mallikarjuna, et al.
Published: (2025-03-01)

Survey of Question Answering Based on Knowledge Graph Reasoning
by: SA Rina, LI Yanling, LIN Min
Published: (2022-08-01)

How it works book of amazing answers to curious questions
Published: (2012)