Multi-Shared Attention with Global and Local Pathways for Video Question Answering
Video question answering is a challenging task of significant importance toward visual understanding.However,current visual question answering (VQA) methods mainly focus on a single static image,which is distinct from the sequential visual data we faced in the real world.In addition,due to the diver...
Main Author: | |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial office of Computer Science
2021-08-01
|
Series: | Jisuanji kexue |
Subjects: | |
Online Access: | http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-8-145.pdf |