Multi-Shared Attention with Global and Local Pathways for Video Question Answering
Video question answering is a challenging task of significant importance toward visual understanding.However,current visual question answering (VQA) methods mainly focus on a single static image,which is distinct from the sequential visual data we faced in the real world.In addition,due to the diver...
1. autor: | |
---|---|
Format: | Artykuł |
Język: | zho |
Wydane: |
Editorial office of Computer Science
2021-08-01
|
Seria: | Jisuanji kexue |
Hasła przedmiotowe: | |
Dostęp online: | http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-8-145.pdf |