Path-Wise Attention Memory Network for Visual Question Answering
Visual question answering (VQA) is regarded as a multi-modal fine-grained feature fusion task, which requires the construction of multi-level and omnidirectional relations between nodes. One main solution is the composite attention model which is composed of co-attention (CA) and self-attention(SA)....
Main Authors: | Yingxin Xiang, Chengyuan Zhang, Zhichao Han, Hao Yu, Jiaye Li, Lei Zhu |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-09-01
|
Series: | Mathematics |
Subjects: | |
Online Access: | https://www.mdpi.com/2227-7390/10/18/3244 |
Similar Items
-
Dual Attention Network for Pitch Estimation of Monophonic Music
by: Wenfang Ma, et al.
Published: (2021-07-01) -
Working-Memory-Guided Attention Competes with Exogenous Attention but Not with Endogenous Attention
by: Ping Zhu, et al.
Published: (2023-05-01) -
The attentional boost effect and perceptual degradation: Assessing the influence of attention on recognition memory
by: Mitchell R. P. LaPointe, et al.
Published: (2022-11-01) -
Editorial: The attentional boost effect and related phenomena: new insights into the relation between attention and memory
by: Clelia Rossi-Arnaud, et al.
Published: (2023-06-01) -
Working Memory Capacity Depends on Attention Control, but Not Selective Attention
by: Alexander I. Kotyusov, et al.
Published: (2023-01-01)