Hierarchical Attention-Based Fusion for Image Caption With Multi-Grained Rewards
Image caption based on reinforcement learning (RL) methods has achieved significant success recently. Most of these methods take CIDEr score as the reward of reinforcement learning algorithm to compute gradients, thus refining the image caption baseline model. However, CIDEr score is not the sole cr...
Main Authors: | Chunlei Wu, Shaozu Yuan, Haiwen Cao, Yiwei Wei, Leiquan Wang |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9039552/ |
Similar Items
-
Social Image Captioning: Exploring Visual Attention and User Attention
by: Leiquan Wang, et al.
Published: (2018-02-01) -
Multi-Gate Attention Network for Image Captioning
by: Weitao Jiang, et al.
Published: (2021-01-01) -
Sequential Dual Attention: Coarse-to-Fine-Grained Hierarchical Generation for Image Captioning
by: Zhibin Guan, et al.
Published: (2018-11-01) -
Cascade Semantic Fusion for Image Captioning
by: Shiwei Wang, et al.
Published: (2019-01-01) -
Cross-scale Feature Fusion Self-attention for Image Captioning
by: WANG Ming-zhan, JI Jun-zhong, JIA Ao-zhe, ZHANG Xiao-dan
Published: (2022-10-01)