MC-Net: multi-scale contextual information aggregation network for image captioning on remote sensing images
ABSTRACTRemote Sensing Image Captioning (RSIC) plays a crucial role in advancing semantic understanding and has increasingly become a focal point of research. Nevertheless, existing RSIC methods grapple with challenges due to the intricate multi-scale nature and multifaceted backgrounds inherent in...
Main Authors: | Haiyan Huang, Zhenfeng Shao, Qimin Cheng, Xiao Huang, Xiaoping Wu, Guoming Li, Li Tan |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor & Francis Group
2023-12-01
|
Series: | International Journal of Digital Earth |
Subjects: | |
Online Access: | https://www.tandfonline.com/doi/10.1080/17538947.2023.2283482 |
Similar Items
-
VAA: Visual Aligning Attention Model for Remote Sensing Image Captioning
by: Zhengyuan Zhang, et al.
Published: (2019-01-01) -
Image Caption Generation Using Contextual Information Fusion With Bi-LSTM-s
by: Huawei Zhang, et al.
Published: (2023-01-01) -
Multi‐task learning for captioning images with novel words
by: He Zheng, et al.
Published: (2019-04-01) -
Exploring Multi-Level Attention and Semantic Relationship for Remote Sensing Image Captioning
by: Zhenghang Yuan, et al.
Published: (2020-01-01) -
VSAM-Based Visual Keyword Generation for Image Caption
by: Suya Zhang, et al.
Published: (2021-01-01)