Thangka Image Captioning Based on Semantic Concept Prompt and Multimodal Feature Optimization

Thangka images are highly diverse and visually rich, and existing deep learning-based image captioning methods produce Chinese captions for them that lack both accuracy and richness. To address this issue, this paper proposes a Semantic Concept Prompt and Multimodal Feature Optimiz...

Bibliographic Details
Main Authors: Wenjin Hu, Lang Qiao, Wendong Kang, Xinyue Shi
Format: Article
Language: English
Published: MDPI AG 2023-08-01
Series: Journal of Imaging
Online Access: https://www.mdpi.com/2313-433X/9/8/162