Thangka Image Captioning Based on Semantic Concept Prompt and Multimodal Feature Optimization

Thangka images exhibit a high level of diversity and richness, and the existing deep learning-based image captioning methods generate poor accuracy and richness of Chinese captions for Thangka images. To address this issue, this paper proposes a Semantic Concept Prompt and Multimodal Feature Optimiz...

Full description

Bibliographic Details
Main Authors:	Wenjin Hu, Lang Qiao, Wendong Kang, Xinyue Shi
Format:	Article
Language:	English
Published:	MDPI AG 2023-08-01
Series:	Journal of Imaging
Subjects:	image captioning Thangka deep learning visual concepts knowledge distillation
Online Access:	https://www.mdpi.com/2313-433X/9/8/162

Internet

https://www.mdpi.com/2313-433X/9/8/162

Thangka Image Captioning Based on Semantic Concept Prompt and Multimodal Feature Optimization

Internet

Similar Items