A Unified Visual and Linguistic Semantics Method for Enhanced Image Captioning

Image captioning, also recognized as the challenge of transforming visual data into coherent natural language descriptions, has persisted as a complex problem. Traditional approaches often suffer from semantic gaps, wherein the generated textual descriptions lack depth, context, or the nuanced relat...

Бүрэн тодорхойлолт

Номзүйн дэлгэрэнгүй
Үндсэн зохиолчид:	Jiajia Peng, Tianbing Tang
Формат:	Өгүүллэг
Хэл сонгох:	English
Хэвлэсэн:	MDPI AG 2024-03-01
Цуврал:	Applied Sciences
Нөхцлүүд:	image captioning image features clustering mechanism Chinese language description
Онлайн хандалт:	https://www.mdpi.com/2076-3417/14/6/2657

Интернэт

https://www.mdpi.com/2076-3417/14/6/2657

A Unified Visual and Linguistic Semantics Method for Enhanced Image Captioning

Интернэт

Ижил төстэй зүйлс