A Unified Visual and Linguistic Semantics Method for Enhanced Image Captioning

Image captioning, also recognized as the challenge of transforming visual data into coherent natural language descriptions, has persisted as a complex problem. Traditional approaches often suffer from semantic gaps, wherein the generated textual descriptions lack depth, context, or the nuanced relat...


Bibliographic Details
Main Authors: Jiajia Peng, Tianbing Tang
Format: Article
Language: English
Published: MDPI AG 2024-03-01
Series: Applied Sciences
Online Access: https://www.mdpi.com/2076-3417/14/6/2657