A Unified Visual and Linguistic Semantics Method for Enhanced Image Captioning
Image captioning, also recognized as the challenge of transforming visual data into coherent natural language descriptions, has persisted as a complex problem. Traditional approaches often suffer from semantic gaps, wherein the generated textual descriptions lack depth, context, or the nuanced relat...
Main Authors: | Jiajia Peng, Tianbing Tang |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-03-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/14/6/2657 |
Similar Items
-
Image Caption Generation via Unified Retrieval and Generation-Based Method
by: Shanshan Zhao, et al.
Published: (2020-09-01) -
A Study of ConvNeXt Architectures for Enhanced Image Captioning
by: Leo Ramos, et al.
Published: (2024-01-01) -
Semantic Representations With Attention Networks for Boosting Image Captioning
by: Deema Abdal Hafeth, et al.
Published: (2023-01-01) -
Style-Enhanced Transformer for Image Captioning in Construction Scenes
by: Kani Song, et al.
Published: (2024-03-01) -
Novel Object Captioning with Semantic Match from External Knowledge
by: Sen Du, et al.
Published: (2023-07-01)