A Unified Visual and Linguistic Semantics Method for Enhanced Image Captioning

Image captioning, also recognized as the challenge of transforming visual data into coherent natural language descriptions, has persisted as a complex problem. Traditional approaches often suffer from semantic gaps, wherein the generated textual descriptions lack depth, context, or the nuanced relat...

Bibliographic Details
Main Authors:	Jiajia Peng, Tianbing Tang
Format:	Article
Language:	English
Published:	MDPI AG 2024-03-01
Series:	Applied Sciences
Subjects:	image captioning; image features; clustering mechanism; Chinese language description
Online Access:	https://www.mdpi.com/2076-3417/14/6/2657

Similar Items

Cascade Semantic Fusion for Image Captioning
by: Shiwei Wang, et al.
Published: (2019-01-01)

A Scientometric Visualization Analysis of Image Captioning Research From 2010 to 2020
by: Wenxuan Liu, et al.
Published: (2021-01-01)

Image Caption Generation via Unified Retrieval and Generation-Based Method
by: Shanshan Zhao, et al.
Published: (2020-09-01)

A Study of ConvNeXt Architectures for Enhanced Image Captioning
by: Leo Ramos, et al.
Published: (2024-01-01)

Semantic Representations With Attention Networks for Boosting Image Captioning
by: Deema Abdal Hafeth, et al.
Published: (2023-01-01)

Novel Object Captioning with Semantic Match from External Knowledge
by: Sen Du, et al.
Published: (2023-07-01)

Enhancing Surveillance Systems: Integration of Object, Behavior, and Space Information in Captions for Advanced Risk Assessment
by: Minseong Jeon, et al.
Published: (2024-01-01)

Variational Autoencoder-Based Multiple Image Captioning Using a Caption Attention Map
by: Boeun Kim, et al.
Published: (2019-07-01)

Style-Enhanced Transformer for Image Captioning in Construction Scenes
by: Kani Song, et al.
Published: (2024-03-01)

A Systematic Literature Review on Using the Encoder-Decoder Models for Image Captioning in English and Arabic Languages
by: Ashwaq Alsayed, et al.
Published: (2023-09-01)

VAA: Visual Aligning Attention Model for Remote Sensing Image Captioning
by: Zhengyuan Zhang, et al.
Published: (2019-01-01)

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates
by: Nicholas Moratelli, et al.
Published: (2023-01-01)

Explicit Image Caption Reasoning: Generating Accurate and Informative Captions for Complex Scenes with LMM
by: Mingzhang Cui, et al.
Published: (2024-06-01)

Exploring better image captioning with grid features
by: Jie Yan, et al.
Published: (2024-02-01)

Insights into Object Semantics: Leveraging Transformer Networks for Advanced Image Captioning
by: Deema Abdal Hafeth, et al.
Published: (2024-03-01)

A Context Semantic Auxiliary Network for Image Captioning
by: Jianying Li, et al.
Published: (2023-07-01)

Tiny TR-CAP: A novel small-scale benchmark dataset for general-purpose image captioning tasks
by: Abbas Memiş, et al.
Published: (2025-04-01)

Image Captioning Based on Semantic Scenes
by: Fengzhi Zhao, et al.
Published: (2024-10-01)

#PraCegoVer: A Large Dataset for Image Captioning in Portuguese
by: Gabriel Oliveira dos Santos, et al.
Published: (2022-01-01)

Context-Driven Image Caption With Global Semantic Relations of the Named Entities
by: Yun Jing, et al.
Published: (2020-01-01)

Chinese image captioning with fusion encoder and visual keyword search
by: Yang Zou, et al.
Published: (2024-09-01)

Semantic interdisciplinary evaluation of image captioning models
by: Uddagiri Sirisha, et al.
Published: (2022-12-01)

Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
by: Ling Cheng, et al.
Published: (2021)

Text Augmentation Using BERT for Image Captioning
by: Viktar Atliha, et al.
Published: (2020-08-01)

VSAM-Based Visual Keyword Generation for Image Caption
by: Suya Zhang, et al.
Published: (2021-01-01)

Separate Syntax and Semantics: Part-of-Speech-Guided Transformer for Image Captioning
by: Dong Wang, et al.
Published: (2022-11-01)

Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
by: Ling Cheng, et al.
Published: (2020-01-01)

Generalized Image Captioning for Multilingual Support
by: Suhyun Cho, et al.
Published: (2023-02-01)

The Optimal Choice of the Encoder–Decoder Model Components for Image Captioning
by: Mateusz Bartosiewicz, et al.
Published: (2024-08-01)

Cross-scale Feature Fusion Self-attention for Image Captioning
by: WANG Ming-zhan, JI Jun-zhong, JIA Ao-zhe, ZHANG Xiao-dan
Published: (2022-10-01)

Semantic-Guided Selective Representation for Image Captioning
by: Yinan Li, et al.
Published: (2023-01-01)

Extracting Structured Supervision From Captions for Weakly Supervised Semantic Segmentation
by: Daniel R. Vilar, et al.
Published: (2021-01-01)

Component based comparative analysis of each module in image captioning
by: Seoung-Ho Choi, et al.
Published: (2021-03-01)

Image-Caption Model Based on Fusion Feature
by: Yaogang Geng, et al.
Published: (2022-09-01)

CLIP-Based Grid Features and Masking for Remote Sensing Image Captioning
by: Qiaoling Lin, et al.
Published: (2025-01-01)

Image Captioning with multi-level similarity-guided semantic matching
by: Jiesi Li, et al.
Published: (2021-12-01)

An Attentive Fourier-Augmented Image-Captioning Transformer
by: Raymond Ian Osolo, et al.
Published: (2021-09-01)

Image-Captioning Model Compression
by: Viktar Atliha, et al.
Published: (2022-02-01)

EAES: Effective Augmented Embedding Spaces for Text-Based Image Captioning
by: Khang Nguyen, et al.
Published: (2022-01-01)

Panoptic Segmentation-Based Attention for Image Captioning
by: Wenjie Cai, et al.
Published: (2020-01-01)