Bi-LS-AttM: A Bidirectional LSTM and Attention Mechanism Model for Improving Image Captioning
The discipline of automatic image captioning represents an integration of two pivotal branches of artificial intelligence, namely computer vision (CV) and natural language processing (NLP). The principal functionality of this technology lies in transmuting the extracted visual features into semantic...
Main Authors: | Tian Xie, Weiping Ding, Jinbao Zhang, Xusen Wan, Jiehua Wang |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-07-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/13/13/7916 |
Similar Items
-
Att-BiL-SL: Attention-Based Bi-LSTM and Sequential LSTM for Describing Video in the Textual Formation
by: Shakil Ahmed, et al.
Published: (2021-12-01) -
Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates
by: Nicholas Moratelli, et al.
Published: (2023-01-01) -
Model Semantic Attention (SemAtt) With Hybrid Learning Separable Neural Network and Long Short-Term Memory to Generate Caption
by: Agus Nursikuwagus, et al.
Published: (2024-01-01) -
UAT: Universal Attention Transformer for Video Captioning
by: Heeju Im, et al.
Published: (2022-06-01) -
Explicit Image Caption Reasoning: Generating Accurate and Informative Captions for Complex Scenes with LMM
by: Mingzhang Cui, et al.
Published: (2024-06-01)