A performance analysis of transformer-based deep learning models for Arabic image captioning

A performance analysis of transformer-based deep learning models for Arabic image captioning

Image captioning has become a fundamental operation that allows the automatic generation of text descriptions of images. However, most existing work focused on performing the image captioning task in English, and only a few proposals exist that address the image captioning task in Arabic. This paper...

Full description

Bibliographic Details
Main Authors:	Ashwaq Alsayed, Thamir M. Qadah, Muhammad Arif
Format:	Article
Language:	English
Published:	Elsevier 2023-10-01
Series:	Journal of King Saud University: Computer and Information Sciences
Subjects:	Image captioning Arabic image captioning Transformer model Performance analysis and evaluation Deep learning Machine learning
Online Access:	http://www.sciencedirect.com/science/article/pii/S131915782300304X

Similar Items

A Systematic Literature Review on Using the Encoder-Decoder Models for Image Captioning in English and Arabic Languages
by: Ashwaq Alsayed, et al.
Published: (2023-09-01)

An Attentive Fourier-Augmented Image-Captioning Transformer
by: Raymond Ian Osolo, et al.
Published: (2021-09-01)

Arabic Captioning for Images of Clothing Using Deep Learning
by: Rasha Saleh Al-Malki, et al.
Published: (2023-04-01)

Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning
by: Hojun Lee, et al.
Published: (2022-02-01)

An Analysis of the Use of Feed-Forward Sub-Modules for Transformer-Based Image Captioning Tasks
by: Raymond Ian Osolo, et al.
Published: (2021-12-01)

Full-Memory Transformer for Image Captioning
by: Tongwei Lu, et al.
Published: (2023-01-01)

Real-time Arabic Video Captioning Using CNN and Transformer Networks Based on Parallel Implementation
by: Adel Jalal Yousif, et al.
Published: (2024-03-01)

Insights into Object Semantics: Leveraging Transformer Networks for Advanced Image Captioning
by: Deema Abdal Hafeth, et al.
Published: (2024-03-01)

Separate Syntax and Semantics: Part-of-Speech-Guided Transformer for Image Captioning
by: Dong Wang, et al.
Published: (2022-11-01)

A Context Semantic Auxiliary Network for Image Captioning
by: Jianying Li, et al.
Published: (2023-07-01)

Exploring Spatial-Based Position Encoding for Image Captioning
by: Xiaobao Yang, et al.
Published: (2023-11-01)

Metaheuristics Optimization with Deep Learning Enabled Automated Image Captioning System
by: Mesfer Al Duhayyim, et al.
Published: (2022-07-01)

Modeling of Hyperparameter Tuned Deep Learning Model for Automated Image Captioning
by: Mohamed Omri, et al.
Published: (2022-01-01)

Image Captioning with Word Gate and Adaptive Self-Critical Learning
by: Xinxin Zhu, et al.
Published: (2018-06-01)

#PraCegoVer: A Large Dataset for Image Captioning in Portuguese
by: Gabriel Oliveira dos Santos, et al.
Published: (2022-01-01)

Imageability- and Length-Controllable Image Captioning
by: Marc A. Kastner, et al.
Published: (2021-01-01)

Novel Object Captioning with Semantic Match from External Knowledge
by: Sen Du, et al.
Published: (2023-07-01)

Style-Enhanced Transformer for Image Captioning in Construction Scenes
by: Kani Song, et al.
Published: (2024-03-01)

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates
by: Nicholas Moratelli, et al.
Published: (2023-01-01)

Text Augmentation Using BERT for Image Captioning
by: Viktar Atliha, et al.
Published: (2020-08-01)

UAT: Universal Attention Transformer for Video Captioning
by: Heeju Im, et al.
Published: (2022-06-01)

Image Captioning Using Motion-CNN with Object Detection
by: Kiyohiko Iwamura, et al.
Published: (2021-02-01)

From Plane to Hierarchy: Deformable Transformer for Remote Sensing Image Captioning
by: Runyan Du, et al.
Published: (2023-01-01)

Parallel Pathway Dense Video Captioning With Deformable Transformer
by: Wangyu Choi, et al.
Published: (2022-01-01)

Generalized Image Captioning for Multilingual Support
by: Suhyun Cho, et al.
Published: (2023-02-01)

Crop Disease Diagnosis with Deep Learning-Based Image Captioning and Object Detection
by: Dong In Lee, et al.
Published: (2023-02-01)

Caps Captioning: A Modern Image Captioning Approach Based on Improved Capsule Network
by: Shima Javanmardi, et al.
Published: (2022-11-01)

Image-Captioning Model Compression
by: Viktar Atliha, et al.
Published: (2022-02-01)

Thangka Image Captioning Based on Semantic Concept Prompt and Multimodal Feature Optimization
by: Wenjin Hu, et al.
Published: (2023-08-01)

MC-Net: multi-scale contextual information aggregation network for image captioning on remote sensing images
by: Haiyan Huang, et al.
Published: (2023-12-01)

Semantic Representations With Attention Networks for Boosting Image Captioning
by: Deema Abdal Hafeth, et al.
Published: (2023-01-01)

Caption for Cover of Volume 2 Issue 1
by: Amy Christian
Published: (2011-12-01)

Stylized Image Captioning Model Based on Disentangle-Retrieve-Generate
by: CHEN Zhang-hui, XIONG Yun
Published: (2022-06-01)

Folk Games Image Captioning using Object Attention
by: Saiful Akbar, et al.
Published: (2023-08-01)

An Image Captioning Algorithm Based on Combination Attention Mechanism
by: Jinlong Liu, et al.
Published: (2022-04-01)

VSAM-Based Visual Keyword Generation for Image Caption
by: Suya Zhang, et al.
Published: (2021-01-01)

Context-aware visual policy network for fine-grained image captioning
by: Zha, Zheng-Jun, et al.
Published: (2022)

Video Caption Based Searching Using End-to-End Dense Captioning and Sentence Embeddings
by: Akshay Aggarwal, et al.
Published: (2020-06-01)

DIC-Transformer: interpretation of plant disease classification results using image caption generation technology
by: Qingtian Zeng, et al.
Published: (2024-01-01)

Enhanced Image Captioning with Color Recognition Using Deep Learning Methods
by: Yeong-Hwa Chang, et al.
Published: (2021-12-01)