Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language Processing

Recently, deep learning models have become more prominent due to their tremendous performance for real-time tasks like face recognition, object detection, natural language processing (NLP), instance segmentation, image classification, gesture recognition, and video classification. Image captioning i...

Celý popis

Podrobná bibliografie
Hlavní autoři: Rana Othman Alnashwan, Samia Allaoua Chelloug, Nabil Sharaf Almalki, Imene Issaoui, Abdelwahed Motwakel, Ahmed Sayed
Médium: Článek
Jazyk:English
Vydáno: IEEE 2023-01-01
Edice:IEEE Access
Témata:
On-line přístup:https://ieeexplore.ieee.org/document/10355962/
_version_ 1827398971380727808
author Rana Othman Alnashwan
Samia Allaoua Chelloug
Nabil Sharaf Almalki
Imene Issaoui
Abdelwahed Motwakel
Ahmed Sayed
author_facet Rana Othman Alnashwan
Samia Allaoua Chelloug
Nabil Sharaf Almalki
Imene Issaoui
Abdelwahed Motwakel
Ahmed Sayed
author_sort Rana Othman Alnashwan
collection DOAJ
description Recently, deep learning models have become more prominent due to their tremendous performance for real-time tasks like face recognition, object detection, natural language processing (NLP), instance segmentation, image classification, gesture recognition, and video classification. Image captioning is one of the critical tasks in NLP and computer vision (CV). It completes conversion from image to text; specifically, the model produces description text automatically based on the input images. In this aspect, this article develops a Lighting Search Algorithm (LSA) with a Hybrid Convolutional Neural Network Image Captioning System (LSAHCNN-ICS) for NLP. This introduced LSAHCNN-ICS system develops an end-to-end model which employs convolutional neural network (CNN) based ShuffleNet as an encoder and HCNN as a decoder. At the encoding part, the ShuffleNet model derives feature descriptors of the image. Besides, in the decoding part, the description of text can be generated using the proposed hybrid convolutional neural network (HCNN) model. To achieve improved captioning results, the LSA is applied as a hyperparameter tuning strategy, representing the innovation of the study. The simulation analysis of the presented LSAHCNN-ICS technique is performed on a benchmark database, and the obtained results demonstrated the enhanced outcomes of the LSAHCNN-ICS algorithm over other recent methods with maximum Consensus-based Image Description Evaluation (CIDEr Code) of 43.60, 59.54, and 135.14 on Flickr8k, Flickr30k, and MSCOCO datasets correspondingly.
first_indexed 2024-03-08T19:37:34Z
format Article
id doaj.art-a9f94894aeae4f27bf785f94f07d0806
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-08T19:37:34Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-a9f94894aeae4f27bf785f94f07d08062023-12-26T00:04:39ZengIEEEIEEE Access2169-35362023-01-011114264314265110.1109/ACCESS.2023.334270310355962Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language ProcessingRana Othman Alnashwan0Samia Allaoua Chelloug1https://orcid.org/0000-0002-9711-0235Nabil Sharaf Almalki2Imene Issaoui3Abdelwahed Motwakel4https://orcid.org/0000-0003-4084-5457Ahmed Sayed5Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box, 84428, Riyadh, Saudi ArabiaDepartment of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box, 84428, Riyadh, Saudi ArabiaDepartment of Special Education, College of Education, King Saud University, Riyadh, Saudi ArabiaUnit of Scientific Research, Applied College, Qassim University, Buraydah, Saudi ArabiaDepartment of Information Systems, College of Business Administration in Hawtat Bani Tamim, Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi ArabiaResearch Center, Future University in Egypt, New Cairo, EgyptRecently, deep learning models have become more prominent due to their tremendous performance for real-time tasks like face recognition, object detection, natural language processing (NLP), instance segmentation, image classification, gesture recognition, and video classification. Image captioning is one of the critical tasks in NLP and computer vision (CV). It completes conversion from image to text; specifically, the model produces description text automatically based on the input images. In this aspect, this article develops a Lighting Search Algorithm (LSA) with a Hybrid Convolutional Neural Network Image Captioning System (LSAHCNN-ICS) for NLP. This introduced LSAHCNN-ICS system develops an end-to-end model which employs convolutional neural network (CNN) based ShuffleNet as an encoder and HCNN as a decoder. At the encoding part, the ShuffleNet model derives feature descriptors of the image. Besides, in the decoding part, the description of text can be generated using the proposed hybrid convolutional neural network (HCNN) model. To achieve improved captioning results, the LSA is applied as a hyperparameter tuning strategy, representing the innovation of the study. The simulation analysis of the presented LSAHCNN-ICS technique is performed on a benchmark database, and the obtained results demonstrated the enhanced outcomes of the LSAHCNN-ICS algorithm over other recent methods with maximum Consensus-based Image Description Evaluation (CIDEr Code) of 43.60, 59.54, and 135.14 on Flickr8k, Flickr30k, and MSCOCO datasets correspondingly.https://ieeexplore.ieee.org/document/10355962/Deep convolutional neural networknatural language processingimage captioningmachine learninghyperparameter tuning
spellingShingle Rana Othman Alnashwan
Samia Allaoua Chelloug
Nabil Sharaf Almalki
Imene Issaoui
Abdelwahed Motwakel
Ahmed Sayed
Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language Processing
IEEE Access
Deep convolutional neural network
natural language processing
image captioning
machine learning
hyperparameter tuning
title Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language Processing
title_full Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language Processing
title_fullStr Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language Processing
title_full_unstemmed Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language Processing
title_short Lighting Search Algorithm With Convolutional Neural Network-Based Image Captioning System for Natural Language Processing
title_sort lighting search algorithm with convolutional neural network based image captioning system for natural language processing
topic Deep convolutional neural network
natural language processing
image captioning
machine learning
hyperparameter tuning
url https://ieeexplore.ieee.org/document/10355962/
work_keys_str_mv AT ranaothmanalnashwan lightingsearchalgorithmwithconvolutionalneuralnetworkbasedimagecaptioningsystemfornaturallanguageprocessing
AT samiaallaouachelloug lightingsearchalgorithmwithconvolutionalneuralnetworkbasedimagecaptioningsystemfornaturallanguageprocessing
AT nabilsharafalmalki lightingsearchalgorithmwithconvolutionalneuralnetworkbasedimagecaptioningsystemfornaturallanguageprocessing
AT imeneissaoui lightingsearchalgorithmwithconvolutionalneuralnetworkbasedimagecaptioningsystemfornaturallanguageprocessing
AT abdelwahedmotwakel lightingsearchalgorithmwithconvolutionalneuralnetworkbasedimagecaptioningsystemfornaturallanguageprocessing
AT ahmedsayed lightingsearchalgorithmwithconvolutionalneuralnetworkbasedimagecaptioningsystemfornaturallanguageprocessing