Novel Advance Image Caption Generation Utilizing Vision Transformer and Generative Adversarial Networks

In this paper, we propose a novel method for producing image captions through the utilization of Generative Adversarial Networks (GANs) and Vision Transformers (ViTs) using our proposed Image Captioning Utilizing Transformer and GAN (ICTGAN) model. Here we use the efficient representation learning o...

Full description

Bibliographic Details
Main Authors:	Shourya Tyagi, Olukayode Ayodele Oki, Vineet Verma, Swati Gupta, Meenu Vijarania, Joseph Bamidele Awotunde, Abdulrauph Olanrewaju Babatunde
Format:	Article
Language:	English
Published:	MDPI AG 2024-11-01
Series:	Computers
Subjects:	image caption generation vision transformer generative adversarial networks multi-head self-attention model MS COCO
Online Access:	https://www.mdpi.com/2073-431X/13/12/305

Internet

https://www.mdpi.com/2073-431X/13/12/305

Novel Advance Image Caption Generation Utilizing Vision Transformer and Generative Adversarial Networks

Internet

Similar Items