2024
DOI: 10.3390/computers13120305
|View full text |Cite
|
Sign up to set email alerts
|

Novel Advance Image Caption Generation Utilizing Vision Transformer and Generative Adversarial Networks

Shourya Tyagi,
Olukayode Ayodele Oki,
Vineet Verma
et al.

Abstract: In this paper, we propose a novel method for producing image captions through the utilization of Generative Adversarial Networks (GANs) and Vision Transformers (ViTs) using our proposed Image Captioning Utilizing Transformer and GAN (ICTGAN) model. Here we use the efficient representation learning of the ViTs to improve the realistic image production of the GAN. Using textual features from the LSTM-based language model, our proposed model combines salient information extracted from images using ViTs. This merg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 20 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?