2022
DOI: 10.3390/app12157724
|View full text |Cite
|
Sign up to set email alerts
|

Metaheuristics Optimization with Deep Learning Enabled Automated Image Captioning System

Abstract: Image captioning is a popular topic in the domains of computer vision and natural language processing (NLP). Recent advancements in deep learning (DL) models have enabled the improvement of the overall performance of the image captioning approach. This study develops a metaheuristic optimization with a deep learning-enabled automated image captioning technique (MODLE-AICT). The proposed MODLE-AICT model focuses on the generation of effective captions to the input images by using two processes involving encodin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 28 publications
0
7
0
Order By: Relevance
“…The Image captioning outcomes of the RDOAI-ICS model on the Flickr8K dataset is given in Table 1 and Fig. 4 [22]. The experimental outcomes revealed that the RDOAI-ICS model had outperformed other Image captioning approaches under all measures.…”
Section: Results Analysismentioning
confidence: 94%
See 1 more Smart Citation
“…The Image captioning outcomes of the RDOAI-ICS model on the Flickr8K dataset is given in Table 1 and Fig. 4 [22]. The experimental outcomes revealed that the RDOAI-ICS model had outperformed other Image captioning approaches under all measures.…”
Section: Results Analysismentioning
confidence: 94%
“…Result analysis of RDOAI-ICS approach under Flickr8K datasetA comparative analysis of the RDOAI-ICS model on the Flickr8K dataset is given in Table2and Fig. 5[22]. The experimental outcomes displayed by the RDOAI-ICS algorithm have exhibited other Image captioning techniques in all measures.…”
mentioning
confidence: 89%
“…3 illustrates some sample images. A set of brief comparative analyses is made with recent methods, namely hard-attention, Neural Image Caption (NIC) [36], softattention [37], hard attention [38], Spatial and Channel-wise Attention with CNN VGG (SCA-CNN-VGG) [39], and CNN [40] methods. In this study, three parameters were employed for investigational validation such as CIDEr, BLEU, and Meter.…”
Section: Performance Validationmentioning
confidence: 99%
“…IMAGE CAPTIONING RESULT OF LSAHCNN-ICS TECHNIQUE WITH OTHERAPPROACHES UNDER FLICKR30K DATASET[36][37][38][39][40] …”
mentioning
confidence: 99%
See 1 more Smart Citation