2022
DOI: 10.48084/etasr.5039
|View full text |Cite
|
Sign up to set email alerts
|

The New Dataset MITWPU-1K for Object Recognition and Image Captioning Tasks

Abstract: In the domain of image captioning, many pre-trained datasets are available. Using these datasets, models can be trained to automatically generate image descriptions regarding the contents of an image. Researchers usually do not spend much time in creating and training the new dataset before using it for a specific application, instead, they simply use existing pre-trained datasets. MS COCO, ImageNet, Flicker, and Pascal VOC, are well-known datasets that are widely used in the task of generating image captions.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 11 publications
0
0
0
Order By: Relevance