Automatic Image Caption Generation Using Deep Learning

Verma, Akash; Yadav, Arun Kumar; Kumar, Mohit; Yadav, Divakar

doi:10.21203/rs.3.rs-1282936/v1

Cited by 3 publications

(4 citation statements)

References 39 publications

“…Moreover, the proposed transformer model has demonstrated superior performance (Table 1) in terms of the BLEU-4 score (0.71) and METEOR score (0.81), indicating higher accuracy and fluency in caption generation compared with the traditional CNN model proposed by Akash Verma et al [21]. The authors of the study demonstrated that BLEU-4 score of the generated picture was (0.66) and a METEOR score of (0.50) using the "Flickr8k" dataset.…”

Section: Comparison Of ViT To CNN (mentioning)

confidence: 95%

Transforming Healthcare: Leveraging Vision-Based Neural Networks for Smart Home Patient Monitoring

Gibet Tani, Eloutouate, Elouaai, et al. 2023

Int. J. Onl. Eng.

Image captioning is a promising technique for remote monitoring of patient behavior, enabling healthcare providers to identify changes in patient routines and conditions. In this study, we explore the use of transformer neural networks for image caption generation from surveillance camera footage, captured at regular intervals of one minute. Our goal is to develop and evaluate a transformer neural network model, trained and tested on the COCO (common objects in context) dataset, for generating captions that describe patient behavior. Furthermore, we will compare our proposed approach with a traditional convolutional neural network (CNN) method to highlight the prominence of our proposed approach. Our findings demonstrate the potential of transformer neural networks in generating natural language descriptions of patient behavior, which can provide valuable insights for healthcare providers. The use of such technology can allow for more efficient monitoring of patients, enabling timely interventions when necessary. Moreover, our study highlights the potential of transformer neural networks in identifying patterns and trends in patient behavior over time, which can aid in developing personalized healthcare plans.
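The BLEU-4 figures quoted in the citation statement above are n-gram overlap scores between a generated caption and one or more reference captions. As a rough illustration only, the sketch below computes an unsmoothed, sentence-level BLEU-4 in plain Python. The function names `ngrams` and `bleu4` are hypothetical, and neither the cited paper nor the citing paper states which implementation, tokenization, or smoothing they used, so their reported numbers will not necessarily match this formulation.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidate, references):
    """Sentence-level BLEU-4 with uniform weights and no smoothing.

    candidate: list of tokens; references: list of token lists.
    This is an illustrative sketch, not the scoring code of any cited paper.
    """
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, 5):
        cand_counts = ngrams(candidate, n)
        total = sum(cand_counts.values())
        # Clip each n-gram count by its maximum count in any single reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        if total == 0 or clipped == 0:
            return 0.0  # unsmoothed BLEU is zero if any n-gram precision is zero
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty uses the reference length closest to the candidate length.
    c = len(candidate)
    r = min((len(ref) for ref in references), key=lambda length: (abs(length - c), length))
    bp = 1.0 if c > r else math.exp(1 - r / c)
    # Geometric mean of the four modified precisions, scaled by the penalty.
    return bp * math.exp(sum(log_precisions) / 4)


if __name__ == "__main__":
    caption = "a dog runs on the grass".split()
    refs = ["a dog runs on the beach".split()]
    print(round(bleu4(caption, refs), 4))
```

Published scores are usually computed at corpus level and often with smoothing, so a single-sentence score like this one is best read as a way to see how clipped n-gram precision and the brevity penalty combine, not as a way to reproduce the quoted values.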

“…Traditional methods relied upon search based and template based techniques which came with the drawback of major dependency on the datasets to generate captions [11] [12]. On the other hand, the deep learning methods turned the direction by introducing encoder-decoder framework [13], attention based model, reinforcement learning [25] and so on.…”

Section: Introduction (mentioning)

confidence: 99%

“…The different datasets such as Flickr8k, Flickr30k, MSCOCO and Pascal1K used for training the model[24]. Verma et al has described neural network-based model for automatic image captioning and has presented BLEU scores comparison of proposed model with other existing models for different images[25].…”

mentioning

confidence: 99%

Study and Development of Image Caption Generation using Various Encoders for Different Image Categories

Namdev, Reddy 2023

Preprint

Images can act as the source of information or way of communication. Captioning an image by machines in a manner such that it conveys the true meaning is considered the most difficult task. It is indeed a process of deep analysis and most researched area. This paper presents a comparative analysis of different deep learning models such as Inception V3, Resnet50 and VGG16 based on a novel image captioning methodology applied on different picture categories. The hybrid approach is developed to get high BLEU score for each input images. The data set generation, implementation of hybrid approach, and the challenges along with the future work are discussed.

Smart Assistant for Visually Impaired People using Deep Learning Algorithms

Bipin, Abirami 2023

2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS)
