Automatic Image Caption Generation Based on Some Machine Learning Algorithms

Predić, Bratislav; Manic, D.; Saračević, Muzafer; Karabašević, Darjan; Stanujkić, Dragiša

doi:10.1155/2022/4001460

Cited by 7 publications

(3 citation statements)

References 28 publications

“…In the recent years, machine learning has been used for multiple tasks of text analysis to allow complex analysis of text. These include text generation such as automatic document classification [Kadhim, 2019;Kowsari et al, 2019], text generation [Gatt and Krahmer, 2018;de Rosa and Papa, 2021], text summarization [Gambhir and Gupta, 2017;El-Kassas et al, 2021], sentiment analysis [Zhang et al, 2018;Yadav and Vishwakarma, 2020], automatic caption generation [Bai and An, 2018;Hossain et al, 2019;Predić et al, 2022].…”

Section: Discussion (mentioning)

confidence: 99%

A data science and machine learning approach to continuous analysis of Shakespeare's plays

Swisher

Shamir

2023

Journal of Data Mining & Digital Humanities

The availability of quantitative text analysis methods has provided new ways of analyzing literature in a manner that was not available in the pre-information era. Here we apply comprehensive machine learning analysis to the work of William Shakespeare. The analysis shows clear changes in the style of writing over time, with the most significant changes in the sentence length, frequency of adjectives and adverbs, and the sentiments expressed in the text. Applying machine learning to make a stylometric prediction of the year of the play shows a Pearson correlation of 0.71 between the actual and predicted year, indicating that Shakespeare's writing style as reflected by the quantitative measurements changed over time. Additionally, it shows that the stylometrics of some of the plays is more similar to plays written either before or after the year they were written. For instance, Romeo and Juliet is dated 1596, but is more similar in stylometrics to plays written by Shakespeare after 1600. The source code for the analysis is available for free download.

“…Next, in the second stage, the Querying Transformer is pre-trained for vision-to-language generative learning, utilizing a frozen Large Language Model (LLM). The authors in [20,21] have presented their work based on CNN and LSTM with integration with ML algorithms. The authors in [22] presented their work of generating the captions for the text summarization technique using an ML-based pre-trained algorithm.…”

Section: Related Work (mentioning)

confidence: 99%

“…To achieve pre-training of a unified vision-language model that combines comprehension and generation abilities, the Bootstrapping Language-Image Pre-training (BLIP) model introduces a multimodal encoder-decoder architecture. This architecture serves three key functions [19,20]:…”

Section: Bootstrapping Process For Language-image Pretraining Model (mentioning)

confidence: 99%

Enhancing User Profile Authenticity through Automatic Image Caption Generation Using a Bootstrapping Language–Image Pre-Training Model

Bharne

Bhaladhare

2024

RAiSE-2023

A multi-classifier system for automatic fingerprint classification using transfer learning and majority voting

Walhazi

Maalej

Amara

2023

Multimed Tools Appl
