2022
DOI: 10.26555/jiteki.v8i1.23009
|View full text |Cite
|
Sign up to set email alerts
|

Sentiment Analysis and Topic Modelling of The COVID-19 Vaccine in Indonesia on Twitter Social Media Using Word Embedding

Abstract: This study aims to analyze the sentiments of the Indonesian people towards the COVID-19 vaccine on Twitter. Data collection was carried out from September 2020 to June 2021 with the keyword "covid vaccine," which resulted in 262306 tweets. After filtering and cleaning, there are 83384 tweets left. The labeling process was done manually by an expert. The label composition in the data is 35209 tweets of positive sentiment, 41596 tweets of neutral sentiment, and 6579 tweets of negative sentiment. The remaining da… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 21 publications
0
5
0
Order By: Relevance
“…Machine Learning classifiers deployed, trained, and tested in this work were Logistic regression [30], Support Vector Machine (SVM) [31], K-nearest neighbors (KNN) [32], Decision Tree [33], Stochastic Gradient Descent (SGD) [34], and Multinomial Naive Bayes [35]. In the ensemble learning category, several models were applied to do the same task such as Voting Classifiers [36], Random Forest [37], Bagging Meta-Estimator [38], AdaBoost [39], XGBoost [40], Gradient Boosting [41], and Light Gradient Boosting Machine (LightGBM) [42]. In the deep learning category, different large language models (LLMs) were used such as Bert [43 & 44], RoBERTa [45 & 46], GPT [47 & 48], and GPT2 [49].…”
Section: The Proposed Approachmentioning
confidence: 99%
“…Machine Learning classifiers deployed, trained, and tested in this work were Logistic regression [30], Support Vector Machine (SVM) [31], K-nearest neighbors (KNN) [32], Decision Tree [33], Stochastic Gradient Descent (SGD) [34], and Multinomial Naive Bayes [35]. In the ensemble learning category, several models were applied to do the same task such as Voting Classifiers [36], Random Forest [37], Bagging Meta-Estimator [38], AdaBoost [39], XGBoost [40], Gradient Boosting [41], and Light Gradient Boosting Machine (LightGBM) [42]. In the deep learning category, different large language models (LLMs) were used such as Bert [43 & 44], RoBERTa [45 & 46], GPT [47 & 48], and GPT2 [49].…”
Section: The Proposed Approachmentioning
confidence: 99%
“…Pre-Trained word embedding is a vectorization process to convert words into vectors where the vector represents words that have been changed before [9], [18], [21], [22]. This process is needed because it aims to convert words into vectors so that they can be processed by the model [23]. The dimension of the resulting vector can be determined.…”
Section: Pre-trained Word Embeddingmentioning
confidence: 99%
“…The maximum number of words in this study is 123 and the dimensions used have a dimension length of 300. This study uses the FastText framework because it is more efficient and faster than Glove and Word2vec [18], [23]. FastText is a library owned and managed by fecabook that functions to represent words efficiently and support the text classification process [23].…”
Section: Pre-trained Word Embeddingmentioning
confidence: 99%
See 1 more Smart Citation
“…Marec et al [ 117 ] used concepts of sentiment analysis to detect public sentiments about specific COVID-19 vaccines, such as AstraZeneca/Oxford, Pfizer/BioNTech, and Moderna. The works of Nezhad et al [ 118 ], Agustiningsih et al [ 119 ], and Ponmani et al [ 120 ] presented the results of performing sentiment analysis of Tweets about COVID-19 vaccines from Iran, Indonesia, and India, respectively. In addition to this, the sentiment analysis of relevant Tweets during the COVID-19 pandemic was performed to detect the sentiments of people towards remote work [ 121 ], online education [ 122 ], social distancing [ 123 ], wearing masks [ 124 ], and vaccine boosters [ 125 ].…”
Section: Literature Reviewmentioning
confidence: 99%