2021
DOI: 10.11591/eei.v10i5.3157
|View full text |Cite
|
Sign up to set email alerts
|

Performance comparison of TF-IDF and Word2Vec models for emotion text classification

Abstract: Emotion is the human feeling when communicating with other humans or reaction to everyday events. Emotion classification is needed to recognize human emotions from text. This study compare the performance of the TF-IDF and Word2Vec models to represent features in the emotional text classification. We use the support vector machine (SVM) and Multinomial Naïve Bayes (MNB) methods for classification of emotional text on commuter line and transjakarta tweet data. The emotion classification in this study has two st… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
22
0
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 61 publications
(24 citation statements)
references
References 22 publications
0
22
0
2
Order By: Relevance
“…The more often a word appears in many documents, the smaller the IDF value will be. The following is the formula for TF-IDF [19]:…”
Section: Feature Extraction Using Tf-idf Vectorizermentioning
confidence: 99%
“…The more often a word appears in many documents, the smaller the IDF value will be. The following is the formula for TF-IDF [19]:…”
Section: Feature Extraction Using Tf-idf Vectorizermentioning
confidence: 99%
“…The greater the term in the document, the greater the weight value. The term frequency (TF) formula can be defined by [21]:…”
Section: Tf-idf Vectorizermentioning
confidence: 99%
“…After preprocessing, the data is ready to be processed at the next stage, weighting with TF-IDF. Term Frequency-Inverse Document Frequency, commonly known as TF-IDF, is a method of determining the weight of a word by giving different weights to each word in a document based on the frequency of words per document and the frequency of words in all documents [13]. The first step in this process is to calculate the frequency of appearance of a word in a document (TF) with the equation (1).…”
Section: Term Frequency-inverse Document Frequency (Tf-idf) Word Weig...mentioning
confidence: 99%
“…Next, the calculation of the number of documents containing a certain word is carried out, and then calculated its inverse (IDF) [13] with equations (2).…”
Section: Term Frequency-inverse Document Frequency (Tf-idf) Word Weig...mentioning
confidence: 99%
See 1 more Smart Citation