2022
DOI: 10.1007/978-981-19-2394-4_8
|View full text |Cite
|
Sign up to set email alerts
|

Comparative Analysis Between Macro and Micro-Accuracy in Imbalance Dataset for Movie Review Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…As indicated in Table 4 , our Bi-LSTM model’s hyperparameters were selected based on the highest results after we tested and compared the outputs of many classifiers using different learning paradigms (different optimal hyperparameters and architectures) [ 86 ]. Since the imbalanced dataset leads to overfitting in the performance of the model [ 87 , 88 , 89 ], consequently, the proposed detection models faced overfitting in their performance because the dataset used in this research is imbalanced (we did not take action to balance the dataset for the reasons referred to in the “Dataset Visualization” section). To prevent the performance of the proposed model from overfitting as well as to increase the generalization of the model to the new data [ 90 , 91 ], and to generate high detection accuracy [ 92 ], we used several techniques and methods, including reducing the complexity of model design [ 93 , 94 ], using weight regularization with L1 and L2 regularizers, where L1 is the sum of the absolute weights, and L2 is the sum of the squared weights [ 90 , 93 ], and adding a dropout layer [ 91 , 92 ].…”
Section: Methodsmentioning
confidence: 99%
“…As indicated in Table 4 , our Bi-LSTM model’s hyperparameters were selected based on the highest results after we tested and compared the outputs of many classifiers using different learning paradigms (different optimal hyperparameters and architectures) [ 86 ]. Since the imbalanced dataset leads to overfitting in the performance of the model [ 87 , 88 , 89 ], consequently, the proposed detection models faced overfitting in their performance because the dataset used in this research is imbalanced (we did not take action to balance the dataset for the reasons referred to in the “Dataset Visualization” section). To prevent the performance of the proposed model from overfitting as well as to increase the generalization of the model to the new data [ 90 , 91 ], and to generate high detection accuracy [ 92 ], we used several techniques and methods, including reducing the complexity of model design [ 93 , 94 ], using weight regularization with L1 and L2 regularizers, where L1 is the sum of the absolute weights, and L2 is the sum of the squared weights [ 90 , 93 ], and adding a dropout layer [ 91 , 92 ].…”
Section: Methodsmentioning
confidence: 99%
“…The author in [37] has carried out research on comparative analysis of macro and micro accuracy through a three-classifier approach, namely: Naïve Bayes (NB), Support Vector Machine (SVM), and Random Forest (RF) in the Movie Reviews dataset. In align previous research, this study proposes two additional classifiers, namely: k-Nearest Neighbors (KNN) and Long Short-Term Memory (LSTM), which will be tested on the Plant-Disease Relation (PDR) dataset.…”
Section: Introductionmentioning
confidence: 99%
“…In comparative analysis, it is necessary to evaluate the model using various performance metrics [5], and it is an interdisciplinary method that encompasses broad cross-sections of disciplines [1]. Comparative analysis is one way to solve the problem of model performance in classifying unbalanced datasets.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation