2023
DOI: 10.47065/bits.v4i4.3141
|View full text |Cite
|
Sign up to set email alerts
|

Undersampling dan K-Fold Random Forest Untuk Klasifikasi Kelas Tidak Seimbang

Abstract: Classification in Data Mining is a process of modelling that explains and differentiates data classes intending to estimate the class of an object whose class is unknown. Classification can be applied in various aspects so over time quite a lot of classification algorithms have been developed, but some problems are often encountered in classification, namely the problem of data imbalance. An imbalanced class is a condition where there are several data where the number of classes is not balanced or there is a s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 14 publications
(18 reference statements)
0
1
0
Order By: Relevance
“…Overfitting refers to a situation in which the model has mostly memorized the irrelevant details or noise included in the training data rather than grasping the fundamental patterns. Consequently, this can result in inadequate generalization when applied to novel, unknown data [13]. Resampling approaches, such as oversampling (the addition of samples) or undersampling (the reduction of instances), have the potential to incorporate novel models into the dataset.…”
Section: Modeling and Evaluation Of Classification Resultsmentioning
confidence: 99%
“…Overfitting refers to a situation in which the model has mostly memorized the irrelevant details or noise included in the training data rather than grasping the fundamental patterns. Consequently, this can result in inadequate generalization when applied to novel, unknown data [13]. Resampling approaches, such as oversampling (the addition of samples) or undersampling (the reduction of instances), have the potential to incorporate novel models into the dataset.…”
Section: Modeling and Evaluation Of Classification Resultsmentioning
confidence: 99%