2022
DOI: 10.3390/electronics11172703
|View full text |Cite
|
Sign up to set email alerts
|

KDE-Based Ensemble Learning for Imbalanced Data

Abstract: Imbalanced class distribution affects many applications in machine learning, including medical diagnostics, text classification, intrusion detection and many others. In this paper, we propose a novel ensemble classification method designed to deal with imbalanced data. The proposed method trains each tree in the ensemble using uniquely generated synthetically balanced data. The data balancing is carried out via kernel density estimation, which offers a natural and effective approach to generating new sample po… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…These are then used as input features for the meta-model, which subsequently determines the final prediction for that instance. The overarching goal of the stacking model is to capitalize on the strengths of each individual model, aiming to produce predictions that are more accurate than any single model could achieve on its own [40].…”
Section: Model Descriptionsmentioning
confidence: 99%
“…These are then used as input features for the meta-model, which subsequently determines the final prediction for that instance. The overarching goal of the stacking model is to capitalize on the strengths of each individual model, aiming to produce predictions that are more accurate than any single model could achieve on its own [40].…”
Section: Model Descriptionsmentioning
confidence: 99%
“…Algorithm 1 illustrates the operation process of the KGSMOTE module. Firstly, the KDE [40] is used to estimate the probability density function of rareclass attack samples. Then, the generated rare-class attack data distribution is extracted from this probability density function, as shown in Equation (3).…”
Section: Imbalanced Data Processing Module Based On Kgsmotementioning
confidence: 99%
“…Kernel density estimation (KDE) [17] is an unsupervised probability distribution learning method. In recent studies [18][19][20], it functions as a data analyst and classifier. The single KDE analyst fails to capture the features of data with many details and frequent variations such as symbols in NMN.…”
Section: Introductionmentioning
confidence: 99%