2023
DOI: 10.1002/cpe.7894
|View full text |Cite
|
Sign up to set email alerts
|

Entropy and improved k‐nearest neighbor search based under‐sampling (ENU) method to handle class overlap in imbalanced datasets

Anil Kumar,
Dinesh Singh,
Rama Shankar Yadav

Abstract: SummaryMany real‐world application datasets such as medical diagnostics, fraud detection, biological classification, risk analysis and so forth are facing class imbalance and overlapping problems. It seriously affects the learning of the classification model on these datasets because minority instances are not visible to the learner in the overlapped region and the performance of learners is biased towards the majority. Undersampling‐based methods are the most commonly used techniques to handle the above‐menti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2025
2025

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(1 citation statement)
references
References 79 publications
0
1
0
Order By: Relevance
“…Previous studies have extensively investigated methods to improve model accuracy on imbalanced data, including data resampling (such as under-sampling majority classes [19][20][21][22] or over-sampling minority classes [23][24][25] ), generating synthetic data 1,13,14 and cost-sensitive learning 26,27 that assigns higher weights to the loss of minority classes. However, these studies mostly focused on imbalanced classification, with few efforts dedicated to addressing imbalanced regression tasks.…”
Section: Imbalanced Learningmentioning
confidence: 99%
“…Previous studies have extensively investigated methods to improve model accuracy on imbalanced data, including data resampling (such as under-sampling majority classes [19][20][21][22] or over-sampling minority classes [23][24][25] ), generating synthetic data 1,13,14 and cost-sensitive learning 26,27 that assigns higher weights to the loss of minority classes. However, these studies mostly focused on imbalanced classification, with few efforts dedicated to addressing imbalanced regression tasks.…”
Section: Imbalanced Learningmentioning
confidence: 99%