2020
DOI: 10.1007/s00521-020-05130-z
|View full text |Cite
|
Sign up to set email alerts
|

CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification

Abstract: Class-imbalanced datasets are common across several domains such as health, banking, security, and others. The dominance of majority class instances (negative class) often results in biased learning models, and therefore, classifying such datasets requires employing some methods to compact the problem. In this paper, we propose a new hybrid approach aiming at reducing the dominance of the majority class instances using class decomposition and increasing the minority class instances using an oversampling method… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
30
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 65 publications
(31 citation statements)
references
References 44 publications
1
30
0
Order By: Relevance
“…The original CDSMOTE method presented in [10] is comprised of two steps: 1) class decomposition to redistribute the number of samples per class without losing any sample and 2) oversampling the new minority class(es) to reduce the dominance of the new majority class(es). Regarding the first step, class decomposition can be broadly described as the process of clustering class-instances into smaller groups by means of unsupervised learning algorithms.…”
Section: B Cdsmote For Multi-class Datasetsmentioning
confidence: 99%
See 4 more Smart Citations
“…The original CDSMOTE method presented in [10] is comprised of two steps: 1) class decomposition to redistribute the number of samples per class without losing any sample and 2) oversampling the new minority class(es) to reduce the dominance of the new majority class(es). Regarding the first step, class decomposition can be broadly described as the process of clustering class-instances into smaller groups by means of unsupervised learning algorithms.…”
Section: B Cdsmote For Multi-class Datasetsmentioning
confidence: 99%
“…In [22], an evolutionarybased method namely Genetic Algorithm was used to optimise a set of parameters including the best k values, and again an improved classification accuracy was achieved when the proposed method was tested on 22 different life science and medical datasets. More recently, class-decomposition was successfully applied to handle class-imbalance across various public and common imbalanced binary datasets [10]. The authors applied class-decomposition to reduce the dominance of the majority class instances, to then oversample the minority class instances.…”
Section: B Cdsmote For Multi-class Datasetsmentioning
confidence: 99%
See 3 more Smart Citations