2019
DOI: 10.1109/access.2019.2899467
|View full text |Cite
|
Sign up to set email alerts
|

A Parameter-Free Cleaning Method for SMOTE in Imbalanced Classification

Abstract: Oversampling is an efficient technique in dealing with class-imbalance problem. It addresses the problem by reduplicating or generating the minority class samples to balance the distribution between the samples of the majority and the minority class. Synthetic minority oversampling technique (SMOTE) is one of the typical representatives. During the past decade, researchers have proposed many variants of SMOTE. However, the existing oversampling methods may generate wrong minority class samples in some scenario… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 50 publications
(27 citation statements)
references
References 25 publications
0
27
0
Order By: Relevance
“…For example, Prati et al [25] developed a study using a set of artificial datasets showing that the degree of class overlapping has a strong correlation with class imbalance. In this scenario, the use of over-sampling methods based on SMOTE [10], [41] has shown to be very effective [4]. The large influence of overlapping on classification performance with respect to the imbalance ratio was also corroborated in the particular case in which the minority class is more represented in the overlapping region than the majority class [14], [15].…”
Section: Class Overlapping Affecting Classification Problems: Rementioning
confidence: 76%
“…For example, Prati et al [25] developed a study using a set of artificial datasets showing that the degree of class overlapping has a strong correlation with class imbalance. In this scenario, the use of over-sampling methods based on SMOTE [10], [41] has shown to be very effective [4]. The large influence of overlapping on classification performance with respect to the imbalance ratio was also corroborated in the particular case in which the minority class is more represented in the overlapping region than the majority class [14], [15].…”
Section: Class Overlapping Affecting Classification Problems: Rementioning
confidence: 76%
“…Tomek Links can be described as a method for undersampling or as a technique for cleaning up data. They can be identified as a pair of the nearest neighbors of opposite classes, which are minimally distant [64]. They are used to remove the overlapping samples that SMOTE adds.…”
Section: Smotetomekmentioning
confidence: 99%
“…It combines benefits (and some limitations) of both approaches mentioned above. Some methods applying the mixed sampling technique are Selective Preprocessing of Imbalanced Data (SPIDER) [33], Optimal Genetic Algorithms based Resampling [34] (OGAR), SMOTE-ENN [35] combining SMOTE and Edited Nearest Neighbors [36], SMOTE-TL [37] combining SMOTE and Tomek Links [38], and parameter-free data cleaning method to SMOTE [39].…”
Section: Related Workmentioning
confidence: 99%