2021
DOI: 10.1016/j.ins.2021.07.053
|View full text |Cite
|
Sign up to set email alerts
|

UFFDFR: Undersampling framework with denoising, fuzzy c-means clustering, and representative sample selection for imbalanced data classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 38 publications
(13 citation statements)
references
References 40 publications
0
12
0
1
Order By: Relevance
“…This resulted in two classes (minority and majority), which is a clear indication of an unbalanced dataset. In such cases, the cost of classifying and assigning all samples into the majority class (i.e., composite class in this case) is relatively low. In such circumstances, oversampling methods are handy tools to overcome imbalanced data by generating an adequate amount of minority class samples . In this study, three distinct oversampling methods were used, and they are explained here.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…This resulted in two classes (minority and majority), which is a clear indication of an unbalanced dataset. In such cases, the cost of classifying and assigning all samples into the majority class (i.e., composite class in this case) is relatively low. In such circumstances, oversampling methods are handy tools to overcome imbalanced data by generating an adequate amount of minority class samples . In this study, three distinct oversampling methods were used, and they are explained here.…”
Section: Methodsmentioning
confidence: 99%
“…In such circumstances, oversampling methods are handy tools to overcome imbalanced data by generating an adequate amount of minority class samples. 31 In this study, three distinct oversampling methods were used, and they are explained here. In what follows, x i is a vector repressing the extracted features, namely, T i 0 , T i L , m i , and dt i , sampled during test i:…”
Section: Sample Preparationmentioning
confidence: 99%
“…New sampling proposals for specific use cases like e.g. balancing imbalanced classes [49,57], online sampling [7,42] or reinforcement learning [13,15] emerge constantly. We do not intend to cover all of these here, but simply mention that there is a large and growing body of literature to assist in obtaining the best possible sampling strategy for a given purpose.…”
Section: Statistical Methodologymentioning
confidence: 99%
“…However, when dealing with a large data sets, the computation cost is quite severe [27]. UFFDFR [28] uses a denoising stage to reduce the effect of noise samples and fuzzy c-means clustering algorithm to improve the performance of classification, and selects the representative samples based on distance. The three parts of UFFDFR are in sequence, so any unsatisfactory part will affect the final result.…”
Section: Related Workmentioning
confidence: 99%