2020
DOI: 10.1109/access.2020.2987364
|View full text |Cite
|
Sign up to set email alerts
|

Using Cost-Sensitive Learning and Feature Selection Algorithms to Improve the Performance of Imbalanced Classification

Abstract: Imbalanced data problem is widely present in network intrusion detection, spam filtering, biomedical engineering, finance, science, being a challenge in many real-life data-intensive applications. Classifier bias occurs when traditional classification algorithms are used to deal with imbalanced data. As already known, the General Vector Machine (GVM) algorithm has good generalization ability, though it does not work well for the imbalanced classification. Additionally, the state-of-the-art Binary Ant Lion Opti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(25 citation statements)
references
References 47 publications
0
19
0
Order By: Relevance
“…However, the problems of classifying imbalanced data often occur in real-life applications such as analyzing medical datasets, where the cases of patients with the disease are significantly lower than those without the disease. For instance, in cancer detection, the cases of patients diagnosed with cancer are much smaller than those of patients who do not have cancer [ 4 ]. The classification model to predict cancer results in lower classification performance of abnormal class and incorrect prediction disease which leads to serious health risk.…”
Section: Introductionmentioning
confidence: 99%
“…However, the problems of classifying imbalanced data often occur in real-life applications such as analyzing medical datasets, where the cases of patients with the disease are significantly lower than those without the disease. For instance, in cancer detection, the cases of patients diagnosed with cancer are much smaller than those of patients who do not have cancer [ 4 ]. The classification model to predict cancer results in lower classification performance of abnormal class and incorrect prediction disease which leads to serious health risk.…”
Section: Introductionmentioning
confidence: 99%
“…The problem of imbalanced classification (misclassification) has been one of the factors in the emergence of a relatively new research topic in the field of machine learning under the name of "cost-sensitive learning" [30]. It consists of associating a "cost" penalty to an incorrect prediction and then trying to minimize the cost of a model on the learning dataset.…”
Section: G Results and Discussionmentioning
confidence: 99%
“…To study the effectiveness of CGAN in addressing the issue of an imbalanced dataset, a comparison is made with two typical approaches: the synthetic minority oversampling technique (SMOTE) [42,43] and cost-sensitive learning (CSL) [44,45]. Table 8 presents the performance of CGAN-IFCM, SMOTE-IFCM, and CSL-IFCM in binary and multi-class voice disorder detection.…”
Section: Comparison Between Cgan Smote and Cslmentioning
confidence: 99%