2012
DOI: 10.5120/6280-8449
|View full text |Cite
|
Sign up to set email alerts
|

On the Classification of Imbalanced Datasets

Abstract: In recent research the classifications of imbalanced data sets have received considerable attention. It is natural that due to the class imbalance the classifier tends to favour majority class. In this paper we investigate the performance of different methods for handling data imbalance in the microcalcification classification which is a classical example for data imbalance problem. Micro calcifications are very tiny deposits of calcium that appear as small bright spots in the mammogram. Classification of micr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0
2

Year Published

2012
2012
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 23 publications
(18 citation statements)
references
References 42 publications
0
16
0
2
Order By: Relevance
“…The problem of class imbalancing (i.e. number of feature points from one class is significantly higher than that of the other one) is regarded a crucial problem in machine learning literature and therefore many approaches exist to solve it [24] in order to produce a balanced classifier for the subsequent layers. Popular solutions are sampling techniques like sub-or oversampling additionally combined with boosting.…”
Section: • Compensation Of Missing Data Points Should Be Includedmentioning
confidence: 99%
“…The problem of class imbalancing (i.e. number of feature points from one class is significantly higher than that of the other one) is regarded a crucial problem in machine learning literature and therefore many approaches exist to solve it [24] in order to produce a balanced classifier for the subsequent layers. Popular solutions are sampling techniques like sub-or oversampling additionally combined with boosting.…”
Section: • Compensation Of Missing Data Points Should Be Includedmentioning
confidence: 99%
“…Authors in [8] proposed a SVM-based approach to distinguish microcalcifications from other ROIs. A study on the performance of several classifiers is presented in [6], including SVM, ANN, Bayesian and kNN techniques.…”
Section: Related Workmentioning
confidence: 99%
“…Many computer aided diagnosis (CAD) methods have been developed for analyzing and interpreting the mammogram. In the mammograms [6][7][8], masses are the most important symptoms of abnormality. Accurate diagnosis of masses is very difficult because of their appearances.…”
Section: Introductionmentioning
confidence: 99%