2017
DOI: 10.1371/journal.pone.0177678
|View full text |Cite
|
Sign up to set email alerts
|

Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric

Abstract: Data imbalance is frequently encountered in biomedical applications. Resampling techniques can be used in binary classification to tackle this issue. However such solutions are not desired when the number of samples in the small class is limited. Moreover the use of inadequate performance metrics, such as accuracy, lead to poor generalization results because the classifiers tend to predict the largest size class. One of the good approaches to deal with this issue is to optimize performance metrics that are des… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

4
624
0
8

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 1,013 publications
(636 citation statements)
references
References 18 publications
4
624
0
8
Order By: Relevance
“…For a thorough evaluation of the accuracy of our test, and to account for the strong imbalance between benign and atypical/anaplastic meningiomas in our database, the Matthews Correlation Coefficient was calculated using the Multi Class Confusion Matrix function embedded in MatLab. The Matthews Correlation Coefficient takes values in the interval [–1, 1], with 1 showing a complete agreement, –1 a complete disagreement, and 0 showing that the prediction was uncorrelated with the ground truth . A schematic representation of the workflow used in this retrospective study is reported in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…For a thorough evaluation of the accuracy of our test, and to account for the strong imbalance between benign and atypical/anaplastic meningiomas in our database, the Matthews Correlation Coefficient was calculated using the Multi Class Confusion Matrix function embedded in MatLab. The Matthews Correlation Coefficient takes values in the interval [–1, 1], with 1 showing a complete agreement, –1 a complete disagreement, and 0 showing that the prediction was uncorrelated with the ground truth . A schematic representation of the workflow used in this retrospective study is reported in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…In general, MCC is considered a balanced measure that can be used for classes that have various sizes, 31 ð27Þ Tables 12-13 and 14 give the performance analysis of the FA classifier with α = 0.1, 0.15 and 0.2, respectively. In general, MCC is considered a balanced measure that can be used for classes that have various sizes, 31 ð27Þ Tables 12-13 and 14 give the performance analysis of the FA classifier with α = 0.1, 0.15 and 0.2, respectively.…”
Section: Resultsmentioning
confidence: 99%
“…Then, we measured the performance of the classification with the Matthews correlation coefficient (MCC) [20]. MCC is a metric to evaluate the classifier performance, which has been recently acknowledged as an elective measure in the machine learning community.…”
Section: Results and Analysismentioning
confidence: 99%