2021
DOI: 10.1186/s13040-021-00244-z
|View full text |Cite
|
Sign up to set email alerts
|

The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation

Abstract: Evaluating binary classifications is a pivotal task in statistics and machine learning, because it can influence decisions in multiple areas, including for example prognosis or therapies of patients in critical conditions. The scientific community has not agreed on a general-purpose statistical indicator for evaluating two-class confusion matrices (having true positives, true negatives, false positives, and false negatives) yet, even if advantages of the Matthews correlation coefficient (MCC) over accuracy and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
367
2
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 558 publications
(374 citation statements)
references
References 58 publications
3
367
2
2
Order By: Relevance
“…Full-size  DOI: 10.7717/peerj-cs.623/fig-1 (Matthews, 1975) in binary classification (Chicco & Jurman, 2020;Chicco, Tötsch & Jurman, 2021;Tötsch & Hoffmann, 2021;Chicco, Starovoitov & Jurman, 2021;Chicco, Warrens & Jurman, 2021), R-squared generates a high score only if the regression is able to correctly classify most of the elements of each class. In this example, the regression fails to classify all the elements of the 4 class and of the 5 class, so we believe a good metric would communicate this key-message.…”
Section: Uc5 Use Casementioning
confidence: 99%
“…Full-size  DOI: 10.7717/peerj-cs.623/fig-1 (Matthews, 1975) in binary classification (Chicco & Jurman, 2020;Chicco, Tötsch & Jurman, 2021;Tötsch & Hoffmann, 2021;Chicco, Starovoitov & Jurman, 2021;Chicco, Warrens & Jurman, 2021), R-squared generates a high score only if the regression is able to correctly classify most of the elements of each class. In this example, the regression fails to classify all the elements of the 4 class and of the 5 class, so we believe a good metric would communicate this key-message.…”
Section: Uc5 Use Casementioning
confidence: 99%
“…Table 2 shows the accuracy, sensitivity, specificity, precision, and negative predictive values obtained to evaluate the classification model's performance using formulas ( 9)-( 13) [26][27][28]. Formula (1) defines accuracy and indicates the probability of accurately classifying all under stress and without stress conditions.…”
Section: Resultsmentioning
confidence: 99%
“…Starting from five fingerprints, 479 AdaBoost (AB) models, 549 decision tree (DT) models, and 2.903 random forest (RF) models were built and validated for each fingerprint and endpoint, summing 59,640 machine learning models (a boxplot analysis for each model and fingerprints are displayed in the Supplementary Figure S1). The cross-validation MCC value was selected to evaluate the model’s performance due to their capability of classifying the performance by a single value, comprising all parameters of a confusion matrix [ 88 , 89 ]. In this sense, if the model had the higher CV MCC and EXT MCC, then it was selected.…”
Section: Resultsmentioning
confidence: 99%