2012
DOI: 10.1007/978-81-322-1038-2_2
|View full text |Cite
|
Sign up to set email alerts
|

An Evaluation of Classification Algorithms Using Mc Nemar’s Test

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
51
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 53 publications
(56 citation statements)
references
References 10 publications
5
51
0
Order By: Relevance
“…Further, for both feature sets, features extracted 320 from the non-speech parts yield higher identification rates than features extracted from the whole utterance or the speech parts. For each classifier, the performance differences between the baseline features (extracted from the speech parts) and those extracted from the whole utterance or the non-speech parts were found to be significantly different according to McNemar's test [48]. McNemar's test tabulates the correlation of the correct 325 and incorrect decisions between two systems and counts the number of trials that two systems disagree and then uses the chi-square test statistics to compute the p-values.…”
Section: Comparison Of Mfcc and Lfcc Featuresmentioning
confidence: 99%
“…Further, for both feature sets, features extracted 320 from the non-speech parts yield higher identification rates than features extracted from the whole utterance or the speech parts. For each classifier, the performance differences between the baseline features (extracted from the speech parts) and those extracted from the whole utterance or the non-speech parts were found to be significantly different according to McNemar's test [48]. McNemar's test tabulates the correlation of the correct 325 and incorrect decisions between two systems and counts the number of trials that two systems disagree and then uses the chi-square test statistics to compute the p-values.…”
Section: Comparison Of Mfcc and Lfcc Featuresmentioning
confidence: 99%
“…Statistical significance of these comparisons are represented with scores. Note that these results should be analyzed using Table 4 (given in [27,31]) which presents the confidence levels as an indicator of the statistical significance based on scores. In terms of statistical significance, the binary case resulted in significant results for 3 out of 8 datasets, while the floating-point descriptor yielded statistically significant results for the complete dataset.…”
Section: Resultsmentioning
confidence: 99%
“…The test creates 2 × 2 contingency tables in order to compute scores. This test has a published past of usage by medical research community [30] and has recently been used for performance comparison in computer vision for the first time by Clark [7] and later for machine learning by Bostanci [27]. The test is significantly robust against Type-I error, i.e.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Room mean squared error (RMSE) indicates how precise the classification is. Lower RMSE value indicates the more accurate classifier results [12]. RMSE value is also used in this research for evaluation.…”
Section: B Evaluation Criterionmentioning
confidence: 99%