2012
DOI: 10.2478/v10006-012-0064-z
|View full text |Cite
|
Sign up to set email alerts
|

Nonparametric statistical analysis for multiple comparison of machine learning regression algorithms

Abstract: In the paper we present some guidelines for the application of nonparametric statistical tests and post-hoc procedures devised to perform multiple comparisons of machine learning algorithms. We emphasize that it is necessary to distinguish between pairwise and multiple comparison tests. We show that the pairwise Wilcoxon test, when employed to multiple comparisons, will lead to overoptimistic conclusions. We carry out intensive normality examination employing ten different tests showing that the output of mach… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
58
0
7

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 139 publications
(66 citation statements)
references
References 44 publications
1
58
0
7
Order By: Relevance
“…The results of the Friedman test [39] for the AUC measure with the four datasets of S. cerevisiae -C.…”
Section: Resultsmentioning
confidence: 99%
“…The results of the Friedman test [39] for the AUC measure with the four datasets of S. cerevisiae -C.…”
Section: Resultsmentioning
confidence: 99%
“…The statistical analysis was performed using the Mann-Whitney U (Wilcoxon rank-sum test) test-two-sided test, provided by the R software environment for statistical computing (R Core Team, 2013). This test was chosen since, according to Trawiński et al (2012), it is more sensible than the t-test when the number of observations is small (10 in our case).…”
Section: 3mentioning
confidence: 99%
“…Ramon and De Raedt (2000) adapted neural networks (Trawiński et al, 2012) to the MIL setting via taking into account the relation of a bag to its instances. Zhang and Zhou (2004) later derived a similar framework.…”
Section: Related Workmentioning
confidence: 99%