2014
DOI: 10.1007/s11192-014-1251-5
|View full text |Cite
|
Sign up to set email alerts
|

Null hypothesis significance tests. A mix-up of two different theories: the basis for widespread confusion and numerous misinterpretations

Abstract: Null hypothesis statistical significance tests (NHST) are widely used in quantitative research in the empirical sciences including scientometrics. Nevertheless, since their introduction nearly a century ago significance tests have been controversial. Many researchers are not aware of the numerous criticisms raised against NHST. As practiced, NHST has been characterized as a 'null ritual' that is overused and too often misapplied and misinterpreted. NHST is in fact a patchwork of two fundamentally different cla… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
69
0
1

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 91 publications
(71 citation statements)
references
References 89 publications
0
69
0
1
Order By: Relevance
“…If the p-value is below a predefined, arbitrary threshold, usually p<0.05, the result is explained as evidence in favor of an alternative hypothesis, with smaller p-values taken as stronger evidence in favor of the alternative hypothesis. Importantly, p-values do not signify the strength of evidence in favor of an alternative hypothesis (Goodman, 2008;Schneider, 2015). Moreover, NHST and the resulting p-value do not give any information on the magnitude of the difference (Nakagawa and Cuthill, 2007;Cumming, 2014;Ivarsson et al, 2015;Motulsky, 2015).…”
Section: Introductionmentioning
confidence: 92%
“…If the p-value is below a predefined, arbitrary threshold, usually p<0.05, the result is explained as evidence in favor of an alternative hypothesis, with smaller p-values taken as stronger evidence in favor of the alternative hypothesis. Importantly, p-values do not signify the strength of evidence in favor of an alternative hypothesis (Goodman, 2008;Schneider, 2015). Moreover, NHST and the resulting p-value do not give any information on the magnitude of the difference (Nakagawa and Cuthill, 2007;Cumming, 2014;Ivarsson et al, 2015;Motulsky, 2015).…”
Section: Introductionmentioning
confidence: 92%
“…We have not provided uncertainty estimates in relation to the indicators presented. The issue of quantifying uncertainty is a contested topic especially in relation to research evaluation and scientometric indicators (Schneider, 2013(Schneider, , 2015(Schneider, , 2016Hicks et al, 2015). A current pragmatic compromise, although still disputed, is to use so-called 'stability intervals', which are bootstrapped confidence intervals (Colliander & Ahlgren, 2011;Schneider & van Leeuwen, 2014).…”
Section: Search Strategy Data and Indicatorsmentioning
confidence: 99%
“…Consequently, the Type I error rate remains constant if researchers simply repeat the same test over and over again using different samples that have been randomly drawn from the exact same population. However, this first situation is somewhat hypothetical and may even be regarded as impossible in the social sciences because populations of people change over time and location (e.g., Gergen, 1973;Iso-Ahola, 2017;Schneider, 2015;Serlin, 1987;Stroebe & Strack, 2014). Yesterday's population of psychology undergraduate students from the University of Newcastle, Australia will be a different population to today's population of psychology undergraduate students from the University of Newcastle, Australia.…”
Section: Familywise Error Rates Based On Different Tests Of the Same mentioning
confidence: 99%