2019
DOI: 10.1080/00031305.2018.1529625
|View full text |Cite
|
Sign up to set email alerts
|

Valid P -Values Behave Exactly as They Should: Some Misleading Criticisms of P -Values and Their Resolution With S -Values

Abstract: The present note explores sources of misplaced criticisms of P-values, such as conflicting definitions of "significance levels" and "P-values" in authoritative sources, and the consequent misinterpretation of P-values as error probabilities. It then discusses several properties of P-values that have been presented as fatal flaws: That P-values exhibit extreme variation across samples (and thus are "unreliable"), confound effect size with sample size, are sensitive to sample size, and depend on investigator sam… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

4
279
1
4

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 273 publications
(288 citation statements)
references
References 55 publications
4
279
1
4
Order By: Relevance
“…based on leave-one-out cross validation), in addition to metrics of model misfit or discrepancy (e.g. Sander Greenland's S-value, Greenland (2019)).…”
Section: Lessons For Scientific Inferencementioning
confidence: 99%
“…based on leave-one-out cross validation), in addition to metrics of model misfit or discrepancy (e.g. Sander Greenland's S-value, Greenland (2019)).…”
Section: Lessons For Scientific Inferencementioning
confidence: 99%
“…And yet, there is a potential way out. Greenland (2019) suggested performing a logarithmic transformation of p-values: (−1)· log 2 (p). The logarithmic transformation causes p-values to be expressed as the number of bits of information against the model.…”
Section: P-values Without Significance Testingmentioning
confidence: 99%
“…But the P-value itself is not supposed to be "reliable" in the sense of staying put (Greenland 2018a). Its fickleness indicates variation in the data from sample to sample.…”
Section: Don't Blame the P-valuementioning
confidence: 99%
“…Instead of talking about hypothetical coverage of the true value by such intervals, which will fail under various assumption violations, we can think of the confidence interval as a "compatibility interval" (Greenland 2018a,b), showing effect sizes most compatible with the data according to their P-values, under the model used to compute the interval. Likewise, we can think of a posterior probability interval, or Bayesian "credible interval," as a compatibility interval showing effect sizes most compatible with the data, under the model and prior distribution used to compute the interval (Greenland 2018a). Again, whether such intervals include or exclude zero should play no role in their interpretation, because even with only random variation the intervals from different studies can easily be very different (Cumming 2014).…”
Section: Empire Of Diversitymentioning
confidence: 99%