2018
DOI: 10.31234/osf.io/7k6ay
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Equivalence Testing and the Second Generation P-Value

Abstract: To move beyond the limitations of null-hypothesis tests, statistical approaches have been developed where the observed data are compared against a range of values that are equivalent to the absence of a meaningful effect. Specifying a range of values around zero allows researchers to statistically reject the presence of effects large enough to matter, and prevents practically insignificant effects from being interpreted as a statistically significant difference. We compare the behavior of the recently proposed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 12 publications
0
10
0
1
Order By: Relevance
“…In some research articles in sports science, p values were replaced by magnitude-based inferences ( Batterham & Hopkins, 2006 ), which were recently strongly criticized because of their high error rates ( Sainani, 2018 ). Recently proposed “second-generation p values” ( Blume, D’Agostino McGowan, Dupont, & Greevy, 2018 ) turned out to be highly similar to, but less informative than, equivalence tests ( Lakens & Delacre, 2020 ). Training researchers how to use existing frequentist and Bayesian approaches to estimation and hypothesis testing well (which means with care and while acknowledging the limitations of each approach) might be a more fruitful approach for improving statistical inferences than developing novel statistical approaches.…”
Section: Why Would Alternatives To P Values Fare Any Better?mentioning
confidence: 99%
“…In some research articles in sports science, p values were replaced by magnitude-based inferences ( Batterham & Hopkins, 2006 ), which were recently strongly criticized because of their high error rates ( Sainani, 2018 ). Recently proposed “second-generation p values” ( Blume, D’Agostino McGowan, Dupont, & Greevy, 2018 ) turned out to be highly similar to, but less informative than, equivalence tests ( Lakens & Delacre, 2020 ). Training researchers how to use existing frequentist and Bayesian approaches to estimation and hypothesis testing well (which means with care and while acknowledging the limitations of each approach) might be a more fruitful approach for improving statistical inferences than developing novel statistical approaches.…”
Section: Why Would Alternatives To P Values Fare Any Better?mentioning
confidence: 99%
“…In these cases, traditional hypothesis tests are not appropriate because a nonsignificant p-value does not provide evidence that there is zero or no effect at all [49,56]. Researchers should avoid using "magnitude based inference" [57,58] or "second generation p-values" [59] as hypothesis tests for equivalence, though they may be useful as descriptive statistics of the confidence interval. The primary problem with using these statistics as hypothesis tests is the higher false positive risk and dependence upon the sample size [57].…”
Section: Selecting the Appropriate Inferential Statisticsmentioning
confidence: 99%
“…Then, P ζ has a maximum value of 1/2, which is a deliberate consequence of the definition, as this value does not suggest a “proof” of H 0 . For a comparison of the SGPV with TOST, see [ 12 ].…”
Section: Definitionsmentioning
confidence: 99%
“…4.2.6 Estimation. The effects η J and ϑ pred,J are estimated by plugging in estimates for the parameters,b J ,ŝ ¼ŝ f andŝ r , into (12) or (13). Using the first option shows how to obtain a confidence interval.…”
Section: Plos Onementioning
confidence: 99%