2018
DOI: 10.1177/2515245918770963
|View full text |Cite
|
Sign up to set email alerts
|

Equivalence Testing for Psychological Research: A Tutorial

Abstract: Psychologists should be able to falsify predictions. A common prediction in psychological research is that a nonzero effect exists in the population. For example, one might predict that American Asian women primed with their Asian identity will perform better on a math test compared with women who are primed with their female identity. To be able to design a study that allows for strong inferences (Platt, 1964), it is important to specify which test result would falsify the hypothesis in question. Equivalence … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

7
890
1
8

Year Published

2018
2018
2023
2023

Publication Types

Select...
8
2

Relationship

1
9

Authors

Journals

citations
Cited by 1,122 publications
(991 citation statements)
references
References 45 publications
7
890
1
8
Order By: Relevance
“…On the other hand, there are concerns about the limitations of NHST. Importantly, in NHST, nonstatistically significant results are typically inconclusive, so researchers cannot accept a null hypothesis (but see Machery, 2012, andLakens, Scheel, &Isager, 2018). And if we cannot accept a null hypothesis, then it is harder to evaluate and publish failed replication attempts.…”
Section: Causes Of the Replicability Crisismentioning
confidence: 99%
“…On the other hand, there are concerns about the limitations of NHST. Importantly, in NHST, nonstatistically significant results are typically inconclusive, so researchers cannot accept a null hypothesis (but see Machery, 2012, andLakens, Scheel, &Isager, 2018). And if we cannot accept a null hypothesis, then it is harder to evaluate and publish failed replication attempts.…”
Section: Causes Of the Replicability Crisismentioning
confidence: 99%
“…Hypothesis 2 assumed that color-evasion would be unrelated to cognitive, metacognitive, and behavioral CQ. Since conventional significance tests technically cannot test for the absence of an effect, we used equivalence testing with one-sided t-tests to reject the presence of a smallest effect size of interest (SESOI) (Lakens, Scheel, & Isager, 2018). As these procedures are not yet available for multilevel regression, we only tested equivalence for the bivariate correlations between color-evasion with self-reported cognitive, metacognitive, and behavioral CQ as well as with the SJT score.…”
Section: Gendermentioning
confidence: 99%
“…For each study, we report frequentist and Bayesian statistics, confidence intervals around Hedges' g based on d av , and the Bayes Factor (JZS) with a r-scale of .707 (Rouder, Speckman, Sun, Morey, & Iverson, 2009). When we expected no or a weak IAT effect to emerge, we additionally report the results of equivalence tests (i.e., the 'two one-sided t-tests approach', see Lakens, Scheel, & Isager, 2018). For completeness, we report the robust statistics using the Yuen-Welch method for comparing 20% trimmed means (Wilcox, 2012;Wilcox & Tian, 2011) in the supplementary materials (see the Appendix A in the supplementary materials for the robust statistics).…”
Section: Sample Size Justification and Statistical Analysesmentioning
confidence: 99%