2022
DOI: 10.1111/jeb.14009
|View full text |Cite
|
Sign up to set email alerts
|

Why and how we should join the shift from significance testing to estimation

Abstract: A paradigm shift away from null hypothesis significance testing seems in progress. Based on simulations, we illustrate some of the underlying motivations. First, p‐values vary strongly from study to study, hence dichotomous inference using significance thresholds is usually unjustified. Second, ‘statistically significant’ results have overestimated effect sizes, a bias declining with increasing statistical power. Third, ‘statistically non‐significant’ results have underestimated effect sizes, and this bias get… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 46 publications
(25 citation statements)
references
References 105 publications
1
18
0
Order By: Relevance
“…Similarly, Structure has limited discriminatory power in the presence of low genetic differentiation and moderate gene flow (Latch et al 2006;Waples and Gaggiotti 2006), which could explain why clusters were not detected in this study. It can also be argued that the PCA and Structure analyses indicate correctly that there is no distinction between the contingents, but that the observed significant pairwise F ST values represent false-positives due to large data sets (Helyar et al 2011), an argument in line with similar discussions on the importance of focusing on the effect size rather than the significance of P values (Halsey et al 2015;Berner and Amrhein 2022). In this study, effect sizes matched expectations; the largest F ST values (effect size) were observed between the reference sets and the intercountry comparisons while the smallest ones were mainly within Canadian NAFO division comparisons.…”
Section: Transatlantic and Nwa Population Structurementioning
confidence: 58%
“…Similarly, Structure has limited discriminatory power in the presence of low genetic differentiation and moderate gene flow (Latch et al 2006;Waples and Gaggiotti 2006), which could explain why clusters were not detected in this study. It can also be argued that the PCA and Structure analyses indicate correctly that there is no distinction between the contingents, but that the observed significant pairwise F ST values represent false-positives due to large data sets (Helyar et al 2011), an argument in line with similar discussions on the importance of focusing on the effect size rather than the significance of P values (Halsey et al 2015;Berner and Amrhein 2022). In this study, effect sizes matched expectations; the largest F ST values (effect size) were observed between the reference sets and the intercountry comparisons while the smallest ones were mainly within Canadian NAFO division comparisons.…”
Section: Transatlantic and Nwa Population Structurementioning
confidence: 58%
“…P-values cannot be directly compared between studies and often trigger unjustifiable false comparisons (Bernardi et al, 2016;Berner & Amrhein, 2022;Halsey, 2019). A statistical significance indicated by a p-value of less than 0.05 is often erroneously misinterpreted as indicating a meaningful effect size, whereas statistically non-significant results often have an underlying non-zero effect size (Bernardi et al, 2016;Berner & Amrhein, 2022;Halsey, 2019). Furthermore, use of pvalues and NHST effectively asks the binary question, "is there an effect?…”
Section: Introductionmentioning
confidence: 99%
“…P -values cannot be directly compared between studies and often trigger unjustifiable false comparisons (Bernardi et al, 2016; Berner & Amrhein, 2022; Halsey, 2019). A statistical significance indicated by a p -value of less than 0.05 is often erroneously misinterpreted as indicating a meaningful effect size, whereas statistically non-significant results often have an underlying non-zero effect size (Bernardi et al, 2016; Berner & Amrhein, 2022; Halsey, 2019). Furthermore, use of p -values and NHST effectively asks the binary question, “is there an effect?”, whereas studies in ecology and evolution are typically quantitative and “how large is the effect?” is usually a more important question (Ho et al, 2019; Sullivan & Feinn, 2012).…”
Section: Introductionmentioning
confidence: 99%
“…The last component of this framework is the estimation strategy or the process that will be used to learn about the empirical estimand, which includes the estimator and estimate. Among other things, it is sought (see Wasserstein et al, 2019 for an introduction) that rather than estimation strategies that rely on making dichotomous inferences about the presence or absence of the effects of interest (e.g., using null hypothesis significance testing statistical or Bayes factor) and on reporting and interpreting point estimates, estimation strategies focus on estimating the direction and size of these effects, and on embracing uncertainty, for example, by reporting frequentist confidence intervals or their Bayesian counterparts, credible intervals (Berner & Amrhein, 2022;Smith, 2018).…”
Section: A Brief Introduction To Formal Modelsmentioning
confidence: 99%