The Potential Effect of Lowering the Threshold of Statistical Significance From P &lt; .05 to P &lt; .005 in Orthopaedic Sports Medicine

Evans, Sheridan; Anderson, Jon; Johnson, Austin; Checketts, Jake X.; Scott, Jared; Middlemist, Kevin; Fishbeck, Keith; Vassar, Matt

doi:10.1016/j.arthro.2020.11.041

Cited by 5 publications

(5 citation statements)

References 11 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As noted previously, near the front of this issue of Arthroscopy, readers will find the original scientific article, "The Potential Effect of Lowering the Threshold of Statistical Significance From P < .05 to P < .005 in Orthopaedic Sports Medicine" by Evans et al 2 In addition, near the end of this issue, readers will find the Level V evidence (expert opinion) article, "The Blight of the Type II Error: When No Difference Does Not Mean No Difference" by Domb and Sabetian of the American Hip Institute in Chicago. 19 As mentioned, Evans et al focus on avoidance of falsely positive, albeit statistically significant, conclusions.…”

Section: In This Issuementioning

confidence: 99%

“…19 As mentioned, Evans et al focus on avoidance of falsely positive, albeit statistically significant, conclusions. 2 In contrast, Domb and Sabetian focus on reasons why studies could fail, errantly, to achieve statistically significant results. Domb and Sabetian introduce that "underpowered studies caused by small sample sizes are especially prevalent in the surgical literature," where power is defined as "the capacity of the study to recognize whether there is a difference (between treatment groups), given that such difference exists."…”

Section: In This Issuementioning

confidence: 99%

“…There's so much to more to consider when it comes to data, and yet we're still misleadingly dichotomizing study results as significant or not. In the end, readers of both Evans et al 2 and Domb and Sabetian 19 could keep in mind that, moving forward, we need to avoid the rigid and dichotomous thought process of difference versus no difference in favor of a process that estimates uncertainty. When it comes to medical research data, uncertainty, on some level, is omnipresent.…”

Section: In This Issuementioning

confidence: 99%

“…Near the front of this months' issue of Arthroscopy, readers will find the original scientific article, "The Potential Effect of Lowering the Threshold of Statistical Significance From P < .05 to P < .005 in Orthopaedic Sports Medicine" by Evans, Johnson, Anderson, Checketts, Scott, Middlemist, Fishbeck, and Vassar of the Oklahoma State University. 2 Evans et al 2 report that some respected scholars recommend "redefining statistical significance by changing the P value threshold from .05 to .005" to lower the risk of that medical research studies reach "false-positive" conclusions, and their results show that lowering the P value threshold would dramatically reduce the number of statistically significant findings in randomized controlled trials. This is clinically relevant because if the statistical significance threshold were thus changed, evidence-based recommendations used to guide clinical decision-making would be greatly affected.…”

mentioning

confidence: 99%

See 3 more Smart Citations

Misinterpretation of P Values and Statistical Power Creates a False Sense of Certainty: Statistical Significance, Lack of Significance, and the Uncertainty Challenge

Cote¹,

Lubowitz

Brand

et al. 2021

Arthroscopy: The Journal of Arthroscopic & Related Surgery

View full text Add to dashboard Cite

Section: In This Issuementioning

confidence: 99%

Section: In This Issuementioning

confidence: 99%

Section: In This Issuementioning

confidence: 99%

mentioning

confidence: 99%

See 2 more Smart Citations

Misinterpretation of P Values and Statistical Power Creates a False Sense of Certainty: Statistical Significance, Lack of Significance, and the Uncertainty Challenge

Cote¹,

Lubowitz

Brand

et al. 2021

Arthroscopy: The Journal of Arthroscopic & Related Surgery

View full text Add to dashboard Cite

“…In a series of studies, Vassar and colleagues examined the impact of changing the threshold on randomized controlled trials (RCTs) published in general medical, orthopaedic trauma, and orthopaedic sports medicine journals [24][25][26]. The results of these studies are summarized in Table 1, along with a study by Thakur and Jha [27] that examined changing the P value threshold on results from 123 RCTs pertaining to chronic rhinosinusitis and a study by Khan et al [28] that focused on 72 RCTS from high impact general medical and cardiology journals.…”

Section: Introductionmentioning

confidence: 99%

Impact of redefining statistical significance on P-hacking and false positive rates: An agent-based model

Fitzpatrick,

Gorman,

Trombatore

2024

PLoS ONE

View full text Add to dashboard Cite

In recent years, concern has grown about the inappropriate application and interpretation of P values, especially the use of P<0.05 to denote “statistical significance” and the practice of P-hacking to produce results below this threshold and selectively reporting these in publications. Such behavior is said to be a major contributor to the large number of false and non-reproducible discoveries found in academic journals. In response, it has been proposed that the threshold for statistical significance be changed from 0.05 to 0.005. The aim of the current study was to use an evolutionary agent-based model comprised of researchers who test hypotheses and strive to increase their publication rates in order to explore the impact of a 0.005 P value threshold on P-hacking and published false positive rates. Three scenarios were examined, one in which researchers tested a single hypothesis, one in which they tested multiple hypotheses using a P<0.05 threshold, and one in which they tested multiple hypotheses using a P<0.005 threshold. Effects sizes were varied across models and output assessed in terms of researcher effort, number of hypotheses tested and number of publications, and the published false positive rate. The results supported the view that a more stringent P value threshold can serve to reduce the rate of published false positive results. Researchers still engaged in P-hacking with the new threshold, but the effort they expended increased substantially and their overall productivity was reduced, resulting in a decline in the published false positive rate. Compared to other proposed interventions to improve the academic publishing system, changing the P value threshold has the advantage of being relatively easy to implement and could be monitored and enforced with minimal effort by journal editors and peer reviewers.

show abstract

Lowering the statistical significance threshold of randomized controlled trials in three major general anesthesiology journals

Waters

Rucker

Love

et al. 2023

Can J Anesth/J Can Anesth

View full text Add to dashboard Cite

The Potential Effect of Lowering the Threshold of Statistical Significance From P < .05 to P < .005 in Orthopaedic Sports Medicine

Cited by 5 publications

References 11 publications

Misinterpretation of P Values and Statistical Power Creates a False Sense of Certainty: Statistical Significance, Lack of Significance, and the Uncertainty Challenge

Misinterpretation of P Values and Statistical Power Creates a False Sense of Certainty: Statistical Significance, Lack of Significance, and the Uncertainty Challenge

Impact of redefining statistical significance on P-hacking and false positive rates: An agent-based model

Lowering the statistical significance threshold of randomized controlled trials in three major general anesthesiology journals

Contact Info

Product

Resources

About