J Perezgonzalez scite author profile

Despite frequent calls for the overhaul of null hypothesis significance testing (NHST), this controversial procedure remains ubiquitous in behavioral, social and biomedical teaching and research. Little change seems possible once the procedure becomes well ingrained in the minds and current practice of researchers; thus, the optimal opportunity for such change is at the time the procedure is taught, be this at undergraduate or at postgraduate levels. This paper presents a tutorial for the teaching of data testing procedures, often referred to as hypothesis testing theories. The first procedure introduced is Fisher's approach to data testing—tests of significance; the second is Neyman-Pearson's approach—tests of acceptance; the final procedure is the incongruent combination of the previous two theories into the current approach—NSHT. For those researchers sticking with the latter, two compromise solutions on how to improve NHST conclude the tutorial.

show abstract

Manipulating the Alpha Level Cannot Cure Significance Testing

Trafimow

Amrhein

Areshenkoff

et al. 2018

Front. Psychol.

View full text Add to dashboard Cite

We argue that making accept/reject decisions on scientific hypotheses, including a recent call for changing the canonical alpha level from p = 0.05 to p = 0.005, is deleterious for the finding of new discoveries and the progress of science. Given that blanket and variable alpha levels both are problematic, it is sensible to dispense with significance testing altogether. There are alternatives that address study design and sample size much more directly than significance testing does; but none of the statistical tools should be taken as the new magic method giving clear-cut mechanical answers. Inference should not be based on single studies at all, but on cumulative evidence from multiple independent studies. When evaluating the strength of the evidence, we should consider, for example, auxiliary assumptions, the strength of the experimental design, and implications for applications. To boil all this down to a binary decision based on a p-value threshold of 0.05, 0.01, 0.005, or anything else, is not acceptable.

show abstract

Manipulating the alpha level cannot cure significance testing

Trafimow

Amrhein

Areshenkoff

et al. 2018

Preprint

View full text Add to dashboard Cite

We argue that making accept/reject decisions on scientific hypotheses, including a recent call for changing the canonical alpha level from p= .05 to .005, is deleterious for the finding of new discoveries and the progress of science. Given that blanket and variable alpha levels both are problematic, it is sensible to dispense with significance testing altogether. There are alternatives that address study design and sample size much more directly than significance testing does; but none of the statistical tools should be taken as the new magic method giving clear-cut mechanical answers. Inference should not be based on single studies at all, but on cumulative evidence from multiple independent studies. When evaluating the strength of the evidence, we should consider, for example, auxiliary assumptions, the strength of the experimental design, and implications for applications. To boil all this down to a binary decision based on a p-value threshold of .05, .01, .005, or anything else, is not acceptable.

show abstract

Replication crisis or an opportunity to improve scientific production?

Frías-Navarro

Pascual‐Llobell

Pascual‐Soler

et al. 2020

Euro J of Education

View full text Add to dashboard Cite

The 21st century began with a heated debate in scientific research, known as the 'replication crisis', 'reproducibility crisis' or 'crisis of confidence ' (Fanelli, 2018). Problems in replicating findings and reproducing scientific research began to be evident in many areas of research. The problems became highly visible thanks to essays and studies published in scientific journals and articles in the popular press (

show abstract

P-values as percentiles. Commentary on: â€œNull hypothesis significance tests. A mixâ€“up of two different theories: the basis for widespread confusion and numerous misinterpretationsâ€

Perezgonzalez

2015

Front. Psychol.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

J Perezgonzalez

Fisher, Neyman-Pearson or NHST? A tutorial for teaching data testing

Manipulating the Alpha Level Cannot Cure Significance Testing

Manipulating the alpha level cannot cure significance testing

Replication crisis or an opportunity to improve scientific production?

P-values as percentiles. Commentary on: â€œNull hypothesis significance tests. A mixâ€“up of two different theories: the basis for widespread confusion and numerous misinterpretationsâ€

Contact Info

Product

Resources

About

J Perezgonzalez

Fisher, Neyman-Pearson or NHST? A tutorial for teaching data testing

Manipulating the Alpha Level Cannot Cure Significance Testing

Manipulating the alpha level cannot cure significance testing

Replication crisis or an opportunity to improve scientific production?

P-values as percentiles. Commentary on: â€œNull hypothesis significance tests. A mixâ€“up of two different theories: the basis for widespread confusion and numerous misinterpretationsâ€

Contact Info

Product

Resources

About

P-values as percentiles. Commentary on: â€œNull hypothesis significance tests. A mixâ€“up of two different theories: the basis for widespread confusion and numerous misinterpretationsâ€