Current controversies: Null hypothesis significance testing

Publication: Article, 22 Apr 2022, Denmark, English
Publisher: Wiley
Journal: Acta Obstetricia et Gynecologica Scandinavica, volume 101, pages 624-627 (ISSN: 0001-6349, eISSN: 1600-0412)

Authors: Philip M. Sedgwick; Anne Hammer; Ulrik Schiøler Kesmodel; Lars Henning Pedersen

doi: 10.1111/aogs.14366

pmid: 35451497

pmc: PMC9564801


Abstract

Traditional null hypothesis significance testing (NHST) incorporating the critical level of significance of 0.05 has become the cornerstone of decision-making in health care, and nowhere less so than in obstetric and gynecological research. However, such practice is controversial. In particular, it was never intended for clinical significance to be inferred from statistical significance. The inference of clinical importance based on statistical significance (p < 0.05), and lack of clinical significance otherwise (p ≥ 0.05) represents misunderstanding of the original purpose of NHST. Furthermore, the limitations of NHST (sensitivity to sample size, plus type I and II errors) are frequently ignored. Therefore, decision-making based on NHST has the potential for recurrent false claims about the effectiveness of interventions or importance of exposure to risk factors, or dismissal of important ones. This commentary presents the history behind NHST along with the limitations that modern-day NHST presents, and suggests that a statistics reform regarding NHST be considered.
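
The sensitivity of NHST to sample size mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the article: the function name `two_sided_p_value`, the two-sample z-test with a known common standard deviation, and all numbers (a mean difference of 0.1, an SD of 1.0, the group sizes) are hypothetical choices made only for illustration.

```python
import math


def two_sided_p_value(mean_diff, sd, n_per_group):
    """Two-sided p-value of a two-sample z-test (known, equal SD; equal group sizes)."""
    # Standard error of the difference between two independent group means.
    standard_error = sd * math.sqrt(2.0 / n_per_group)
    z = mean_diff / standard_error
    # P(|Z| >= |z|) for a standard normal Z, written with the complementary error function.
    return math.erfc(abs(z) / math.sqrt(2.0))


# Hypothetical values: the observed difference stays fixed, only the sample size changes.
MEAN_DIFF = 0.1
SD = 1.0

for n in (50, 500, 5000):
    p = two_sided_p_value(MEAN_DIFF, SD, n)
    verdict = "p < 0.05" if p < 0.05 else "p >= 0.05"
    print(f"n per group = {n:>5}: p = {p:.4g} ({verdict})")
```

Under these assumptions the same small difference is not statistically significant with 50 or 500 participants per group but falls below 0.05 with 5000 per group, although its clinical relevance is unchanged.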

Country

Denmark

Related Organizations

Aarhus University, Denmark
Aalborg University, Denmark
University of London, United Kingdom
St George's, University of London, United Kingdom
Aalborg University Library (AUB), Denmark


Keywords

Controversies, clinical significance, Gynecology and obstetrics, null hypothesis significance testing, Research Design, Sample Size, RG1-991, p < 0.05, Humans, statistical significance

Impact by BIP!

selected citations: 26. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.