The Design of Replication Studies

Publication: Article, 31 Mar 2021, English

Publisher: Oxford University Press (OUP)

Journal: Journal of the Royal Statistical Society Series A: Statistics in Society, volume 184, pages 868-886 (ISSN: 0964-1998, eISSN: 1467-985X)

Authors: Hedges, Larry V.; Schauer, Jacob M.

doi: 10.1111/rssa.12688

Abstract

Empirical evaluations of replication have become increasingly common, but there has been no unified approach to conducting them. Some evaluations conduct only a single replication study while others run several, usually across multiple laboratories. Designing such programs has largely contended with difficult issues about which experimental components are necessary for a set of studies to be considered replications. However, another important consideration is that replication studies be designed to support sufficiently sensitive analyses. For instance, if hypothesis tests are to be conducted about replication, studies should be designed to ensure these tests are well-powered; if not, it can be difficult to determine conclusively whether replication attempts succeeded or failed. This paper describes methods for designing ensembles of replication studies to ensure that they are both adequately sensitive and cost-efficient. It describes two potential analyses of replication studies (hypothesis tests and variance component estimation) and approaches to obtaining optimal designs for them. Using these results, it assesses the statistical power, precision of point estimators and optimality of the design used by the Many Labs Project and finds that while it may have been sufficiently powered to detect some larger differences between studies, other designs would have been less costly and/or produced more precise estimates or higher-powered hypothesis tests.
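The abstract's point that an ensemble of replications should be designed so that its hypothesis tests are well-powered can be illustrated with a small simulation. The sketch below is not the authors' procedure. It assumes k studies whose effect estimates are normally distributed with a common sampling variance, true effects that vary across studies with variance tau2, and a Q-type heterogeneity test whose critical value is itself approximated by simulation. The function names and all design values (k, variance, tau2) are hypothetical and chosen only for illustration.

```python
import random


def q_statistic(estimates, variance):
    """Q statistic for k estimates sharing one sampling variance (assumed)."""
    mean = sum(estimates) / len(estimates)
    return sum((t - mean) ** 2 for t in estimates) / variance


def simulate_q(k, variance, tau2, reps, rng):
    """Draw `reps` values of Q when true effects vary with variance tau2."""
    sd_total = (variance + tau2) ** 0.5  # marginal sd of each study estimate
    draws = []
    for _ in range(reps):
        estimates = [rng.gauss(0.0, sd_total) for _ in range(k)]
        draws.append(q_statistic(estimates, variance))
    return draws


def estimated_power(k, variance, tau2, alpha=0.05, reps=20000, seed=1):
    """Monte Carlo power of the Q test against heterogeneity tau2."""
    rng = random.Random(seed)
    # Critical value: empirical (1 - alpha) quantile of Q when tau2 = 0.
    null_q = sorted(simulate_q(k, variance, 0.0, reps, rng))
    critical = null_q[int((1 - alpha) * reps)]
    # Power: share of Q values under the alternative exceeding that value.
    alt_q = simulate_q(k, variance, tau2, reps, rng)
    return sum(q > critical for q in alt_q) / reps


if __name__ == "__main__":
    # Hypothetical designs: same per-study variance, different numbers of studies.
    for k in (5, 15, 30):
        print(k, estimated_power(k, variance=0.04, tau2=0.04))
```

Under these assumptions, comparing the estimated power across values of k (or across per-study variances for a fixed total budget) is one way to see why the number and size of replication studies need to be chosen together rather than separately.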

Related Organizations

NORTHWESTERN UNIVERSITY
United States

Keywords

meta-analysis, power, replication, experimental design, Applications of statistics

Impact by BIP!

- Selected citations: 10. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
