Comparing multiple comparisons: practical guidance for choosing the best multiple comparisons test

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 04 Dec 2020 English Publisher:PeerJJournal:PeerJ, volume 8, page e10387 (eissn: 2167-8359,

Copyright policy )

Authors: Midway, Stephen; Robertson, Matthew; Flinn, Shane; Kaller, Michael;

doi: 10.7717/peerj.10387

pmid: 33335808

pmc: PMC7720730

Comparing multiple comparisons: practical guidance for choosing the best multiple comparisons test

- Summary
- Subjects
- Metrics

Abstract

Multiple comparisons tests (MCTs) include the statistical tests used to compare groups (treatments) often following a significant effect reported in one of many types of linear models. Due to a variety of data and statistical considerations, several dozen MCTs have been developed over the decades, with tests ranging from very similar to each other to very different from each other. Many scientific disciplines use MCTs, including >40,000 reports of their use in ecological journals in the last 60 years. Despite the ubiquity and utility of MCTs, several issues remain in terms of their correct use and reporting. In this study, we evaluated 17 different MCTs. We first reviewed the published literature for recommendations on their correct use. Second, we created a simulation that evaluated the performance of nine common MCTs. The tests examined in the simulation were those that often overlapped in usage, meaning the selection of the test based on fit to the data is not unique and that the simulations could inform the selection of one or more tests when a researcher has choices. Based on the literature review and recommendations: planned comparisons are overwhelmingly recommended over unplanned comparisons, for planned non-parametric comparisons the Mann-Whitney-Wilcoxon U test is recommended, Scheffé’s S test is recommended for any linear combination of (unplanned) means, Tukey’s HSD and the Bonferroni or the Dunn-Sidak tests are recommended for pairwise comparisons of groups, and that many other tests exist for particular types of data. All code and data used to generate this paper are available at: https://github.com/stevemidway/MultipleComparisons .

Related Organizations

Louisiana State University System
United States
Louisiana State University
United States
Louisiana State University Agricultural Center
United States
Michigan State University
United States
Memorial University of Newfoundland
Canada

View all View all

Keywords

Bioinformatics

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	325
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%

Found an issue? Give us feedback

325

Top 0.1%

Top 1%

Green

gold

Fields of Science

Fields of Science

Related to Research communities

UArctic