Computational Reproducibility of Molecular Phylogenies

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 01 Jul 2023 English Publisher:Oxford University Press (OUP)Journal:Molecular Biology and Evolution, volume 40 (issn: 0737-4038, eissn: 1537-1719,

Copyright policy )Funded by:NIH | Inferring molecular evolu...

Authors: Sudhir Kumar; Qiqing Tao; Alessandra P Lamarca; Koichiro Tamura;

doi: 10.1093/molbev/msad165

pmid: 37467477

pmc: PMC10370456

Computational Reproducibility of Molecular Phylogenies

- Summary
- Subjects
- Metrics

Abstract

Abstract Repeated runs of the same program can generate different molecular phylogenies from identical data sets under the same analytical conditions. This lack of reproducibility of inferred phylogenies casts a long shadow on downstream research employing these phylogenies in areas such as comparative genomics, systematics, and functional biology. We have assessed the relative accuracies and log-likelihoods of alternative phylogenies generated for computer-simulated and empirical data sets. Our findings indicate that these alternative phylogenies reconstruct evolutionary relationships with comparable accuracy. They also have similar log-likelihoods that are not inferior to the log-likelihoods of the true tree. We determined that the direct relationship between irreproducibility and inaccuracy is due to their common dependence on the amount of phylogenetic information in the data. While computational reproducibility can be enhanced through more extensive heuristic searches for the maximum likelihood tree, this does not lead to higher accuracy. We conclude that computational irreproducibility plays a minor role in molecular phylogenetics.

Related Organizations

Department of Biology
United States
Federal University of Rio de Janeiro
Brazil
Temple University
United States
TEMPLE UNIV OF THE COMMONWEALTH
Tokyo Metropolitan University
Japan

View all View all

Keywords

Letter, Reproducibility of Results, Computer Simulation, Genomics, Biological Evolution, Phylogeny

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

3

Top 10%

Average

Green

gold

Funded by

NIH| Inferring molecular evolutionary rates and divergence dates