DNA Sequences Are as Useful as Protein Sequences for Inferring Deep Phylogenies

Publication: Article, Other literature type. 27 Jun 2023. Austria. English.

Publisher: Oxford University Press (OUP)

Journal: Systematic Biology, volume 72, pages 1119-1135 (ISSN: 1063-5157, eISSN: 1076-836X)

Funded by: UKRI | UCL Biosciences Big Data, UKRI | Efficient Bayesian phylog..., UKRI | Addressing the problem of...

Authors: Kapli, Paschalia; Kotari, Ioanna; Telford, Maximilian J.; Goldman, Nick; Yang, Ziheng;

doi: 10.1093/sysbio/syad036

pmid: 37366056

pmc: PMC10627555


Abstract

Inference of deep phylogenies has almost exclusively used protein rather than DNA sequences based on the perception that protein sequences are less prone to homoplasy and saturation or to issues of compositional heterogeneity than DNA sequences. Here, we analyze a model of codon evolution under an idealized genetic code and demonstrate that those perceptions may be misconceptions. We conduct a simulation study to assess the utility of protein versus DNA sequences for inferring deep phylogenies, with protein-coding data generated under models of heterogeneous substitution processes across sites in the sequence and among lineages on the tree, and then analyzed using nucleotide, amino acid, and codon models. Analysis of DNA sequences under nucleotide-substitution models (possibly with the third codon positions excluded) recovered the correct tree at least as often as analysis of the corresponding protein sequences under modern amino acid models. We also applied the different data-analysis strategies to an empirical dataset to infer the metazoan phylogeny. Our results from both simulated and real data suggest that DNA sequences may be as useful as proteins for inferring deep phylogenies and should not be excluded from such analyses. Analysis of DNA data under nucleotide models has a major computational advantage over protein-data analysis, potentially making it feasible to use advanced models that account for among-site and among-lineage heterogeneity in the nucleotide-substitution process in inference of deep phylogenies.
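
The data-analysis strategies compared in the abstract start from the same protein-coding alignment and differ in how each sequence is recoded before tree inference. The following is a minimal Python sketch of that recoding step, not the authors' pipeline. The function names (`split_codons`, `drop_third_positions`, `translate`) are hypothetical, and the sketch assumes the standard genetic code rather than the idealized code analyzed in the paper.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the paper's idealized genetic code is NOT reproduced here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def split_codons(seq):
    """Split an in-frame coding sequence into a list of codons."""
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def drop_third_positions(seq):
    """Keep codon positions 1 and 2 only (nucleotide analysis without 3rd positions)."""
    return "".join(codon[:2] for codon in split_codons(seq))


def translate(seq):
    """Translate to amino acids; '---' stays a gap, unknown codons become 'X'."""
    return "".join(
        "-" if codon == "---" else CODON_TABLE.get(codon, "X")
        for codon in split_codons(seq)
    )


if __name__ == "__main__":
    dna = "ATGGCCAAAGGT"
    print("all positions:      ", dna)
    print("positions 1+2 only: ", drop_third_positions(dna))
    print("protein:            ", translate(dna))
```

The full sequence would be analyzed under a nucleotide or codon model, the reduced sequence under a nucleotide model, and the translation under an amino acid model. Keeping all three derived from one input makes sure they describe the same sites.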

Country

Austria

Related Organizations

- European Molecular Biology Laboratory (Germany)
- University College London (United Kingdom)
- UNIVERSITY COLLEGE LONDON, Bartlett School of Planning (United Kingdom)
- Vetmeduni Vienna (Austria)
- European Bioinformatics Institute (United Kingdom)


Keywords

Evolution, Molecular; Codon-Substitution Models; Amino-Acid Substitution; Compositional Heterogeneity; Sister Group; Nucleotide Substitution; Evolutionary Trees; Likelihood Models; Supports Sponges; Mixture-Models; Reconstruction; Base Sequence; Models, Genetic; Nucleotides; Animals; Amino Acids; Codon; Phylogeny; Regular Manuscripts

Impact by BIP!

- selected citations: 25. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.


Green

hybrid

Funded by

UKRI | UCL Biosciences Big Data; UKRI | Efficient Bayesian phylogenomic dating with new models of trait evolution and rich diversities of living and fossil species; UKRI | Addressing the problem of deep coalescence in ancient radiations: Resolving the explosive radiation of the Lophotrochozoa.