Supplementary Information for Phylogenetic analyses of ray-finned fishes (Actinopterygii) using collagen type I protein sequences

Authors: Harvey, Virginia; Keating, Joseph; Buckley, Michael

Abstract

Ray-finned fishes (Actinopterygii) are the largest and most diverse group of vertebrates, comprising over half of all living vertebrate species. Phylogenetic relationships among ray-finned fishes have historically been inferred from morphology, which has notoriously failed to resolve higher-order relationships, such as those within the percomorphs. More recently, comprehensive genomic analyses have provided further resolution of actinopterygian phylogeny, including higher-order relationships. Such analyses are rightfully regarded as the ‘gold standard’ for phylogenetics. However, DNA retrieval requires modern or well-preserved tissue, and DNA is less likely to be preserved in archaeological or fossil specimens. In contrast, some proteins, such as collagen, are phylogenetically informative and can survive into deep time. Here, we test the utility of collagen type I amino acid sequences for phylogenetic estimation of ray-finned fishes. We estimate topology using Bayesian approaches and compare the congruence of our estimated trees with published genomic phylogenies. Furthermore, we apply a Bayesian molecular clock approach and compare estimated divergence dates with previously published genomic clock analyses. In our collagen-derived trees, 77% of node positions are congruent with those in recent genomic-derived trees, with the majority of discrepancies occurring in higher-order node positions, almost exclusively within the Percomorpha. Our molecular clock trees present divergence times that are fairly comparable with those from genomic-based phylogenetic analyses. We estimate the mean node age of Actinopteri at ~293 million years (Ma), the base of Teleostei at ~211 Ma and the radiation of percomorphs beginning at ~141 Ma (compared with ~350 Ma, ~250–283 Ma and ~120–133 Ma in genomic trees, respectively).
Finally, we show that the average rate of collagen (I) sequence evolution is 0.9 amino acid substitutions for every million years of divergence, with the α3 (I) sequence evolving the fastest, followed by the α2 (I) chain. This is the quickest rate known for any vertebrate group. We demonstrate that phylogenetic analyses using collagen type I amino acid sequences generate tangible signals for actinopterygians that are highly congruent with recent genomic-level studies. However, there is limited congruence within percomorphs, perhaps due to clade-specific functional constraints acting upon collagen sequences. Our results provide important insights for future phylogenetic analyses incorporating extinct actinopterygian species via collagen (I) sequencing.
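One simple way to express topological congruence between an estimated tree and a reference tree is the share of clades in the estimated tree that also appear in the reference. The Python sketch below illustrates that idea only. The nested-tuple tree representation, the function names and the toy topologies are assumptions made for illustration, and the congruence measure actually used to obtain the 77% figure is not specified here and may differ.

```python
def clades(tree):
    """Return the non-trivial clades of a rooted tree as frozensets of tip names.

    Assumption for this sketch: a tree is a nested tuple and a tip is a string.
    """
    found = set()

    def tips(node):
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(tips(child) for child in node))
        found.add(members)
        return members

    root = tips(tree)
    found.discard(root)  # the root clade is shared by any two trees on the same tips
    return found


def percent_congruent(estimated, reference):
    """Percentage of clades in `estimated` that are also present in `reference`."""
    est, ref = clades(estimated), clades(reference)
    if not est:
        return 0.0
    return 100.0 * len(est & ref) / len(est)


# Toy, made-up topologies (not the study's data).
reference = ((("A", "B"), "C"), (("D", "E"), "F"))
estimated = ((("A", "B"), "C"), (("D", "F"), "E"))

print(percent_congruent(estimated, reference))  # 75.0
```

In this toy case three of the four clades in the estimated tree (A+B, A+B+C and D+E+F) are recovered in the reference, giving 75%.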

1. Bayesian topology analysis: Bayesian phylogenetic trees were estimated using MrBayes (v3.2.7). A collagen (I) sequence dataset was run in PartitionFinder2 using the ‘MrBayes only’ option, with both linked and unlinked branch lengths tested.
2. Tree space visualisation: Tree space was visualised in R using a custom script that relies on the phylogenetic packages phangorn, Quartet, vegan and ade4.
3. Bayesian clock analysis: The rate of sequence evolution per collagen (I) α-chain was investigated in MrBayes using both a uniform and a birth-death clock prior.
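Tree space visualisation generally starts from a matrix of pairwise distances between trees, which is then projected into two dimensions by an ordination method. The custom R script itself is not reproduced here, so the following Python sketch only illustrates the first step under stated assumptions: trees are nested tuples, the distance is a rooted Robinson-Foulds-style count of clades found in one tree but not the other, and all names and topologies are hypothetical. The distance measures used in the actual script may differ.

```python
from itertools import combinations


def clade_set(tree):
    """Collect the clades of a rooted tree (nested tuples, string tips) as frozensets."""
    found = set()

    def tips(node):
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(tips(child) for child in node))
        found.add(members)
        return members

    tips(tree)
    return found


def rf_distance(tree_a, tree_b):
    """Number of clades present in exactly one of the two trees."""
    return len(clade_set(tree_a) ^ clade_set(tree_b))


def distance_matrix(trees):
    """Symmetric matrix of pairwise distances, the usual input to an ordination."""
    n = len(trees)
    matrix = [[0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        matrix[i][j] = matrix[j][i] = rf_distance(trees[i], trees[j])
    return matrix


# Toy, made-up topologies (not the study's data).
trees = [
    ((("A", "B"), "C"), (("D", "E"), "F")),
    ((("A", "B"), "C"), (("D", "F"), "E")),
    ((("A", "C"), "B"), (("D", "F"), "E")),
]

for row in distance_matrix(trees):
    print(row)
# [0, 2, 4]
# [2, 0, 2]
# [4, 2, 0]
```

A matrix like this would then be passed to an ordination routine to place each tree as a point, so that trees with similar topologies plot close together.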

Keywords

FOS: Biological sciences
