Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Federated Research D...arrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2012
License: CC 0
Data sources: ZENODO
DRYAD
Dataset . 2012
License: CC 0
Data sources: Datacite
DRYAD
Dataset . 2012
License: CC 0
Data sources: Datacite
DRYAD
Dataset . 2015
License: CC 0
Data sources: Datacite
versions View all 6 versions
addClaim

Data from: Origin of land plants revisited in the light of sequence contamination and missing data

Authors: Laurin-Lemay, Simon; Brinkmann, Henner; Philippe, Hervé;

Data from: Origin of land plants revisited in the light of sequence contamination and missing data

Abstract

Figure-S3-54Figures S3-54: Single gene phylogenies used to validate the contamination of the charophyte sequences detected by blast search.FigS3-54-Final.pdfFigure-S55-109Figures S55-109: Single gene phylogenies used for the congruence test.FigS55-109-Final.pdfTable-S2Table S2: Summary of the contaminations detected in the Finet et al. (2010) dataset.Table_S2-Final.pdfTable-S3Table S3: Impact of taxon sampling on the GTR+G and CATGTR+G inferences.Table_S3-Final.pdfComplete-22360NEXUS file: 40 taxa, 119 proteins, 22,360 unambiguously aligned amino acid positions, 11.8% of missing data.Finet-99NEXUS file: 77 ribosomal protein alignments of Finet et al., with 99 sequences deleted. 74 contaminant sequences were detected by the congruence test and a total of 99 sequences were removed because in 25 cases it was impossible to identify the correct sequence.Not-Complete-43300NEXUS file: 40 taxa, 164 proteins, 43,300 unambiguously aligned amino acid positions. Ribosomal dataset (11,571 positions and 4.7 % of missing data) and non-ribosomal dataset (31,729 positions and 24.1 % of missing data).Table-S1Table S1: Contaminations detected in alignments of Finet et al. (2010) by Blast search and by congruence test.Table_S1-Final.pdf164-single-gene-before-SCaFoSNEXUS files: 164 orthologous genes before the use of SCaFoS (selection of sequences, species and genes).FINET-sineContam-TreesThe trees (RAxML and PhyloBayes) are inferred from the cleaned dataset (Finet-99 NEXUS file).PhyloBayes-Trees20 PhyloBayes trees (CATGTR+G) obtained from five different partitions (missing data) with four different taxon samples.Phylobayes-Trees.zipRAxML-Trees20 RAxML Best trees (GTR+G) + 20 Bipartitions (GTR+G) obtained from five different partitions (missing data) with four different taxon samples.

Knowing the closest relatives of land plants is key to understanding the complex adaptations to terrestrial life. Unfortunately, multi-gene analyses yield highly incongruent results, suggesting for instance Charales, Zygnematales, or Coleochaete as the sister-group of land plants. Such controversy may result from the real history of life, in particular closely spaced speciation events, incomplete lineage sorting, gene duplication or horizontal gene transfer. In such cases, the solution resides in improved taxon sampling and sophisticated models of evolution. However, we will show that the quality of data used to infer the phylogeny may also play a major role, creating unnecessary controversy. In particular, the inclusion of contaminant sequences from other species, and of genes with incomplete taxon sampling explains a large part of the discrepancies observed between various studies. The use of a carefully checked and almost complete dataset suggests that land plants are closely related to a group composed of Zygnematales and Coleochaetales.

Country
Canada
Related Organizations
Keywords

missing data, contamination, Zygnematales, phylogenomics, Coleochaetales

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    1
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
    OpenAIRE UsageCounts
    Usage byUsageCounts
    visibility views 254
    download downloads 216
  • 254
    views
    216
    downloads
    Powered byOpenAIRE UsageCounts
Powered by OpenAIRE graph
Found an issue? Give us feedback
visibility
download
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
views
OpenAIRE UsageCountsViews provided by UsageCounts
downloads
OpenAIRE UsageCountsDownloads provided by UsageCounts
1
Average
Average
Average
254
216
Related to Research communities