We describe new methods for characterizing gene tree discordance in phylogenomic datasets, which screen for deviations from neutral expectations, summarize variation in statistical support among gene trees, and allow comparison of the patterns of discordance induced by various analysis choices. Using an exceptionally complete set of genome sequences for the short arm of chromosome 3 in Oryza (rice) species, we applied these methods to identify the causes and consequences of differing patterns of discordance in the sets of gene trees inferred using a panel of 20 distinct analysis pipelines. We found that discordance patterns were strongly affected by aspects of data selection, alignment, and alignment masking. Unusual patterns of discordance evident when using certain pipelines were reduced or eliminated by using alternative pipelines, suggesting that they were the product of methodological biases rather than evolutionary processes. In some cases, once such biases were eliminated, evolutionary processes such as introgression could be implicated. Additionally, patterns of gene tree discordance had significant downstream impacts on species tree inference. For example, inference from supermatrices was positively misleading when pipelines that led to biased gene trees were used. Several results may generalize to other data sets: we found that gene tree and species tree inference gave more reasonable results when intron sequence was included during sequence alignment and/or tree inference, the alignment software PRANK was used, and/or detectable "block-shift" alignment artifacts were removed. We discuss our findings in the context of well-established relationships in Oryza and continuing controversies regarding the domestication history of O. sativa.
SupplementaryFigures1-5
onlineAppendices
collectedAlignmentsForPosting.tar: About 26,000 alignments used in study, in nexus format.
annotationGffsForPosting.tar: gff formatted files describing gene annotations not available from other sources. Coordinates correspond to sequence assemblies deposited in Dryad, which may differ from versions in Genbank.
SupplementaryTables1-3
seqAssembliesForPosting: Sequence assemblies in fasta format for all taxa except O. sativa. Some differ from assemblies deposited in Genbank.
gene trees, species trees, phylogenomics, Oryza, discordance, multilocus data
| Indicator | Description | Value |
| --- | --- | --- |
| selected citations | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 |
| popularity | This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| influence | This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| impulse | This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | | 70 |
| downloads | | 37 |

Views provided by UsageCounts
Downloads provided by UsageCounts