
A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution.
106014 Genomics, DNA, Plant, repeats, Genes, Insect, Repetitive DNA, Magnoliopsida, RepetitiveDNA, genomics, 106014 Genomik, Animals, Cluster Analysis, molecular systematics, Phylogeny, Molecular systematics, Repetitive Sequences, Nucleic Acid, Genome, repetitive DNA, Continuous characters, continuous characters, 106008 Botanik, phylogenomics, Genomics, 106008 Botany, phylogenetics, Phylogenetics, C400 Genetics, Next-generation sequencing, next-generation sequencing, Drosophila, Regular Articles
106014 Genomics, DNA, Plant, repeats, Genes, Insect, Repetitive DNA, Magnoliopsida, RepetitiveDNA, genomics, 106014 Genomik, Animals, Cluster Analysis, molecular systematics, Phylogeny, Molecular systematics, Repetitive Sequences, Nucleic Acid, Genome, repetitive DNA, Continuous characters, continuous characters, 106008 Botanik, phylogenomics, Genomics, 106008 Botany, phylogenetics, Phylogenetics, C400 Genetics, Next-generation sequencing, next-generation sequencing, Drosophila, Regular Articles
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 131 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
