
Abstract Ancient whole-genome duplications (WGDs) leave signatures in comparative genomic data sets that can be harnessed to detect these events of presumed evolutionary importance. Current statistical approaches for the detection of ancient WGDs in a phylogenetic context have two main drawbacks. The first is that unwarranted restrictive assumptions on the “background” gene duplication and loss rates make inferences unreliable in the face of model violations. The second is that most methods can only be used to examine a limited set of a priori selected WGD hypotheses and cannot be used to discover WGDs in a phylogeny. In this study, we develop an approach for WGD inference using gene count data that seeks to overcome both issues. We employ a phylogenetic birth–death model that includes WGD in a flexible hierarchical Bayesian approach and use reversible-jump Markov chain Monte Carlo to perform Bayesian inference of branch-specific duplication, loss, and WGD retention rates across the space of WGD configurations. We evaluate the proposed method using simulations, apply it to data sets from flowering plants, and discuss the statistical intricacies of model-based WGD inference.
Genome, Models, Genetic, Bayesian inference, Bayes Theorem, 310, Markov Chains, Phylogenetics, Polyploidy, Magnoliopsida, Gene content evolution, Genetic Techniques, Genome duplication, Gene Duplication, Chromosome Duplication, Computer Simulation, Whole-genome duplication (WGD), Reversible-jump MCMC, Monte Carlo Method, Phylogeny
Genome, Models, Genetic, Bayesian inference, Bayes Theorem, 310, Markov Chains, Phylogenetics, Polyploidy, Magnoliopsida, Gene content evolution, Genetic Techniques, Genome duplication, Gene Duplication, Chromosome Duplication, Computer Simulation, Whole-genome duplication (WGD), Reversible-jump MCMC, Monte Carlo Method, Phylogeny
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 9 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
