
Abstract We develop a Bayesian method for inferring the species phylogeny under the multispecies coalescent (MSC) model. To improve the mixing properties of the Markov chain Monte Carlo (MCMC) algorithm that traverses the space of species trees, we implement two efficient MCMC proposals: the first is based on the Subtree Pruning and Regrafting (SPR) algorithm and the second is based on a node-slider algorithm. Like the Nearest-Neighbor Interchange (NNI) algorithm we implemented previously, both new algorithms propose changes to the species tree, while simultaneously altering the gene trees at multiple genetic loci to automatically avoid conflicts with the newly proposed species tree. The method integrates over gene trees, naturally taking account of the uncertainty of gene tree topology and branch lengths given the sequence data. A simulation study was performed to examine the statistical properties of the new method. The method was found to show excellent statistical performance, inferring the correct species tree with near certainty when 10 loci were included in the dataset. The prior on species trees has some impact, particularly for small numbers of loci. We analyzed several previously published datasets (both real and simulated) for rattlesnakes and Philippine shrews, in comparison with alternative methods. The results suggest that the Bayesian coalescent-based method is statistically more efficient than heuristic methods based on summary statistics, and that our implementation is computationally more efficient than alternative full-likelihood methods under the MSC. Parameter estimates for the rattlesnake data suggest drastically different evolutionary dynamics between the nuclear and mitochondrial loci, even though they support largely consistent species trees. We discuss the different challenges facing the marginal likelihood calculation and transmodel MCMC as alternative strategies for estimating posterior probabilities for species trees. [Bayes factor; Bayesian inference; MCMC; multispecies coalescent; nodeslider; species tree; SPR.]
570, MCMC, Bayesian inference, SPR, Bioengineering, nodeslider, species tree, Models, Biological, 2.5 Research design and methodologies (aetiology), Models, Genetics, Animals, Computer Simulation, Aetiology, Quantitative Biology - Populations and Evolution, Phylogeny, Evolutionary Biology, Shrews, Crotalus, Populations and Evolution (q-bio.PE), Bayes Theorem, Biological, Classification, Bayes factor, multispecies coalescent, FOS: Biological sciences, Algorithms, Regular Articles
570, MCMC, Bayesian inference, SPR, Bioengineering, nodeslider, species tree, Models, Biological, 2.5 Research design and methodologies (aetiology), Models, Genetics, Animals, Computer Simulation, Aetiology, Quantitative Biology - Populations and Evolution, Phylogeny, Evolutionary Biology, Shrews, Crotalus, Populations and Evolution (q-bio.PE), Bayes Theorem, Biological, Classification, Bayes factor, multispecies coalescent, FOS: Biological sciences, Algorithms, Regular Articles
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 161 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
