
Abstract General genealogical processes such as Λ- and Ξ-coalescents, which respectively model multiple and simultaneous mergers, have important applications in studying marine species, strong positive selection, recurrent selective sweeps, strong bottlenecks, large sample sizes, and so on. Recently, there has been significant progress in developing useful inference tools for such general models. In particular, inference methods based on the site frequency spectrum (SFS) have received noticeable attention. Here, we derive a new formula for the expected SFS for general Λ- and Ξ-coalescents, which leads to an efficient algorithm. For time-homogeneous coalescents, the runtime of our algorithm for computing the expected SFS is O(n2), where n is the sample size. This is a factor of n2 faster than the state-of-the-art method. Furthermore, in contrast to existing methods, our method generalizes to time-inhomogeneous Λ- and Ξ-coalescents with measures that factorize as Λ(dx)/ζ(t) and Ξ(dx)/ζ(t), respectively, where ζ denotes a strictly positive function of time. The runtime of our algorithm in this setting is O(n3). We also obtain general theoretical results for the identifiability of the Λ measure when ζ is a constant function, as well as for the identifiability of the function ζ under a fixed Ξ measure.
simultaneous merger, multiple merger, 510, Genetic, Theoretical, Models, Genetics, FOS: Mathematics, Quantitative Biology - Populations and Evolution, Models, Genetic, Probability (math.PR), Populations and Evolution (q-bio.PE), Biological Sciences, Models, Theoretical, identifiability, 004, Biochemistry and cell biology, FOS: Biological sciences, Biochemistry and Cell Biology, frequency spectrum, Mathematics - Probability, Algorithms, Developmental Biology
simultaneous merger, multiple merger, 510, Genetic, Theoretical, Models, Genetics, FOS: Mathematics, Quantitative Biology - Populations and Evolution, Models, Genetic, Probability (math.PR), Populations and Evolution (q-bio.PE), Biological Sciences, Models, Theoretical, identifiability, 004, Biochemistry and cell biology, FOS: Biological sciences, Biochemistry and Cell Biology, frequency spectrum, Mathematics - Probability, Algorithms, Developmental Biology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 39 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
