
pmid: 20349153
The site-frequency spectrum, representing the distribution of allele frequencies at a set of polymorphic sites, is a commonly used summary statistic in population genetics. Explicit forms of the spectrum are known for both models with and without selection if independence among sites is assumed. The availability of these explicit forms has allowed for maximum likelihood estimation of selection, developed first in the Poisson random field model of Sawyer and Hartl, which is now the primary method for estimating selection directly from DNA sequence data. The independence assumption, which amounts to assume free recombination between sites, is, however, a limiting case for many population genetics models. Here, we extend the site-frequency spectrum theory to consider the case where the sites are completely linked. We use diffusion approximation to calculate the joint distribution of the allele frequencies of linked sites for models without selection and for models with equal coefficient selection. The joint distribution is derived by first constructing Green's functions corresponding to multiallele diffusion equations. We show that the site-frequency spectrum is highly correlated between frequencies that are complementary (i.e., sum to 1), and the correlation is significantly elevated by positive selection. The results presented here can be used to extend the Poisson random field to allow for estimating selection for correlated sites. More generally, the Green's function construction should be able to aid in studying the genetic drift of multiple alleles in other cases.
Pharmacology, Mathematics(all), Agricultural and Biological Sciences(all), Models, Genetic, Biochemistry, Genetics and Molecular Biology(all), Biochemistry, molecular biology, Neuroscience(all), Immunology, Genetic Drift, diffusion approximation, Green's function, Applications of statistics to biology and medical sciences; meta analysis, Wright-Fisher model, Genetics, Population, Computational Theory and Mathematics, Gene Frequency, Environmental Science(all), Computer Simulation, Genetics and epigenetics, Applications of Brownian motions and diffusion theory (population genetics, absorption problems, etc.)
Pharmacology, Mathematics(all), Agricultural and Biological Sciences(all), Models, Genetic, Biochemistry, Genetics and Molecular Biology(all), Biochemistry, molecular biology, Neuroscience(all), Immunology, Genetic Drift, diffusion approximation, Green's function, Applications of statistics to biology and medical sciences; meta analysis, Wright-Fisher model, Genetics, Population, Computational Theory and Mathematics, Gene Frequency, Environmental Science(all), Computer Simulation, Genetics and epigenetics, Applications of Brownian motions and diffusion theory (population genetics, absorption problems, etc.)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 8 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
