
doi: 10.1002/sim.5792
pmid: 23609602
We present a Bayesian approach for estimating the relative frequencies of multi‐single nucleotide polymorphism (SNP) haplotypes in populations of the malaria parasite Plasmodium falciparum by using microarray SNP data from human blood samples. Each sample comes from a malaria patient and contains one or several parasite clones that may genetically differ. Samples containing multiple parasite clones with different genetic markers pose a special challenge. The situation is comparable with a polyploid organism. The data from each blood sample indicates whether the parasites in the blood carry a mutant or a wildtype allele at various selected genomic positions. If both mutant and wildtype alleles are detected at a given position in a multiply infected sample, the data indicates the presence of both alleles, but the ratio is unknown. Thus, the data only partially reveals which specific combinations of genetic markers (i.e. haplotypes across the examined SNPs) occur in distinct parasite clones. In addition, SNP data may contain errors at non‐negligible rates. We use a multinomial mixture model with partially missing observations to represent this data and a Markov chain Monte Carlo method to estimate the haplotype frequencies in a population. Our approach addresses both challenges, multiple infections and data errors. Copyright © 2013 John Wiley & Sons, Ltd.
haplotypes, Models, Statistical, frequency estimation, Plasmodium falciparum, malaria, Genetic Variation, multiple infection, Polymorphism, Single Nucleotide, Markov Chains, Applications of statistics to biology and medical sciences; meta analysis, Papua New Guinea, Gibbs sampling, Haplotypes, Data Interpretation, Statistical, Animals, Humans, Bayesian mixture model, Malaria, Falciparum, Monte Carlo Method, Algorithms, Oligonucleotide Array Sequence Analysis
haplotypes, Models, Statistical, frequency estimation, Plasmodium falciparum, malaria, Genetic Variation, multiple infection, Polymorphism, Single Nucleotide, Markov Chains, Applications of statistics to biology and medical sciences; meta analysis, Papua New Guinea, Gibbs sampling, Haplotypes, Data Interpretation, Statistical, Animals, Humans, Bayesian mixture model, Malaria, Falciparum, Monte Carlo Method, Algorithms, Oligonucleotide Array Sequence Analysis
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 12 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
