
Abstract A novel haplotype association method is presented, and its power is demonstrated. Relying on a statistical model for linkage disequilibrium (LD), the method first infers ancestral haplotypes and their loadings at each marker for each individual. The loadings are then used to quantify local haplotype sharing between individuals at each marker. A statistical model was developed to link the local haplotype sharing and phenotypes to test for association. We devised a novel method to fit the LD model, reducing the complexity from putatively quadratic to linear (in the number of ancestral haplotypes). Therefore, the LD model can be fitted to all study samples simultaneously, and, consequently, our method is applicable to big data sets. Compared to existing haplotype association methods, our method integrated out phase uncertainty, avoided arbitrariness in specifying haplotypes, and had the same number of tests as the single-SNP analysis. We applied our method to data from the Wellcome Trust Case Control Consortium and discovered eight novel associations between seven gene regions and five disease phenotypes. Among these, GRIK4, which encodes a protein that belongs to the glutamate-gated ionic channel family, is strongly associated with both coronary artery disease and rheumatoid arthritis. A software package implementing methods described in this article is freely available at http://www.haplotype.org.
Models, Genetic, Bayes Theorem, Single Nucleotide, Investigations, Polymorphism, Single Nucleotide, Linkage Disequilibrium, Databases, Phenotype, Genetic, Haplotypes, Models, Case-Control Studies, Databases, Genetic, Humans, Computer Simulation, Genetic Predisposition to Disease, Polymorphism, Algorithms, Alleles, Genetic Association Studies
Models, Genetic, Bayes Theorem, Single Nucleotide, Investigations, Polymorphism, Single Nucleotide, Linkage Disequilibrium, Databases, Phenotype, Genetic, Haplotypes, Models, Case-Control Studies, Databases, Genetic, Humans, Computer Simulation, Genetic Predisposition to Disease, Polymorphism, Algorithms, Alleles, Genetic Association Studies
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 29 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
