
AbstractBackgroundThe genetic etiology of complex diseases in human has been commonly viewed as a complex process involving both genetic and environmental factors functioning in a complicated manner. Quite often the interactions among genetic variants play major roles in determining the susceptibility of an individual to a particular disease. Statistical methods for modeling interactions underlying complex diseases between single genetic variants (e.g. single nucleotide polymorphisms or SNPs) have been extensively studied. Recently, haplotype-based analysis has gained its popularity among genetic association studies. When multiple sequence or haplotype interactions are involved in determining an individual's susceptibility to a disease, it presents daunting challenges in statistical modeling and testing of the interaction effects, largely due to the complicated higher order epistatic complexity.ResultsIn this article, we propose a new strategy in modeling haplotype-haplotype interactions under the penalized logistic regression framework with adaptiveL1-penalty. We consider interactions of sequence variants between haplotype blocks. The adaptiveL1-penalty allows simultaneous effect estimation and variable selection in a single model. We propose a new parameter estimation method which estimates and selects parameters by the modified Gauss-Seidel method nested within the EM algorithm. Simulation studies show that it has low false positive rate and reasonable power in detecting haplotype interactions. The method is applied to test haplotype interactions involved in mother and offspring genome in a small for gestational age (SGA) neonates data set, and significant interactions between different genomes are detected.ConclusionsAs demonstrated by the simulation studies and real data analysis, the approach developed provides an efficient tool for the modeling and testing of haplotype interactions. The implementation of the method in R codes can be freely downloaded fromhttp://www.stt.msu.edu/~cui/software.html.
Likelihood Functions, Models, Genetic, Methodology Article, Infant, Newborn, QH426-470, Logistic Models, Haplotypes, Infant, Small for Gestational Age, Genetics, Humans, Genetics(clinical), Computer Simulation, Algorithms, Genetic Association Studies
Likelihood Functions, Models, Genetic, Methodology Article, Infant, Newborn, QH426-470, Logistic Models, Haplotypes, Infant, Small for Gestational Age, Genetics, Humans, Genetics(clinical), Computer Simulation, Algorithms, Genetic Association Studies
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 17 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
