
pmid: 11754464
AbstractLarge, population‐based samples and large‐scale genotyping are being used to evaluate disease/gene associations. A substantial drawback to such samples is the fact that population substructure can induce spurious associations between genes and disease. We review two methods, called genomic control (GC) and structured association (SA), that obviate many of the concerns about population substructure by using the features of the genomes present in the sample to correct for stratification. The GC approach exploits the fact that population substructure generates “over dispersion” of statistics used to assess association. By testing multiple polymorphisms throughout the genome, only some of which are pertinent to the disease of interest, the degree of overdispersion generated by population substructure can be estimated and taken into account. The SA approach assumes that the sampled population, although heterogeneous, is composed of subpopulations that are themselves homogeneous. By using multiple polymorphisms throughout the genome, this “latent class method” estimates the probability sampled individuals derive from each of these latent subpopulations. GC has the advantage of robustness, simplicity, and wide applicability, even to experimental designs such as DNA pooling. SA is a bit more complicated but has the advantage of greater power in some realistic settings, such as admixed populations or when association varies widely across subpopulations. It, too, is widely applicable. Both also have weaknesses, as elaborated in our review. Genet. Epidemiol. 21:273–284, 2001. © 2001 Wiley‐Liss, Inc.
Genetic Markers, Genotype, Linkage Disequilibrium, Genetic Heterogeneity, Quantitative Trait, Heritable, Bias, FOS: Mathematics, Humans, Probability, Analysis of Variance, Molecular Epidemiology, Polymorphism, Genetic, Models, Genetic, Statistics, Reproducibility of Results, Confounding Factors, Epidemiologic, Gene Pool, Genomics, Epidemiologic Studies, Genetics, Population, Haplotypes, Case-Control Studies, Data Interpretation, Statistical
Genetic Markers, Genotype, Linkage Disequilibrium, Genetic Heterogeneity, Quantitative Trait, Heritable, Bias, FOS: Mathematics, Humans, Probability, Analysis of Variance, Molecular Epidemiology, Polymorphism, Genetic, Models, Genetic, Statistics, Reproducibility of Results, Confounding Factors, Epidemiologic, Gene Pool, Genomics, Epidemiologic Studies, Genetics, Population, Haplotypes, Case-Control Studies, Data Interpretation, Statistical
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 120 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
