
Recently a nonparametric method called LS‐imputation has been proposed for large‐scale trait imputation based on a GWAS summary dataset and a large set of genotyped individuals. The imputed trait values, along with the genotypes, can be treated as an individual‐level dataset for downstream genetic analyses, including those that cannot be done with GWAS summary data. However, since the covariance matrix of the imputed trait values is often too large to calculate, the current method imposes a working assumption that the imputed trait values are identically and independently distributed, which is incorrect in truth. Here we propose a “divide and conquer/combine” strategy to estimate and account for the covariance matrix of the imputed trait values via batches, thus relaxing the incorrect working assumption. Applications of the methods to the UK Biobank data for marginal association analysis showed some improvement by the new method in some cases, but overall the original method performed well, which was explained by nearly constant variances of and mostly weak correlations among imputed trait values.
least squares, Phenotype, Genotype, GWAS, LS-imputation, SNP, Humans, linear models, Polymorphism, Single Nucleotide, Applications of statistics to biology and medical sciences; meta analysis, Genome-Wide Association Study
least squares, Phenotype, Genotype, GWAS, LS-imputation, SNP, Humans, linear models, Polymorphism, Single Nucleotide, Applications of statistics to biology and medical sciences; meta analysis, Genome-Wide Association Study
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
