
Abstract Background One way of investigating how genes affect human traits would be with a genome-wide association study (GWAS). Genetic markers, known as single-nucleotide polymorphism (SNP), are used in GWAS. This raises privacy and security concerns as these genetic markers can be used to identify individuals uniquely. This problem is further exacerbated by a large number of SNPs needed, which produce reliable results at a higher risk of compromising the privacy of participants. Methods We describe a method using homomorphic encryption (HE) to perform GWAS in a secure and private setting. This work is based on a proposed algorithm. Our solution mainly involves homomorphically encrypted matrix operations and suitable approximations that adapts the semi-parallel GWAS algorithm for HE. We leverage upon the complex space of the CKKS encryption scheme to increase the number of SNPs that can be packed within a ciphertext. We have also developed a cache module that manages ciphertexts, reducing the memory footprint. Results We have implemented our solution over two HE open source libraries, HEAAN and SEAL. Our best implementation took 24.70 minutes for a dataset with 245 samples, over 4 covariates and 10643 SNPs. Conclusions We demonstrate that it is possible to achieve GWAS with homomorphic encryption with suitable approximations.
FOS: Computer and information sciences, Computer Science - Cryptography and Security, QH426-470, Statistics - Applications, Polymorphism, Single Nucleotide, Homomorphic encryption (HE), Single nucleotide polymorphism (SNP), Genetics, Humans, Quantitative Biology - Genomics, Applications (stat.AP), Genome wide association studies (GWAS), Internal medicine, Computer Security, Genomics (q-bio.GN), Research, RC31-1245, Privacy, FOS: Biological sciences, Cryptography and Security (cs.CR), Algorithms, Genome-Wide Association Study
FOS: Computer and information sciences, Computer Science - Cryptography and Security, QH426-470, Statistics - Applications, Polymorphism, Single Nucleotide, Homomorphic encryption (HE), Single nucleotide polymorphism (SNP), Genetics, Humans, Quantitative Biology - Genomics, Applications (stat.AP), Genome wide association studies (GWAS), Internal medicine, Computer Security, Genomics (q-bio.GN), Research, RC31-1245, Privacy, FOS: Biological sciences, Cryptography and Security (cs.CR), Algorithms, Genome-Wide Association Study
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 27 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
