
pmid: 27535533
pmc: PMC5018207
Summary Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities generated as part of the Exome Aggregation Consortium (ExAC). The resulting catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We show that this catalogue can be used to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; we identify 3,230 genes with near-complete depletion of truncating variants, 72% of which have no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human “knockout” variants in protein-coding genes.
Proteome, General Science & Technology, DNA Mutational Analysis, 610, Datasets as Topic, GUIDELINES, Article, 576, Rare Diseases, Clinical Research, Exome Aggregation Consortium, Medicine and Health Sciences, Genetics, SEQUENCE VARIANTS, 2.1 Biological and endogenous factors, Humans, Exome, Genetic Testing, Aetiology, MUTATION, Science & Technology, Human Genome, Life Sciences, HUMAN-POPULATION HISTORY, Genetic Variation, FRAMEWORK, R1, HUMAN-DISEASE, EVOLUTION, NETWORKS, Multidisciplinary Sciences, Phenotype, DISCOVERY, Sample Size, Science & Technology - Other Topics, Generic health relevance, Genètica humana -- Variació, Biotechnology
Proteome, General Science & Technology, DNA Mutational Analysis, 610, Datasets as Topic, GUIDELINES, Article, 576, Rare Diseases, Clinical Research, Exome Aggregation Consortium, Medicine and Health Sciences, Genetics, SEQUENCE VARIANTS, 2.1 Biological and endogenous factors, Humans, Exome, Genetic Testing, Aetiology, MUTATION, Science & Technology, Human Genome, Life Sciences, HUMAN-POPULATION HISTORY, Genetic Variation, FRAMEWORK, R1, HUMAN-DISEASE, EVOLUTION, NETWORKS, Multidisciplinary Sciences, Phenotype, DISCOVERY, Sample Size, Science & Technology - Other Topics, Generic health relevance, Genètica humana -- Variació, Biotechnology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 9K | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 0.01% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 0.01% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 0.01% |
