
pmid: 37891200
pmc: PMC10611741
Abstract The short lengths of short-read sequencing reads challenge the analysis of paralogous genomic regions in exome and genome sequencing data. Most genetic variants within these homologous regions therefore remain unidentified in standard analyses. Here, we present a method (Chameleolyser) that accurately identifies single nucleotide variants and small insertions/deletions (SNVs/Indels), copy number variants and ectopic gene conversion events in duplicated genomic regions using whole-exome sequencing data. Application to a cohort of 41,755 exome samples yields 20,432 rare homozygous deletions and 2,529,791 rare SNVs/Indels, of which we show that 338,084 are due to gene conversion events. None of the SNVs/Indels are detectable using regular analysis techniques. Validation by high-fidelity long-read sequencing in 20 samples confirms >88% of called variants. Focusing on variation in known disease genes leads to a direct molecular diagnosis in 25 previously undiagnosed patients. Our method can readily be applied to existing exome data.
Radboudumc 4: lnfectious Diseases and Global Health Internal Medicine, Systems Analysis, Radboudumc 7: Neurodevelopmental disorders DCMN: Donders Center for Medical Neuroscience, DNA Copy Number Variations, Radboudumc 12: Sensory disorders DCMN: Donders Center for Medical Neuroscience, Science, Q, Radboudumc 4: lnfectious Diseases and Global Health Human Genetics, Radboud University Medical Center, High-Throughput Nucleotide Sequencing, SEQUENCE, Polymorphism, Single Nucleotide, EVOLUTION, Article, Radboudumc 6: Metabolic Disorders Human Genetics, GENE CONVERSION, ALIGNMENT, INDEL Mutation, Medicine and Health Sciences, Humans, Exome, COPY NUMBER VARIATION
Radboudumc 4: lnfectious Diseases and Global Health Internal Medicine, Systems Analysis, Radboudumc 7: Neurodevelopmental disorders DCMN: Donders Center for Medical Neuroscience, DNA Copy Number Variations, Radboudumc 12: Sensory disorders DCMN: Donders Center for Medical Neuroscience, Science, Q, Radboudumc 4: lnfectious Diseases and Global Health Human Genetics, Radboud University Medical Center, High-Throughput Nucleotide Sequencing, SEQUENCE, Polymorphism, Single Nucleotide, EVOLUTION, Article, Radboudumc 6: Metabolic Disorders Human Genetics, GENE CONVERSION, ALIGNMENT, INDEL Mutation, Medicine and Health Sciences, Humans, Exome, COPY NUMBER VARIATION
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 8 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
