
pmid: 36354000
pmc: PMC9665128
Illumina Infinium DNA Methylation (5mC) arrays are a popular technology for low-cost, high-throughput, genome-scale measurement of 5mC distribution, especially in cancer and other complex diseases. After the success of its HumanMethylation450 array (450k), Illumina released the MethylationEPIC array (850k) featuring increased coverage of enhancers. Despite the widespread use of 850k, analysis of the corresponding data remains suboptimal: it still relies mostly on Illumina's default annotation, which underestimates enhancerss and long noncoding RNAs. Results: We have thus developed an approach, based on the ENCODE and LNCipedia databases, which greatly improves upon Illumina's default annotation of enhancers and long noncoding transcripts. We compared the re-annotated 850k with both 450k and reduced-representation bisulphite sequencing (RRBS), another high-throughput 5mC profiling technology. We found 850k to cover at least three times as many enhancers and long noncoding RNAs as either 450k or RRBS. We further investigated the reproducibility of the three technologies, applying various normalization methods to the 850k data. Most of these methods reduced variability to a level below that of RRBS data. We then used 850k with our new annotation and normalization to profile 5mC changes in breast cancer biopsies. 850k highlighted aberrant enhancer methylation as the predominant feature, in agreement with previous reports. Our study provides an updated processing approach for 850k data, based on refined probe annotation and normalization, allowing for improved analysis of methylation at enhancers and long noncoding RNA genes. Our findings will help to further advance understanding of the DNA methylome in health and disease.
5mc, infinium, dna methylation, Oligonucleotide Array Sequence Analysis -- methods, QH426-470, epic, Long Noncoding -- genetics, methylationepic, Genetics, Humans, long noncoding RNA, Oligonucleotide Array Sequence Analysis, 5mC, Reproducibility of Results, Sciences bio-médicales et agricoles, DNA Methylation, 850k, Benchmarking, normalization, annotation, RNA, MethylationEPIC, enhancers, CpG Islands, RNA, Long Noncoding, long noncoding rna, EPIC, Research Paper
5mc, infinium, dna methylation, Oligonucleotide Array Sequence Analysis -- methods, QH426-470, epic, Long Noncoding -- genetics, methylationepic, Genetics, Humans, long noncoding RNA, Oligonucleotide Array Sequence Analysis, 5mC, Reproducibility of Results, Sciences bio-médicales et agricoles, DNA Methylation, 850k, Benchmarking, normalization, annotation, RNA, MethylationEPIC, enhancers, CpG Islands, RNA, Long Noncoding, long noncoding rna, EPIC, Research Paper
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 15 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
