
AbstractBackgroundDNA methylation (DNAm) is an epigenetic regulator of gene expression programs that can be altered by environmental exposures, aging, and in pathogenesis. Traditional analyses that associate DNAm alterations with phenotypes suffer from multiple hypothesis testing and multi-collinearity due to the high-dimensional, continuous, interacting and non-linear nature of the data. Deep learning analyses have shown much promise to study disease heterogeneity. DNAm deep learning approaches have not yet been formalized into user-friendly frameworks for execution, training, and interpreting models. Here, we describe MethylNet, a DNAm deep learning method that can construct embeddings, make predictions, generate new data, and uncover unknown heterogeneity with minimal user supervision.ResultsThe results of our experiments indicate that MethylNet can study cellular differences, grasp higher order information of cancer sub-types, estimate age and capture factors associated with smoking in concordance with known differences.ConclusionThe ability of MethylNet to capture nonlinear interactions presents an opportunity for further study of unknown disease, cellular heterogeneity and aging processes.
Aging, DNA methylation, QH301-705.5, Methodology Article, Computer applications to medicine. Medical informatics, R858-859.7, Deep learning, DNA Methylation, Supervised, Workflow automation, Unsupervised, User-Computer Interface, Deep Learning, Neoplasms, Humans, CpG Islands, High performance computing, Biology (General)
Aging, DNA methylation, QH301-705.5, Methodology Article, Computer applications to medicine. Medical informatics, R858-859.7, Deep learning, DNA Methylation, Supervised, Workflow automation, Unsupervised, User-Computer Interface, Deep Learning, Neoplasms, Humans, CpG Islands, High performance computing, Biology (General)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 111 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
