
doi: 10.11575/prism/48306
handle: 1880/120697
Advances in technology have enabled the study of diseases through multi-omics data, which combines information from genome, epigenome, transcriptome, proteome, and metabolome levels. Unlike single-omics approaches that provide limited insights, multi-omics integration offers a comprehensive understanding of biological systems by capturing interactions across molecular layers. In recent years, several methods have been developed to integrate omics data. For example, Simidjievski et al., 2019 introduced techniques that use Variational Autoencoders (VAEs) for data integration. Similarly, Zarayeneh et al., 2017 proposed a method called the Integrative Gene Regulatory Network (iGRN), which combines multiple layers of omics data using a network made up entirely of gene nodes. This thesis focuses on developing data integration architectures based on conditional variational autoencoders (CVAEs). The key advantage of this approach is that it allows class label information to be incorporated during the data integration process. To the best of our knowledge, CVAEs have not been applied in previous multi-omics research. Additionally, new methods for integrating more than two datasets using CVAEs have been introduced. This is a novel contribution to the field of multiomics data integration, as no prior studies have explored the use of CVAEs for integrating multiple datasets in this context. The proposed architectures were tested on both real and simulated datasets. The results from both studies showed that adding an outcome variable (class labels) to regular VAEs improved predictive performance. Additionally, integrating data from multiple datasets produced better results compared to using a single dataset for predictions or using VAEs without incorporating labels.
Multi-Omics Data, Statistics, Conditional Variational Autoencoder, FOS: Mathematics, Biostatistics
Multi-Omics Data, Statistics, Conditional Variational Autoencoder, FOS: Mathematics, Biostatistics
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
