Non-linear Multi Omics Data Integration Method Using Conditional Variational Autoencoders

Name: Non-linear Multi Omics Data Integration Method Using Conditional Variational Autoencoders
Creator: Gustinna Wadu, Dimuth Adeepa Gunarathne
Keywords: Multi-Omics Data, Statistics, Conditional Variational Autoencoder, FOS: Mathematics, Biostatistics

Gustinna Wadu, Dimuth Adeepa Gunarathne

Found an issue? Give us feedback

downloadFull-Text

PRISM: University of...arrow_drop_down

PRISM: University of Calgary Digital Repository

Master thesis . 2025

Full-Text: https://prism.ucalgary.ca/bitstreams/276d142c-638e-436e-826d-ed86295b412d/download

Data sources: PRISM: University of Calgary Digital Repository

https://dx.doi.org/10.11575/pr...

Master thesis . 2025

Data sources: Datacite

Non-linear Multi Omics Data Integration Method Using Conditional Variational Autoencoders

descriptionPublicationkeyboard_double_arrow_right Master thesis 27 Jan 2025Embargo end date: 31 Jan 2025 Canada English Publisher:Graduate Studies

Authors: Gustinna Wadu, Dimuth Adeepa Gunarathne;

doi: 10.11575/prism/48306

handle: 1880/120697

Non-linear Multi Omics Data Integration Method Using Conditional Variational Autoencoders

- Summary
- Subjects
- Metrics

Abstract

Advances in technology have enabled the study of diseases through multi-omics data, which combines information from genome, epigenome, transcriptome, proteome, and metabolome levels. Unlike single-omics approaches that provide limited insights, multi-omics integration offers a comprehensive understanding of biological systems by capturing interactions across molecular layers. In recent years, several methods have been developed to integrate omics data. For example, Simidjievski et al., 2019 introduced techniques that use Variational Autoencoders (VAEs) for data integration. Similarly, Zarayeneh et al., 2017 proposed a method called the Integrative Gene Regulatory Network (iGRN), which combines multiple layers of omics data using a network made up entirely of gene nodes. This thesis focuses on developing data integration architectures based on conditional variational autoencoders (CVAEs). The key advantage of this approach is that it allows class label information to be incorporated during the data integration process. To the best of our knowledge, CVAEs have not been applied in previous multi-omics research. Additionally, new methods for integrating more than two datasets using CVAEs have been introduced. This is a novel contribution to the field of multiomics data integration, as no prior studies have explored the use of CVAEs for integrating multiple datasets in this context. The proposed architectures were tested on both real and simulated datasets. The results from both studies showed that adding an outcome variable (class labels) to regular VAEs improved predictive performance. Additionally, integrating data from multiple datasets produced better results compared to using a single dataset for predictions or using VAEs without incorporating labels.

Country

Canada

Related Organizations

University of Calgary
Canada

Keywords

Multi-Omics Data, Statistics, Conditional Variational Autoencoder, FOS: Mathematics, Biostatistics

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green