
pmid: 31660856
pmc: PMC6819613
AbstractBackgroundCancer subtype classification attains the great importance for accurate diagnosis and personalized treatment of cancer. Latest developments in high-throughput sequencing technologies have rapidly produced multi-omics data of the same cancer sample. Many computational methods have been proposed to classify cancer subtypes, however most of them generate the model by only employing gene expression data. It has been shown that integration of multi-omics data contributes to cancer subtype classification.ResultsA new hierarchical integration deep flexible neural forest framework is proposed to integrate multi-omics data for cancer subtype classification named as HI-DFNForest. Stacked autoencoder (SAE) is used to learn high-level representations in each omics data, then the complex representations are learned by integrating all learned representations into a layer of autoencoder. Final learned data representations (from the stacked autoencoder) are used to classify patients into different cancer subtypes using deep flexible neural forest (DFNForest) model.Cancer subtype classification is verified on BRCA, GBM and OV data sets from TCGA by integrating gene expression, miRNA expression and DNA methylation data. These results demonstrated that integrating multiple omics data improves the accuracy of cancer subtype classification than only using gene expression data and the proposed framework has achieved better performance compared with other conventional methods.ConclusionThe new hierarchical integration deep flexible neural forest framework(HI-DFNForest) is an effective method to integrate multi-omics data to classify cancer subtypes.
Artificial neural network, FOS: Computer and information sciences, Cancer Research, Artificial intelligence, QH301-705.5, Bioinformatics, Computer applications to medicine. Medical informatics, R858-859.7, Gene Expression, Omics, Gene, Biochemistry, Genomic Data Integration, Computational biology, Neoplasms, Biochemistry, Genetics and Molecular Biology, Microarray Data Analysis and Gene Expression Profiling, Machine learning, Genetics, Humans, Biology (General), Molecular Biology, Data mining, Biology, Cancer, Genome, Cancer subtype classification, Methodology Article, High-Throughput Nucleotide Sequencing, Life Sciences, Deep learning, Autoencoder, Genomics, DNA Methylation, Computer science, Analysis of Gene Interaction Networks, Genomic Landscape of Cancer and Mutational Signatures, MicroRNAs, Biological Network Integration, FOS: Biological sciences, Data integration, Cascade forest
Artificial neural network, FOS: Computer and information sciences, Cancer Research, Artificial intelligence, QH301-705.5, Bioinformatics, Computer applications to medicine. Medical informatics, R858-859.7, Gene Expression, Omics, Gene, Biochemistry, Genomic Data Integration, Computational biology, Neoplasms, Biochemistry, Genetics and Molecular Biology, Microarray Data Analysis and Gene Expression Profiling, Machine learning, Genetics, Humans, Biology (General), Molecular Biology, Data mining, Biology, Cancer, Genome, Cancer subtype classification, Methodology Article, High-Throughput Nucleotide Sequencing, Life Sciences, Deep learning, Autoencoder, Genomics, DNA Methylation, Computer science, Analysis of Gene Interaction Networks, Genomic Landscape of Cancer and Mutational Signatures, MicroRNAs, Biological Network Integration, FOS: Biological sciences, Data integration, Cascade forest
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 95 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
