CAEM-GBDT: a cancer subtype identifying method using multi-omics data and convolutional autoencoder network

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 15 Jul 2024Publisher:Frontiers Media SAJournal:Frontiers in Bioinformatics, volume 4 (eissn: 2673-7647,

Copyright policy )

Authors: Jiquan Shen; Xuanhui Guo; Hanwen Bai; Junwei Luo;

doi: 10.3389/fbinf.2024.1403826

pmid: 39077754

pmc: PMC11284046

CAEM-GBDT: a cancer subtype identifying method using multi-omics data and convolutional autoencoder network

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

The identification of cancer subtypes plays a very important role in the field of medicine. Accurate identification of cancer subtypes is helpful for both cancer treatment and prognosis Currently, most methods for cancer subtype identification are based on single-omics data, such as gene expression data. However, multi-omics data can show various characteristics about cancer, which also can improve the accuracy of cancer subtype identification. Therefore, how to extract features from multi-omics data for cancer subtype identification is the main challenge currently faced by researchers. In this paper, we propose a cancer subtype identification method named CAEM-GBDT, which takes gene expression data, miRNA expression data, and DNA methylation data as input, and adopts convolutional autoencoder network to identify cancer subtypes. Through a convolutional encoder layer, the method performs feature extraction on the input data. Within the convolutional encoder layer, a convolutional self-attention module is embedded to recognize higher-level representations of the multi-omics data. The extracted high-level representations from the convolutional encoder are then concatenated with the input to the decoder. The GBDT (Gradient Boosting Decision Tree) is utilized for cancer subtype identification. In the experiments, we compare CAEM-GBDT with existing cancer subtype identifying methods. Experimental results demonstrate that the proposed CAEM-GBDT outperforms other methods. The source code is available from GitHub at https://github.com/gxh-1/CAEM-GBDT.git.

Related Organizations

Anhui University
China (People's Republic of)
Central South University
China (People's Republic of)
Henan Polytechnic University
China (People's Republic of)

Keywords

Bioinformatics, Computer applications to medicine. Medical informatics, R858-859.7, convolutional block attention module, cancer subtype, cancer subtype identification, multi-omics, convolutional autoencode

1 Research products, page 1 of 1

CAEM-GBDT software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average