PIMKL: Pathway-Induced Multiple Kernel Learning

descriptionPublicationkeyboard_double_arrow_right Article , Conference object , Other literature type , Preprint 05 Mar 2019Embargo end date: 01 Jan 2019 Switzerland English Publisher:Springer Science and Business Media LLCJournal:npj Systems Biology and Applications, volume 5 (eissn: 2056-7189,

Copyright policy )Funded by:EC | PrECISE

Authors: Matteo Manica; Joris Cadow; Roland Mathis; María Rodríguez Martínez;

doi: 10.1038/s41540-019-0086-3 , 10.3929/ethz-b-000331766 , 10.5281/zenodo.3374413 , 10.5281/zenodo.3374412 , 10.48550/arxiv.1803.11274

pmid: 30854223

pmc: PMC6401099

arXiv: 1803.11274

handle: 20.500.11850/331766

PIMKL: Pathway-Induced Multiple Kernel Learning

- Summary
- Subjects
- Metrics

Abstract

AbstractReliable identification of molecular biomarkers is essential for accurate patient stratification. While state-of-the-art machine learning approaches for sample classification continue to push boundaries in terms of performance, most of these methods are not able to integrate different data types and lack generalization power, limiting their application in a clinical setting. Furthermore, many methods behave as black boxes, and we have very little understanding about the mechanisms that lead to the prediction. While opaqueness concerning machine behavior might not be a problem in deterministic domains, in health care, providing explanations about the molecular factors and phenotypes that are driving the classification is crucial to build trust in the performance of the predictive system. We propose Pathway-Induced Multiple Kernel Learning (PIMKL), a methodology to reliably classify samples that can also help gain insights into the molecular mechanisms that underlie the classification. PIMKL exploits prior knowledge in the form of a molecular interaction network and annotated gene sets, by optimizing a mixture of pathway-induced kernels using a Multiple Kernel Learning (MKL) algorithm, an approach that has demonstrated excellent performance in different machine learning applications. After optimizing the combination of kernels to predict a specific phenotype, the model provides a stable molecular signature that can be interpreted in the light of the ingested prior knowledge and that can be used in transfer learning tasks.

Country

Switzerland

Related Organizations

ETH Zurich
Switzerland
IBM RESEARCH GMBH
Switzerland
IBM (United States)
United States

Keywords

FOS: Computer and information sciences, Molecular Networks (q-bio.MN), Computational Biology, Machine Learning (stat.ML), Article, Pattern Recognition, Automated, Machine Learning, Statistics - Machine Learning, FOS: Biological sciences, Biomarkers, Tumor, Humans, Quantitative Biology - Molecular Networks, multiple kernel learning, Algorithms, Software

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	26
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%