<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

Integration of human cell lines gene expression and chemical properties of drugs for Drug Induced Liver Injury prediction

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 01 Apr 2020 English Publisher:Springer Science and Business Media LLCJournal:Biology Direct, volume 16 (eissn: 1745-6150,

Authors: Wojciech Lesiński; Krzysztof Mnich; Agnieszka Kitlas Golińska; Witold R. Rudnicki;

doi: 10.1186/s13062-020-00286-z , 10.21203/rs.3.rs-19658/v1

pmid: 33422118

pmc: PMC7796564

Integration of human cell lines gene expression and chemical properties of drugs for Drug Induced Liver Injury prediction

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

Abstract Motivation Drug-induced liver injury (DILI) is one of the primary problems in drug development. Early prediction of DILI can bring a significant reduction in the cost of clinical trials. In this work we examined whether occurrence of DILI can be predicted using gene expression profile in cancer cell lines and chemical properties of drugs. Methods We used gene expression profiles from 13 human cell lines, as well as molecular properties of drugs to build Machine Learning models of DILI. To this end, we have used a robust cross-validated protocol based on feature selection and Random Forest algorithm. In this protocol we first identify the most informative variables and then use them to build predictive models. The models are first built using data from single cell lines, and chemical properties. Then they are integrated using Super Learner method with several underlying methods for integration. The entire modelling process is performed using nested cross-validation. Results We have obtained weakly predictive ML models when using either molecular descriptors, or some individual cell lines (AUC ∈(0.55−0.61)). Models obtained with the Super Learner approach have a significantly improved accuracy (AUC=0.73), which allows to divide substances in two categories: low-risk and high-risk.

Related Organizations

Institute of Computer Science
Poland
Polish Academy of Sciences
Poland
Uniwersytet w Białymstoku (University of Bialystok)
Poland
University of Białystok
Poland
UNIWERSYTET W BIALYMSTOKU
Poland

Keywords

QH301-705.5, Research, Risk Assessment, Cell Line, Machine Learning, Machine learning, Drug Discovery, Humans, Data integration, Biology (General), Chemical and Drug Induced Liver Injury, Transcriptome, Algorithms, Random forest

2 Research products, page of 1

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	8
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%