Transfer learning for drug–target interaction prediction

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 01 Jun 2023 United Kingdom, Turkey English Publisher:Oxford University Press (OUP)Journal:Bioinformatics, volume 39, pages i103-i110 (issn: 1367-4803, eissn: 1367-4811,

Copyright policy )

Authors: Alperen Dalkiran; Ahmet Atakan; Ahmet Süreyya Rifaioglu; Maria Jesus Martin; Rengül Çetin-Atalay; Aybar C. Acar; Tunca Dogan; +1 Authors

doi: 10.1093/bioinformatics/btad234

pmid: 37387156

pmc: PMC10311347

handle: 20.500.11820/53499dc7-0eba-43b9-b780-788df05f2c39

Transfer learning for drug–target interaction prediction

- Summary
- Subjects
- Metrics

Abstract

Abstract Motivation Utilizing AI-driven approaches for drug–target interaction (DTI) prediction require large volumes of training data which are not available for the majority of target proteins. In this study, we investigate the use of deep transfer learning for the prediction of interactions between drug candidate compounds and understudied target proteins with scarce training data. The idea here is to first train a deep neural network classifier with a generalized source training dataset of large size and then to reuse this pre-trained neural network as an initial configuration for re-training/fine-tuning purposes with a small-sized specialized target training dataset. To explore this idea, we selected six protein families that have critical importance in biomedicine: kinases, G-protein-coupled receptors (GPCRs), ion channels, nuclear receptors, proteases, and transporters. In two independent experiments, the protein families of transporters and nuclear receptors were individually set as the target datasets, while the remaining five families were used as the source datasets. Several size-based target family training datasets were formed in a controlled manner to assess the benefit provided by the transfer learning approach. Results Here, we present a systematic evaluation of our approach by pre-training a feed-forward neural network with source training datasets and applying different modes of transfer learning from the pre-trained source network to a target dataset. The performance of deep transfer learning is evaluated and compared with that of training the same deep neural network from scratch. We found that when the training dataset contains fewer than 100 compounds, transfer learning outperforms the conventional strategy of training the system from scratch, suggesting that transfer learning is advantageous for predicting binders to under-studied targets. Availability and implementation The source code and datasets are available at https://github.com/cansyl/TransferLearning4DTI. Our web-based service containing the ready-to-use pre-trained models is accessible at https://tl4dti.kansil.org.

Countries

United Kingdom, Turkey

Related Organizations

University of Chicago
United States
Adana Science and Technology University
Turkey
Middle East Technical University
Turkey
European Bioinformatics Institute
United Kingdom
University of Edinburgh
United Kingdom

View all View all

Keywords

Artificial neural network, Neural Networks, Peptide hydrolase, Chemoinformatics, Biomedical Informatics, Machine Learning, Computer, Drug Discovery, Chemistry - Protein Stucture, Folding & Modelling - Protein Folding, Machine learning, Topographic Mapping, Neural Networks, Computer, Software, Peptide Hydrolases

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	52
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%

Found an issue? Give us feedback

52

Top 1%

Top 10%

Top 1%

Green

gold

Related to Research communities

UArctic