SAM-DTA: a sequence-agnostic model for drug–target binding affinity prediction

descriptionPublicationkeyboard_double_arrow_right Article 21 Dec 2022 English Publisher:Oxford University Press (OUP)Journal:Briefings in Bioinformatics, volume 24 (issn: 1467-5463, eissn: 1477-4054,

Copyright policy )

Authors: Zhiqiang Hu; Wenfeng Liu; Chenbin Zhang; Jiawen Huang; Shaoting Zhang 0001; Huiqun Yu; Yi Xiong 0002; +3 Authors

doi: 10.1093/bib/bbac533

pmid: 36545795

SAM-DTA: a sequence-agnostic model for drug–target binding affinity prediction

- Summary
- Subjects
- Metrics

Abstract

Abstract Drug–target binding affinity prediction is a fundamental task for drug discovery and has been studied for decades. Most methods follow the canonical paradigm that processes the inputs of the protein (target) and the ligand (drug) separately and then combines them together. In this study we demonstrate, surprisingly, that a model is able to achieve even superior performance without access to any protein-sequence-related information. Instead, a protein is characterized completely by the ligands that it interacts. Specifically, we treat different proteins separately, which are jointly trained in a multi-head manner, so as to learn a robust and universal representation of ligands that is generalizable across proteins. Empirical evidences show that the novel paradigm outperforms its competitive sequence-based counterpart, with the Mean Squared Error (MSE) of 0.4261 versus 0.7612 and the R-Square of 0.7984 versus 0.6570 compared with DeepAffinity. We also investigate the transfer learning scenario where unseen proteins are encountered after the initial training, and the cross-dataset evaluation for prospective studies. The results reveals the robustness of the proposed model in generalizing to unseen proteins as well as in predicting future data. Source codes and data are available at https://github.com/huzqatpku/SAM-DTA.

Related Organizations

Shanghai Jiao Tong University
China (People's Republic of)
Shanghai Artificial Intelligence Laboratory
China (People's Republic of)
East China University of Science and Technology
China (People's Republic of)

Keywords

Proteins, Prospective Studies, Amino Acid Sequence, Ligands, Software, Protein Binding

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	10
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

10

Top 10%

Average

Top 10%

hybrid

Fields of Science (4) View all

engineering and technology

medical engineering

Fields of Science

engineering and technology

medical engineering

View all