<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction

Name: FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction
Keywords: FOS: Computer and information sciences, Computer Science - Machine Learning, J.3, Computer Science - Artificial Intelligence, Biomolecules (q-bio.BM), Quantitative Biology - Quantitative Methods, I.2.1, Machine Learning (cs.LG), Binding affinity, Artificial Intelligence (cs.AI)

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2024Embargo end date: 01 Jan 2024Publisher:arXivJournal:ArXiv (issn: 2331-8422,

Authors: Morehead, Alex; Cheng, Jianlin;

doi: 10.48550/arxiv.2412.10966 , 10.5281/zenodo.14478458 , 10.5281/zenodo.14660031 , 10.5281/zenodo.14478459 , 10.5281/zenodo.15066450 , 10.5281/zenodo.15066390

pmid: 39876941

pmc: PMC11774451

arXiv: http://arxiv.org/abs/2412.10966

FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction

- Summary
- Subjects
- Metrics

Abstract

Powerful generative AI models of protein-ligand structure have recently been proposed, but few of these methods support both flexible protein-ligand docking and affinity estimation. Of those that do, none can directly model multiple binding ligands concurrently or have been rigorously benchmarked on pharmacologically relevant drug targets, hindering their widespread adoption in drug discovery efforts. In this work, we propose FlowDock, the first deep geometric generative model based on conditional flow matching that learns to directly map unbound (apo) structures to their bound (holo) counterparts for an arbitrary number of binding ligands. Furthermore, FlowDock provides predicted structural confidence scores and binding affinity values with each of its generated protein-ligand complex structures, enabling fast virtual screening of new (multi-ligand) drug targets. For the well-known PoseBusters Benchmark dataset, FlowDock outperforms single-sequence AlphaFold 3 with a 51% blind docking success rate using unbound (apo) protein input structures and without any information derived from multiple sequence alignments, and for the challenging new DockGen-E dataset, FlowDock outperforms single-sequence AlphaFold 3 and matches single-sequence Chai-1 for binding pocket generalization. Additionally, in the ligand category of the 16th community-wide Critical Assessment of Techniques for Structure Prediction (CASP16), FlowDock ranked among the top-5 methods for pharmacological binding affinity estimation across 140 protein-ligand complexes, demonstrating the efficacy of its learned representations in virtual screening. Source code, data, and pre-trained models are available at https://github.com/BioinfoMachineLearning/FlowDock.

15 pages, 2 tables, 2 algorithms, 11 figures. Code, data, pre-trained models, and baseline method predictions are available at https://github.com/BioinfoMachineLearning/FlowDock

Related Organizations

University of Missouri
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, J.3, Computer Science - Artificial Intelligence, Biomolecules (q-bio.BM), Quantitative Biology - Quantitative Methods, I.2.1, Machine Learning (cs.LG), Binding affinity, Artificial Intelligence (cs.AI), Quantitative Biology - Biomolecules, FOS: Biological sciences, Generative modeling, Flow matching, Protein-ligand structure, I.2.1; J.3, Quantitative Methods (q-bio.QM)

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green