Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations

descriptionPublicationkeyboard_double_arrow_right Preprint , Article 01 Jan 2022Embargo end date: 01 Jan 2022Publisher:ZenodoJournal:CoRR, volume abs/2210.07237

Authors: Xiang Fu 0005; Zhenghao Wu; Wujie Wang; Tian Xie 0002; Sinan Keten; Rafael Gómez-Bombarelli; Tommi S. Jaakkola;

doi: 10.5281/zenodo.7196767 , 10.5281/zenodo.7196579 , 10.5281/zenodo.7196578 , 10.48550/arxiv.2210.07237

arXiv: 2210.07237

Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations

- Summary
- Subjects
- Metrics

Abstract

Molecular dynamics (MD) simulation techniques are widely used for various natural science applications. Increasingly, machine learning (ML) force field (FF) models begin to replace ab-initio simulations by predicting forces directly from atomic structures. Despite significant progress in this area, such techniques are primarily benchmarked by their force/energy prediction errors, even though the practical use case would be to produce realistic MD trajectories. We aim to fill this gap by introducing a novel benchmark suite for learned MD simulation. We curate representative MD systems, including water, organic molecules, a peptide, and materials, and design evaluation metrics corresponding to the scientific objectives of respective systems. We benchmark a collection of state-of-the-art (SOTA) ML FF models and illustrate, in particular, how the commonly benchmarked force accuracy is not well aligned with relevant simulation metrics. We demonstrate when and how selected SOTA methods fail, along with offering directions for further improvement. Specifically, we identify stability as a key metric for ML models to improve. Our benchmark suite comes with a comprehensive open-source codebase for training and simulation with ML FFs to facilitate future work.

31 pages, 18 figures

Related Organizations

Massachusetts Institute of Technology
United States
Northwestern University
United States
Microsoft Research (United Kingdom)
United Kingdom

Keywords

Chemical Physics (physics.chem-ph), FOS: Computer and information sciences, Computer Science - Machine Learning, FOS: Physical sciences, Computational Physics (physics.comp-ph), molecular dynamics, machine learning force fields, Machine Learning (cs.LG), Physics - Chemical Physics, machine learning potentials, Physics - Computational Physics

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	24
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%