Beyond Force Metrics: Pre-Training MLFFs for Stable MD Simulations

๐Ÿ“… 2025-06-17
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the instability of machine-learned force fields (MLFFs) in molecular dynamics (MD) simulationsโ€”a critical issue where low force prediction error (e.g., mean absolute error, MAE) does not guarantee stable MD trajectories. We propose a pretraining-based solution: leveraging the GemNet-T graph neural network, pretrained on the large-scale OC20 dataset and subsequently fine-tuned on small-sample MD17 tasks. Our study provides the first empirical evidence that force MAE is not inherently correlated with MD trajectory stability. Pretraining significantly enhances physical consistency in force modeling: with only 5 meV/ร… force MAE, trajectory stability improves by a factor of three. This work challenges the conventional paradigm that relies solely on force accuracy as the evaluation metric for MLFFs, establishing pretraining as a key pathway to improving their physical robustness and long-term dynamical fidelity.

Technology Category

Application Category

๐Ÿ“ Abstract
Machine-learning force fields (MLFFs) have emerged as a promising solution for speeding up ab initio molecular dynamics (MD) simulations, where accurate force predictions are critical but often computationally expensive. In this work, we employ GemNet-T, a graph neural network model, as an MLFF and investigate two training strategies: (1) direct training on MD17 (10K samples) without pre-training, and (2) pre-training on the large-scale OC20 dataset followed by fine-tuning on MD17 (10K). While both approaches achieve low force mean absolute errors (MAEs), reaching 5 meV/A per atom, we find that lower force errors do not necessarily guarantee stable MD simulations. Notably, the pre-trained GemNet-T model yields significantly improved simulation stability, sustaining trajectories up to three times longer than the model trained from scratch. These findings underscore the value of pre-training on large, diverse datasets to capture complex molecular interactions and highlight that force MAE alone is not always a sufficient metric of MD simulation stability.
Problem

Research questions and friction points this paper is trying to address.

Improving stability of molecular dynamics simulations with MLFFs
Evaluating pre-training impact on force field accuracy
Assessing force MAE limitations in simulation stability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Pre-training GemNet-T on OC20 dataset
Fine-tuning model on MD17 dataset
Improving MD simulation stability significantly
๐Ÿ”Ž Similar Papers
No similar papers found.
S
Shagun Maheshwari
Department of Materials Science Engineering, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA
Janghoon Ock
Janghoon Ock
Assistant Professor, University of Nebraska-Lincoln
Computational CatalysisMaterial DiscoveryAI4Science
A
Adeesh Kolluru
Department of Chemical Engineering, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA
A
A. Farimani
Department of Mechanical Engineering, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA
J
John R. Kitchin
Department of Chemical Engineering, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA