Banner Banner

Crash Testing Machine Learning Force Fields for Molecules, Materials, and Interfaces: Model Analysis in the TEA Challenge 2023

Igor Poltavsky
Anton Charkin-Gorbulin
Mirela Puleva
Gregory Cordeiro Fonseca
Ilyes Batatia
Nicholas J. Browning
Stefan Chmiela
Mengnan Cui
J. Thorben Frank
Stefan Heinen
Bing Huang
Silvan Käser
Adil Kabylda
Danish Khan
Carolin Müller
Alastair J. A. Price
Kai Riedmiller
Kai Töpfer
Tsz Wai Ko
Markus Meuwly
Matthias Rupp
Gabor Csanyi
O. Anatole von Lilienfeld
Johannes T. Margraf
Klaus-Robert Müller
Alexandre Tkatchenko

September 27, 2024

Atomistic simulations are routinely employed in academia and industry to study the behavior of molecules, materials, and their interfaces. Central to these simulations are force fields (FF), whose development is challenged by intricate interatomic interactions at different spatio-temporal scales and the vast expanse of chemical space. Machine learning (ML) FFs, trained on quantum-mechanical energies and forces, have shown the capacity to achieve sub-kcal/(mol*A) accuracy while maintaining computational efficiency. The TEA Challenge 2023 rigorously evaluated commonly used MLFFs across diverse applications, highlighting their strengths and weaknesses. Participants trained their models using provided datasets, and the results were systematically analyzed to assess the MLFFs' ability to reproduce potential energy surfaces, handle incomplete reference data, manage multi-component systems, and model complex periodic structures. This publication describes the datasets, outlines the proposed challenges, and presents a detailed analysis of the accuracy, stability, and efficiency of the MACE, SO3krates, sGDML, SOAP/GAP, and FCHL19* architectures in molecular dynamics simulations. The models represent the MLFF developers who participated in the TEA Challenge 2023. All results presented correspond to the state of the ML architectures as of October 2023. A comprehensive analysis of the molecular dynamics results obtained with different MLFFs will be presented in the second part of this manuscript.