Efficient interatomic descriptors for accurate machine learning force fields of extended molecules

Abstract Machine learning force fields (MLFFs) are gradually evolving towards enabling molecular dynamics simulations of molecules and materials with ab initio accuracy but at a small fraction of the computational cost. However, several challenges remain to be addressed to enable predictive MLFF sim...

Full description

Bibliographic Details
Main Authors: Adil Kabylda, Valentin Vassilev-Galindo, Stefan Chmiela, Igor Poltavsky, Alexandre Tkatchenko
Format: Article
Language:English
Published: Nature Portfolio 2023-06-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-023-39214-w
_version_ 1797778826812456960
author Adil Kabylda
Valentin Vassilev-Galindo
Stefan Chmiela
Igor Poltavsky
Alexandre Tkatchenko
author_facet Adil Kabylda
Valentin Vassilev-Galindo
Stefan Chmiela
Igor Poltavsky
Alexandre Tkatchenko
author_sort Adil Kabylda
collection DOAJ
description Abstract Machine learning force fields (MLFFs) are gradually evolving towards enabling molecular dynamics simulations of molecules and materials with ab initio accuracy but at a small fraction of the computational cost. However, several challenges remain to be addressed to enable predictive MLFF simulations of realistic molecules, including: (1) developing efficient descriptors for non-local interatomic interactions, which are essential to capture long-range molecular fluctuations, and (2) reducing the dimensionality of the descriptors to enhance the applicability and interpretability of MLFFs. Here we propose an automatized approach to substantially reduce the number of interatomic descriptor features while preserving the accuracy and increasing the efficiency of MLFFs. To simultaneously address the two stated challenges, we illustrate our approach on the example of the global GDML MLFF. We found that non-local features (atoms separated by as far as 15 Å in studied systems) are crucial to retain the overall accuracy of the MLFF for peptides, DNA base pairs, fatty acids, and supramolecular complexes. Interestingly, the number of required non-local features in the reduced descriptors becomes comparable to the number of local interatomic features (those below 5 Å). These results pave the way to constructing global molecular MLFFs whose cost increases linearly, instead of quadratically, with system size.
first_indexed 2024-03-12T23:22:09Z
format Article
id doaj.art-4aaf3e78e81044debdb27aae42abcef1
institution Directory Open Access Journal
issn 2041-1723
language English
last_indexed 2024-03-12T23:22:09Z
publishDate 2023-06-01
publisher Nature Portfolio
record_format Article
series Nature Communications
spelling doaj.art-4aaf3e78e81044debdb27aae42abcef12023-07-16T11:21:05ZengNature PortfolioNature Communications2041-17232023-06-0114111210.1038/s41467-023-39214-wEfficient interatomic descriptors for accurate machine learning force fields of extended moleculesAdil Kabylda0Valentin Vassilev-Galindo1Stefan Chmiela2Igor Poltavsky3Alexandre Tkatchenko4Department of Physics and Materials Science, University of LuxembourgDepartment of Physics and Materials Science, University of LuxembourgMachine Learning Group, Technische Universität BerlinDepartment of Physics and Materials Science, University of LuxembourgDepartment of Physics and Materials Science, University of LuxembourgAbstract Machine learning force fields (MLFFs) are gradually evolving towards enabling molecular dynamics simulations of molecules and materials with ab initio accuracy but at a small fraction of the computational cost. However, several challenges remain to be addressed to enable predictive MLFF simulations of realistic molecules, including: (1) developing efficient descriptors for non-local interatomic interactions, which are essential to capture long-range molecular fluctuations, and (2) reducing the dimensionality of the descriptors to enhance the applicability and interpretability of MLFFs. Here we propose an automatized approach to substantially reduce the number of interatomic descriptor features while preserving the accuracy and increasing the efficiency of MLFFs. To simultaneously address the two stated challenges, we illustrate our approach on the example of the global GDML MLFF. We found that non-local features (atoms separated by as far as 15 Å in studied systems) are crucial to retain the overall accuracy of the MLFF for peptides, DNA base pairs, fatty acids, and supramolecular complexes. Interestingly, the number of required non-local features in the reduced descriptors becomes comparable to the number of local interatomic features (those below 5 Å). These results pave the way to constructing global molecular MLFFs whose cost increases linearly, instead of quadratically, with system size.https://doi.org/10.1038/s41467-023-39214-w
spellingShingle Adil Kabylda
Valentin Vassilev-Galindo
Stefan Chmiela
Igor Poltavsky
Alexandre Tkatchenko
Efficient interatomic descriptors for accurate machine learning force fields of extended molecules
Nature Communications
title Efficient interatomic descriptors for accurate machine learning force fields of extended molecules
title_full Efficient interatomic descriptors for accurate machine learning force fields of extended molecules
title_fullStr Efficient interatomic descriptors for accurate machine learning force fields of extended molecules
title_full_unstemmed Efficient interatomic descriptors for accurate machine learning force fields of extended molecules
title_short Efficient interatomic descriptors for accurate machine learning force fields of extended molecules
title_sort efficient interatomic descriptors for accurate machine learning force fields of extended molecules
url https://doi.org/10.1038/s41467-023-39214-w
work_keys_str_mv AT adilkabylda efficientinteratomicdescriptorsforaccuratemachinelearningforcefieldsofextendedmolecules
AT valentinvassilevgalindo efficientinteratomicdescriptorsforaccuratemachinelearningforcefieldsofextendedmolecules
AT stefanchmiela efficientinteratomicdescriptorsforaccuratemachinelearningforcefieldsofextendedmolecules
AT igorpoltavsky efficientinteratomicdescriptorsforaccuratemachinelearningforcefieldsofextendedmolecules
AT alexandretkatchenko efficientinteratomicdescriptorsforaccuratemachinelearningforcefieldsofextendedmolecules