Enabling Accurate and High-Throughput Kinetics Predictions via Message Passing Neural Networks

Quantitative estimates for kinetic properties, namely reaction barrier heights and reaction energies, are essential for developing kinetic mechanisms, predicting reaction outcomes, and optimizing chemical processes. While ab initio methods, such as quantum chemistry, can be incredibly useful for pro...

Full description

Bibliographic Details
Main Author: Spiekermann, Kevin A.
Other Authors: Green, William H.
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/153044
_version_ 1811072958534254592
author Spiekermann, Kevin A.
author2 Green, William H.
author_facet Green, William H.
Spiekermann, Kevin A.
author_sort Spiekermann, Kevin A.
collection MIT
description Quantitative estimates for kinetic properties, namely reaction barrier heights and reaction energies, are essential for developing kinetic mechanisms, predicting reaction outcomes, and optimizing chemical processes. While ab initio methods, such as quantum chemistry, can be incredibly useful for providing accurate kinetic data, their high computational cost severely limits their utility for large-scale applications. High-quality experimental data is often even more rare, and such approaches are less amenable to exploring the vastness of chemical space due to monetary cost, time, and safety considerations. Modern machine learning (ML) techniques offer a promising option since they can quickly provide estimates to narrow the search space for more expensive ab initio or experimental methods. Unfortunately, the paucity of reliable quantitative chemical reaction data to train such models has presented a major hindrance for these data-driven approaches. Here, this thesis focuses on the intersection of ML and quantum chemistry with the goal of enabling automatic high-fidelity predictions of kinetic parameters. The novel contributions can be grouped into three main categories: 1. Large-scale dataset generation, with an emphasis on high-quality methods and reaction diversity. Although much of the presented work studies reactions in the gas phase, this thesis also contributes a large dataset calculated in many popular solvents. 2. Train various ML models to quickly predict accurate kinetic parameters, which avoids the challenging task of finding transition state structures. Importantly, these models operate on simple input representations and hence are ideal for automated, high-throughput applications. 3. Provide best-practice guidelines and an open-source software package to improve the status quo of ML for chemistry research. The contributions from this thesis, and from similar work, will be essential for modern high-throughput workflows and the future of automated predictive chemistry.
first_indexed 2024-09-23T09:24:32Z
format Thesis
id mit-1721.1/153044
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T09:24:32Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1530442023-11-28T03:48:27Z Enabling Accurate and High-Throughput Kinetics Predictions via Message Passing Neural Networks Spiekermann, Kevin A. Green, William H. Massachusetts Institute of Technology. Department of Chemical Engineering Quantitative estimates for kinetic properties, namely reaction barrier heights and reaction energies, are essential for developing kinetic mechanisms, predicting reaction outcomes, and optimizing chemical processes. While ab initio methods, such as quantum chemistry, can be incredibly useful for providing accurate kinetic data, their high computational cost severely limits their utility for large-scale applications. High-quality experimental data is often even more rare, and such approaches are less amenable to exploring the vastness of chemical space due to monetary cost, time, and safety considerations. Modern machine learning (ML) techniques offer a promising option since they can quickly provide estimates to narrow the search space for more expensive ab initio or experimental methods. Unfortunately, the paucity of reliable quantitative chemical reaction data to train such models has presented a major hindrance for these data-driven approaches. Here, this thesis focuses on the intersection of ML and quantum chemistry with the goal of enabling automatic high-fidelity predictions of kinetic parameters. The novel contributions can be grouped into three main categories: 1. Large-scale dataset generation, with an emphasis on high-quality methods and reaction diversity. Although much of the presented work studies reactions in the gas phase, this thesis also contributes a large dataset calculated in many popular solvents. 2. Train various ML models to quickly predict accurate kinetic parameters, which avoids the challenging task of finding transition state structures. Importantly, these models operate on simple input representations and hence are ideal for automated, high-throughput applications. 3. Provide best-practice guidelines and an open-source software package to improve the status quo of ML for chemistry research. The contributions from this thesis, and from similar work, will be essential for modern high-throughput workflows and the future of automated predictive chemistry. Ph.D. 2023-11-27T15:23:06Z 2023-11-27T15:23:06Z 2023-09 2023-11-09T20:06:01.862Z Thesis https://hdl.handle.net/1721.1/153044 Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) Copyright retained by author(s) https://creativecommons.org/licenses/by-sa/4.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Spiekermann, Kevin A.
Enabling Accurate and High-Throughput Kinetics Predictions via Message Passing Neural Networks
title Enabling Accurate and High-Throughput Kinetics Predictions via Message Passing Neural Networks
title_full Enabling Accurate and High-Throughput Kinetics Predictions via Message Passing Neural Networks
title_fullStr Enabling Accurate and High-Throughput Kinetics Predictions via Message Passing Neural Networks
title_full_unstemmed Enabling Accurate and High-Throughput Kinetics Predictions via Message Passing Neural Networks
title_short Enabling Accurate and High-Throughput Kinetics Predictions via Message Passing Neural Networks
title_sort enabling accurate and high throughput kinetics predictions via message passing neural networks
url https://hdl.handle.net/1721.1/153044
work_keys_str_mv AT spiekermannkevina enablingaccurateandhighthroughputkineticspredictionsviamessagepassingneuralnetworks