Optical Property Prediction and Molecular Discovery through Multi-Fidelity Deep Learning and Computational Chemistry

Optical properties are crucial for the design of molecules for numerous applications, including for display technologies and biological imaging. The accurate prediction of these properties has been the subject of decades of work in both physics-based approaches and statistical modeling. Recently, la...

Full description

Bibliographic Details
Main Author: Greenman, Kevin P.
Other Authors: Gómez-Bombarelli, Rafael
Format: Thesis
Published: Massachusetts Institute of Technology 2024
Online Access:https://hdl.handle.net/1721.1/155385
_version_ 1811069516303564800
author Greenman, Kevin P.
author2 Gómez-Bombarelli, Rafael
author_facet Gómez-Bombarelli, Rafael
Greenman, Kevin P.
author_sort Greenman, Kevin P.
collection MIT
description Optical properties are crucial for the design of molecules for numerous applications, including for display technologies and biological imaging. The accurate prediction of these properties has been the subject of decades of work in both physics-based approaches and statistical modeling. Recently, large datasets of both computed and experimental optical properties have become available, along with the advent of powerful deep learning approaches cable of learning meaningful representations from these large datasets. This thesis presents new approaches for predicting optical properties by fusing the experimental and computational data in multi-fidelity models that achieve greater accuracy and generalizability than previous methods. Additionally, it conducts a thorough benchmark of various strategies for handling multi-fidelity data to inform the modeling choices of future practitioners working with optical properties and beyond. Despite the greater availability of optical property data recently, the near-infrared (NIR) region of the spectrum remains more data-sparse despite its promise in many applications. This thesis demonstrates the shortcomings of existing methods for predicting optical properties in this region of chemical space and recommends best practices for future research in this area. Finally, this thesis highlights successful usage of data-driven optical property prediction for the discovery of novel molecules for specific applications.
first_indexed 2024-09-23T08:11:41Z
format Thesis
id mit-1721.1/155385
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T08:11:41Z
publishDate 2024
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1553852024-07-03T16:15:51Z Optical Property Prediction and Molecular Discovery through Multi-Fidelity Deep Learning and Computational Chemistry Greenman, Kevin P. Gómez-Bombarelli, Rafael Green, William H. Massachusetts Institute of Technology. Department of Chemical Engineering Massachusetts Institute of Technology. Center for Computational Science and Engineering Optical properties are crucial for the design of molecules for numerous applications, including for display technologies and biological imaging. The accurate prediction of these properties has been the subject of decades of work in both physics-based approaches and statistical modeling. Recently, large datasets of both computed and experimental optical properties have become available, along with the advent of powerful deep learning approaches cable of learning meaningful representations from these large datasets. This thesis presents new approaches for predicting optical properties by fusing the experimental and computational data in multi-fidelity models that achieve greater accuracy and generalizability than previous methods. Additionally, it conducts a thorough benchmark of various strategies for handling multi-fidelity data to inform the modeling choices of future practitioners working with optical properties and beyond. Despite the greater availability of optical property data recently, the near-infrared (NIR) region of the spectrum remains more data-sparse despite its promise in many applications. This thesis demonstrates the shortcomings of existing methods for predicting optical properties in this region of chemical space and recommends best practices for future research in this area. Finally, this thesis highlights successful usage of data-driven optical property prediction for the discovery of novel molecules for specific applications. Ph.D. 2024-06-27T19:49:42Z 2024-06-27T19:49:42Z 2024-05 2024-05-16T14:01:51.376Z Thesis https://hdl.handle.net/1721.1/155385 0000-0002-6466-1401 Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) Copyright retained by author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Greenman, Kevin P.
Optical Property Prediction and Molecular Discovery through Multi-Fidelity Deep Learning and Computational Chemistry
title Optical Property Prediction and Molecular Discovery through Multi-Fidelity Deep Learning and Computational Chemistry
title_full Optical Property Prediction and Molecular Discovery through Multi-Fidelity Deep Learning and Computational Chemistry
title_fullStr Optical Property Prediction and Molecular Discovery through Multi-Fidelity Deep Learning and Computational Chemistry
title_full_unstemmed Optical Property Prediction and Molecular Discovery through Multi-Fidelity Deep Learning and Computational Chemistry
title_short Optical Property Prediction and Molecular Discovery through Multi-Fidelity Deep Learning and Computational Chemistry
title_sort optical property prediction and molecular discovery through multi fidelity deep learning and computational chemistry
url https://hdl.handle.net/1721.1/155385
work_keys_str_mv AT greenmankevinp opticalpropertypredictionandmoleculardiscoverythroughmultifidelitydeeplearningandcomputationalchemistry