Interpretable Physics-informed Machine Learning Methods for Scientific Modeling and Data Analysis

With the recent advancement of modern machine learning methods, there are now many exciting opportunities to use machine learning in scientific research, including for modeling and data analysis. Machine learning has the potential to become an indispensable tool for scientific discovery, but it is o...

Full description

Bibliographic Details
Main Author: Lu, Peter Yucheng
Other Authors: Soljačić, Marin
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/150769
_version_ 1826200417556496384
author Lu, Peter Yucheng
author2 Soljačić, Marin
author_facet Soljačić, Marin
Lu, Peter Yucheng
author_sort Lu, Peter Yucheng
collection MIT
description With the recent advancement of modern machine learning methods, there are now many exciting opportunities to use machine learning in scientific research, including for modeling and data analysis. Machine learning has the potential to become an indispensable tool for scientific discovery, but it is often difficult to directly apply to scientific problems. Especially in the case of deep learning approaches, machine learning methods are often lacking in interpretability, robustness, out-of-distribution generalization, and data efficiency—all qualities that are necessary for many scientific and engineering applications. In this thesis, we will illustrate several approaches for addressing these issues using a variety of applications. First, we develop a physics-informed framework for partially observed system identification, showing how combining an encoder with a sparse symbolic model allows us to reconstruct unobserved hidden states as well as the exact governing equations. Then, we design a physics-informed deep representation learning architecture for analyzing spatiotemporal systems and demonstrate its ability to extract interpretable physical parameters, corresponding to uncontrolled variables, from time-series data. Finally, we use tools from optimal transport theory and manifold learning to develop a robust non-parametric method for discovering conservation laws, showing the advantage of using geometric machine learning methods to solve scientific problems. By designing physics-informed architectures and adapting representation learning methods for scientific applications, we can overcome many of the difficulties that are currently preventing machine learning from playing a more important role in scientific discovery and create more useful computational tools for scientists and engineers trying to analyze, understand, and model their data.
first_indexed 2024-09-23T11:36:04Z
format Thesis
id mit-1721.1/150769
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T11:36:04Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1507692023-05-18T03:32:18Z Interpretable Physics-informed Machine Learning Methods for Scientific Modeling and Data Analysis Lu, Peter Yucheng Soljačić, Marin Massachusetts Institute of Technology. Department of Physics With the recent advancement of modern machine learning methods, there are now many exciting opportunities to use machine learning in scientific research, including for modeling and data analysis. Machine learning has the potential to become an indispensable tool for scientific discovery, but it is often difficult to directly apply to scientific problems. Especially in the case of deep learning approaches, machine learning methods are often lacking in interpretability, robustness, out-of-distribution generalization, and data efficiency—all qualities that are necessary for many scientific and engineering applications. In this thesis, we will illustrate several approaches for addressing these issues using a variety of applications. First, we develop a physics-informed framework for partially observed system identification, showing how combining an encoder with a sparse symbolic model allows us to reconstruct unobserved hidden states as well as the exact governing equations. Then, we design a physics-informed deep representation learning architecture for analyzing spatiotemporal systems and demonstrate its ability to extract interpretable physical parameters, corresponding to uncontrolled variables, from time-series data. Finally, we use tools from optimal transport theory and manifold learning to develop a robust non-parametric method for discovering conservation laws, showing the advantage of using geometric machine learning methods to solve scientific problems. By designing physics-informed architectures and adapting representation learning methods for scientific applications, we can overcome many of the difficulties that are currently preventing machine learning from playing a more important role in scientific discovery and create more useful computational tools for scientists and engineers trying to analyze, understand, and model their data. Ph.D. 2023-05-17T17:41:33Z 2023-05-17T17:41:33Z 2022-09 2023-05-16T17:06:21.776Z Thesis https://hdl.handle.net/1721.1/150769 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Lu, Peter Yucheng
Interpretable Physics-informed Machine Learning Methods for Scientific Modeling and Data Analysis
title Interpretable Physics-informed Machine Learning Methods for Scientific Modeling and Data Analysis
title_full Interpretable Physics-informed Machine Learning Methods for Scientific Modeling and Data Analysis
title_fullStr Interpretable Physics-informed Machine Learning Methods for Scientific Modeling and Data Analysis
title_full_unstemmed Interpretable Physics-informed Machine Learning Methods for Scientific Modeling and Data Analysis
title_short Interpretable Physics-informed Machine Learning Methods for Scientific Modeling and Data Analysis
title_sort interpretable physics informed machine learning methods for scientific modeling and data analysis
url https://hdl.handle.net/1721.1/150769
work_keys_str_mv AT lupeteryucheng interpretablephysicsinformedmachinelearningmethodsforscientificmodelinganddataanalysis