A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge

Abstract Discovering a meaningful symbolic expression that explains experimental data is a fundamental challenge in many scientific fields. We present a novel, open-source computational framework called Scientist-Machine Equation Detector (SciMED), which integrates scientific discipline wisdom in a...

Full description

Bibliographic Details
Main Authors: Liron Simon Keren, Alex Liberzon, Teddy Lazebnik
Format: Article
Language:English
Published: Nature Portfolio 2023-01-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-28328-2
_version_ 1811175923082330112
author Liron Simon Keren
Alex Liberzon
Teddy Lazebnik
author_facet Liron Simon Keren
Alex Liberzon
Teddy Lazebnik
author_sort Liron Simon Keren
collection DOAJ
description Abstract Discovering a meaningful symbolic expression that explains experimental data is a fundamental challenge in many scientific fields. We present a novel, open-source computational framework called Scientist-Machine Equation Detector (SciMED), which integrates scientific discipline wisdom in a scientist-in-the-loop approach, with state-of-the-art symbolic regression (SR) methods. SciMED combines a wrapper selection method, that is based on a genetic algorithm, with automatic machine learning and two levels of SR methods. We test SciMED on five configurations of a settling sphere, with and without aerodynamic non-linear drag force, and with excessive noise in the measurements. We show that SciMED is sufficiently robust to discover the correct physically meaningful symbolic expressions from the data, and demonstrate how the integration of domain knowledge enhances its performance. Our results indicate better performance on these tasks than the state-of-the-art SR software packages , even in cases where no knowledge is integrated. Moreover, we demonstrate how SciMED can alert the user about possible missing features, unlike the majority of current SR systems.
first_indexed 2024-04-10T19:43:50Z
format Article
id doaj.art-1a798436a5ae4f43ad7a7d55dd4901b7
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-04-10T19:43:50Z
publishDate 2023-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-1a798436a5ae4f43ad7a7d55dd4901b72023-01-29T12:08:41ZengNature PortfolioScientific Reports2045-23222023-01-0113111710.1038/s41598-023-28328-2A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledgeLiron Simon Keren0Alex Liberzon1Teddy Lazebnik2Turbulence Structure Laboratory, School of Mechanical Engineering, Tel Aviv UniversityTurbulence Structure Laboratory, School of Mechanical Engineering, Tel Aviv UniversityDepartment of Cancer Biology, Cancer Institute, University College LondonAbstract Discovering a meaningful symbolic expression that explains experimental data is a fundamental challenge in many scientific fields. We present a novel, open-source computational framework called Scientist-Machine Equation Detector (SciMED), which integrates scientific discipline wisdom in a scientist-in-the-loop approach, with state-of-the-art symbolic regression (SR) methods. SciMED combines a wrapper selection method, that is based on a genetic algorithm, with automatic machine learning and two levels of SR methods. We test SciMED on five configurations of a settling sphere, with and without aerodynamic non-linear drag force, and with excessive noise in the measurements. We show that SciMED is sufficiently robust to discover the correct physically meaningful symbolic expressions from the data, and demonstrate how the integration of domain knowledge enhances its performance. Our results indicate better performance on these tasks than the state-of-the-art SR software packages , even in cases where no knowledge is integrated. Moreover, we demonstrate how SciMED can alert the user about possible missing features, unlike the majority of current SR systems.https://doi.org/10.1038/s41598-023-28328-2
spellingShingle Liron Simon Keren
Alex Liberzon
Teddy Lazebnik
A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge
Scientific Reports
title A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge
title_full A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge
title_fullStr A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge
title_full_unstemmed A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge
title_short A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge
title_sort computational framework for physics informed symbolic regression with straightforward integration of domain knowledge
url https://doi.org/10.1038/s41598-023-28328-2
work_keys_str_mv AT lironsimonkeren acomputationalframeworkforphysicsinformedsymbolicregressionwithstraightforwardintegrationofdomainknowledge
AT alexliberzon acomputationalframeworkforphysicsinformedsymbolicregressionwithstraightforwardintegrationofdomainknowledge
AT teddylazebnik acomputationalframeworkforphysicsinformedsymbolicregressionwithstraightforwardintegrationofdomainknowledge
AT lironsimonkeren computationalframeworkforphysicsinformedsymbolicregressionwithstraightforwardintegrationofdomainknowledge
AT alexliberzon computationalframeworkforphysicsinformedsymbolicregressionwithstraightforwardintegrationofdomainknowledge
AT teddylazebnik computationalframeworkforphysicsinformedsymbolicregressionwithstraightforwardintegrationofdomainknowledge