Distilling identifiable and interpretable dynamic models from biological data.

Mechanistic dynamical models allow us to study the behavior of complex biological systems. They can provide an objective and quantitative understanding that would be difficult to achieve through other means. However, the systematic development of these models is a non-trivial exercise and an open pr...

Full description

Bibliographic Details
Main Authors: Gemma Massonis, Alejandro F Villaverde, Julio R Banga
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-10-01
Series:PLoS Computational Biology
Online Access:https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011014&type=printable
_version_ 1797640246732521472
author Gemma Massonis
Alejandro F Villaverde
Julio R Banga
author_facet Gemma Massonis
Alejandro F Villaverde
Julio R Banga
author_sort Gemma Massonis
collection DOAJ
description Mechanistic dynamical models allow us to study the behavior of complex biological systems. They can provide an objective and quantitative understanding that would be difficult to achieve through other means. However, the systematic development of these models is a non-trivial exercise and an open problem in computational biology. Currently, many research efforts are focused on model discovery, i.e. automating the development of interpretable models from data. One of the main frameworks is sparse regression, where the sparse identification of nonlinear dynamics (SINDy) algorithm and its variants have enjoyed great success. SINDy-PI is an extension which allows the discovery of rational nonlinear terms, thus enabling the identification of kinetic functions common in biochemical networks, such as Michaelis-Menten. SINDy-PI also pays special attention to the recovery of parsimonious models (Occam's razor). Here we focus on biological models composed of sets of deterministic nonlinear ordinary differential equations. We present a methodology that, combined with SINDy-PI, allows the automatic discovery of structurally identifiable and observable models which are also mechanistically interpretable. The lack of structural identifiability and observability makes it impossible to uniquely infer parameter and state variables, which can compromise the usefulness of a model by distorting its mechanistic significance and hampering its ability to produce biological insights. We illustrate the performance of our method with six case studies. We find that, despite enforcing sparsity, SINDy-PI sometimes yields models that are unidentifiable. In these cases we show how our method transforms their equations in order to obtain a structurally identifiable and observable model which is also interpretable.
first_indexed 2024-03-11T13:28:06Z
format Article
id doaj.art-1225c4bb0dd74c2c8a4a56f9d90b87f3
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-03-11T13:28:06Z
publishDate 2023-10-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-1225c4bb0dd74c2c8a4a56f9d90b87f32023-11-03T05:31:43ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582023-10-011910e101101410.1371/journal.pcbi.1011014Distilling identifiable and interpretable dynamic models from biological data.Gemma MassonisAlejandro F VillaverdeJulio R BangaMechanistic dynamical models allow us to study the behavior of complex biological systems. They can provide an objective and quantitative understanding that would be difficult to achieve through other means. However, the systematic development of these models is a non-trivial exercise and an open problem in computational biology. Currently, many research efforts are focused on model discovery, i.e. automating the development of interpretable models from data. One of the main frameworks is sparse regression, where the sparse identification of nonlinear dynamics (SINDy) algorithm and its variants have enjoyed great success. SINDy-PI is an extension which allows the discovery of rational nonlinear terms, thus enabling the identification of kinetic functions common in biochemical networks, such as Michaelis-Menten. SINDy-PI also pays special attention to the recovery of parsimonious models (Occam's razor). Here we focus on biological models composed of sets of deterministic nonlinear ordinary differential equations. We present a methodology that, combined with SINDy-PI, allows the automatic discovery of structurally identifiable and observable models which are also mechanistically interpretable. The lack of structural identifiability and observability makes it impossible to uniquely infer parameter and state variables, which can compromise the usefulness of a model by distorting its mechanistic significance and hampering its ability to produce biological insights. We illustrate the performance of our method with six case studies. We find that, despite enforcing sparsity, SINDy-PI sometimes yields models that are unidentifiable. In these cases we show how our method transforms their equations in order to obtain a structurally identifiable and observable model which is also interpretable.https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011014&type=printable
spellingShingle Gemma Massonis
Alejandro F Villaverde
Julio R Banga
Distilling identifiable and interpretable dynamic models from biological data.
PLoS Computational Biology
title Distilling identifiable and interpretable dynamic models from biological data.
title_full Distilling identifiable and interpretable dynamic models from biological data.
title_fullStr Distilling identifiable and interpretable dynamic models from biological data.
title_full_unstemmed Distilling identifiable and interpretable dynamic models from biological data.
title_short Distilling identifiable and interpretable dynamic models from biological data.
title_sort distilling identifiable and interpretable dynamic models from biological data
url https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011014&type=printable
work_keys_str_mv AT gemmamassonis distillingidentifiableandinterpretabledynamicmodelsfrombiologicaldata
AT alejandrofvillaverde distillingidentifiableandinterpretabledynamicmodelsfrombiologicaldata
AT juliorbanga distillingidentifiableandinterpretabledynamicmodelsfrombiologicaldata