Experimental Design for Optimal Shift Intervention in Causal Model

Transforming a causal system from a given initial state to a desired target state is an important task permeating multiple fields including control theory, biology, and materials science. In causal models, such transformations can be achieved by performing a set of interventions. When the space of p...

Full description

Bibliographic Details
Main Author: Zhang, Jiaqi
Other Authors: Uhler, Caroline
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/147571
_version_ 1826196006273810432
author Zhang, Jiaqi
author2 Uhler, Caroline
author_facet Uhler, Caroline
Zhang, Jiaqi
author_sort Zhang, Jiaqi
collection MIT
description Transforming a causal system from a given initial state to a desired target state is an important task permeating multiple fields including control theory, biology, and materials science. In causal models, such transformations can be achieved by performing a set of interventions. When the space of possible interventions is large, making an exhaustive search infeasible, experimental design strategies are needed. In this context, encoding the causal relationships between the variables, and thus the effect of interventions on the system, is critical in order to identify desirable interventions more efficiently. In this thesis, we develop an iterative causal method to identify optimal interventions, as measured by the discrepancy between the post-interventional mean of the distribution and a desired target mean. We formulate an active learning strategy that uses the samples obtained so far from different interventions to update the belief about the underlying causal model, as well as to identify the samples that are most informative about optimal interventions and thus should be acquired in the next batch. The approach employs a Bayesian update for the causal model and prioritizes informative interventions using a carefully designed, causally informed acquisition function. Moreover, the introduced acquisition function is evaluated in closed form, allowing for efficient optimization. The resulting algorithms are also theoretically grounded with information-theoretic bounds and provable consistency results. We illustrate the method on both synthetic data and real-world biological data, more precisely gene expression data from Perturb-CITE-seq experiments. In this case the goal is to identify optimal perturbations to induce a specific cell state transition; the proposed causal approach is observed to achieve better sample efficiency compared to several baselines. In both cases we observe that the causally informed acquisition function notably outperforms existing criteria allowing for optimal intervention design with significantly less experiments.
first_indexed 2024-09-23T10:19:21Z
format Thesis
id mit-1721.1/147571
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T10:19:21Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1475712023-01-20T03:47:37Z Experimental Design for Optimal Shift Intervention in Causal Model Zhang, Jiaqi Uhler, Caroline Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Transforming a causal system from a given initial state to a desired target state is an important task permeating multiple fields including control theory, biology, and materials science. In causal models, such transformations can be achieved by performing a set of interventions. When the space of possible interventions is large, making an exhaustive search infeasible, experimental design strategies are needed. In this context, encoding the causal relationships between the variables, and thus the effect of interventions on the system, is critical in order to identify desirable interventions more efficiently. In this thesis, we develop an iterative causal method to identify optimal interventions, as measured by the discrepancy between the post-interventional mean of the distribution and a desired target mean. We formulate an active learning strategy that uses the samples obtained so far from different interventions to update the belief about the underlying causal model, as well as to identify the samples that are most informative about optimal interventions and thus should be acquired in the next batch. The approach employs a Bayesian update for the causal model and prioritizes informative interventions using a carefully designed, causally informed acquisition function. Moreover, the introduced acquisition function is evaluated in closed form, allowing for efficient optimization. The resulting algorithms are also theoretically grounded with information-theoretic bounds and provable consistency results. We illustrate the method on both synthetic data and real-world biological data, more precisely gene expression data from Perturb-CITE-seq experiments. In this case the goal is to identify optimal perturbations to induce a specific cell state transition; the proposed causal approach is observed to achieve better sample efficiency compared to several baselines. In both cases we observe that the causally informed acquisition function notably outperforms existing criteria allowing for optimal intervention design with significantly less experiments. S.M. 2023-01-19T19:59:25Z 2023-01-19T19:59:25Z 2022-09 2022-10-19T18:59:19.730Z Thesis https://hdl.handle.net/1721.1/147571 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Zhang, Jiaqi
Experimental Design for Optimal Shift Intervention in Causal Model
title Experimental Design for Optimal Shift Intervention in Causal Model
title_full Experimental Design for Optimal Shift Intervention in Causal Model
title_fullStr Experimental Design for Optimal Shift Intervention in Causal Model
title_full_unstemmed Experimental Design for Optimal Shift Intervention in Causal Model
title_short Experimental Design for Optimal Shift Intervention in Causal Model
title_sort experimental design for optimal shift intervention in causal model
url https://hdl.handle.net/1721.1/147571
work_keys_str_mv AT zhangjiaqi experimentaldesignforoptimalshiftinterventionincausalmodel