Active learning for optimal intervention design in causal models

Sequential experimental design to discover interventions that achieve a desired outcome is a key problem in various domains including science, engineering and public policy. When the space of possible interventions is large, making an exhaustive search infeasible, experimental design strategies are...

Full description

Bibliographic Details
Main Authors:	Zhang, Jiaqi, Cammarata, Louis, Squires, Chandler, Sapsis, Themistoklis P., Uhler, Caroline
Other Authors:	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
Format:	Article
Language:	English
Published:	Springer Science and Business Media LLC 2024
Subjects:	Artificial Intelligence Computer Networks and Communications Computer Vision and Pattern Recognition Human-Computer Interaction Software
Online Access:	https://hdl.handle.net/1721.1/154216

_version_	1824458300396142592
author	Zhang, Jiaqi Cammarata, Louis Squires, Chandler Sapsis, Themistoklis P. Uhler, Caroline
author2	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
author_facet	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems Zhang, Jiaqi Cammarata, Louis Squires, Chandler Sapsis, Themistoklis P. Uhler, Caroline
author_sort	Zhang, Jiaqi
collection	MIT
description	Sequential experimental design to discover interventions that achieve a desired outcome is a key problem in various domains including science, engineering and public policy. When the space of possible interventions is large, making an exhaustive search infeasible, experimental design strategies are needed. In this context, encoding the causal relationships between the variables, and thus the effect of interventions on the system, is critical for identifying desirable interventions more efficiently. Here we develop a causal active learning strategy to identify interventions that are optimal, as measured by the discrepancy between the post-interventional mean of the distribution and a desired target mean. The approach employs a Bayesian update for the causal model and prioritizes interventions using a carefully designed, causally informed acquisition function. This acquisition function is evaluated in closed form, allowing for fast optimization. The resulting algorithms are theoretically grounded with information-theoretic bounds and provable consistency results for linear causal models with known causal graph. We apply our approach to both synthetic data and single-cell transcriptomic data from Perturb–CITE-sequencing experiments to identify optimal perturbations that induce a specific cell-state transition. The causally informed acquisition function generally outperforms existing criteria, allowing for optimal intervention design with fewer but carefully selected samples.
first_indexed	2024-09-23T14:46:22Z
format	Article
id	mit-1721.1/154216
institution	Massachusetts Institute of Technology
language	English
last_indexed	2025-02-19T04:23:42Z
publishDate	2024
publisher	Springer Science and Business Media LLC
record_format	dspace
spelling	mit-1721.1/1542162025-01-02T05:08:31Z Active learning for optimal intervention design in causal models Zhang, Jiaqi Cammarata, Louis Squires, Chandler Sapsis, Themistoklis P. Uhler, Caroline Massachusetts Institute of Technology. Laboratory for Information and Decision Systems Statistics and Data Science Center (Massachusetts Institute of Technology) Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology. Institute for Data, Systems, and Society Artificial Intelligence Computer Networks and Communications Computer Vision and Pattern Recognition Human-Computer Interaction Software Sequential experimental design to discover interventions that achieve a desired outcome is a key problem in various domains including science, engineering and public policy. When the space of possible interventions is large, making an exhaustive search infeasible, experimental design strategies are needed. In this context, encoding the causal relationships between the variables, and thus the effect of interventions on the system, is critical for identifying desirable interventions more efficiently. Here we develop a causal active learning strategy to identify interventions that are optimal, as measured by the discrepancy between the post-interventional mean of the distribution and a desired target mean. The approach employs a Bayesian update for the causal model and prioritizes interventions using a carefully designed, causally informed acquisition function. This acquisition function is evaluated in closed form, allowing for fast optimization. The resulting algorithms are theoretically grounded with information-theoretic bounds and provable consistency results for linear causal models with known causal graph. We apply our approach to both synthetic data and single-cell transcriptomic data from Perturb–CITE-sequencing experiments to identify optimal perturbations that induce a specific cell-state transition. The causally informed acquisition function generally outperforms existing criteria, allowing for optimal intervention design with fewer but carefully selected samples. 2024-04-18T17:10:24Z 2024-04-18T17:10:24Z 2023-10-02 2024-04-18T17:05:58Z Article http://purl.org/eprint/type/JournalArticle 2522-5839 https://hdl.handle.net/1721.1/154216 Zhang, J., Cammarata, L., Squires, C. et al. Active learning for optimal intervention design in causal models. Nat Mach Intell 5, 1066–1075 (2023). en 10.1038/s42256-023-00719-0 Nature Machine Intelligence Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. application/pdf Springer Science and Business Media LLC Springer Science and Business Media LLC
spellingShingle	Artificial Intelligence Computer Networks and Communications Computer Vision and Pattern Recognition Human-Computer Interaction Software Zhang, Jiaqi Cammarata, Louis Squires, Chandler Sapsis, Themistoklis P. Uhler, Caroline Active learning for optimal intervention design in causal models
title	Active learning for optimal intervention design in causal models
title_full	Active learning for optimal intervention design in causal models
title_fullStr	Active learning for optimal intervention design in causal models
title_full_unstemmed	Active learning for optimal intervention design in causal models
title_short	Active learning for optimal intervention design in causal models
title_sort	active learning for optimal intervention design in causal models
topic	Artificial Intelligence Computer Networks and Communications Computer Vision and Pattern Recognition Human-Computer Interaction Software
url	https://hdl.handle.net/1721.1/154216
work_keys_str_mv	AT zhangjiaqi activelearningforoptimalinterventiondesignincausalmodels AT cammaratalouis activelearningforoptimalinterventiondesignincausalmodels AT squireschandler activelearningforoptimalinterventiondesignincausalmodels AT sapsisthemistoklisp activelearningforoptimalinterventiondesignincausalmodels AT uhlercaroline activelearningforoptimalinterventiondesignincausalmodels

Active learning for optimal intervention design in causal models

Similar Items