Machine learning and causality: Building efficient, and reliable models for decision-making

We explore relationships between machine learning (ML) and causal inference. We focus on improvements in each by borrowing ideas from one another. ML has been successfully applied to many problems, but the lack of strong theoretical guarantees has led to many unexpected failures. Models that perf...

Full description

Bibliographic Details
Main Author:	Makar, Maggie
Other Authors:	Guttag, John V.
Format:	Thesis
Published:	Massachusetts Institute of Technology 2022
Online Access:	https://hdl.handle.net/1721.1/139131

_version_	1826194778868416512
author	Makar, Maggie
author2	Guttag, John V.
author_facet	Guttag, John V. Makar, Maggie
author_sort	Makar, Maggie
collection	MIT
description	We explore relationships between machine learning (ML) and causal inference. We focus on improvements in each by borrowing ideas from one another. ML has been successfully applied to many problems, but the lack of strong theoretical guarantees has led to many unexpected failures. Models that perform well on the training distribution tend to break down when applied to different distributions; small perturbations can “fool” the trained model and drastically change its predictions; arbitrary choices in the training algorithm lead to vastly different models; and so forth. On the other hand, while there has been tremendous progress in developing causal inference methods with strong theoretical guarantees, existing methods typically do not apply in practice since they assume an abundance of data. Working at the intersection of ML and causal inference, we directly address the lack of robustness in ML, and improve the statistical efficiency of causal inference techniques. The motivation behind the work presented in this thesis is to improve methods for building predictive, and causal models that are used to guide decision making. Throughout, we focus mostly on decision making in the healthcare context. On the ML for causality side, we use ML tools and analysis techniques to develop statistically efficient causal models that can guide clinicians when choosing between two treatments. On the causality for ML side, we study how knowledge of the causal mechanisms that generate observed data can be used to efficiently regularize predictive models without introducing biases. In a clinical context, we show how causal knowledge can be used to build robust, and accurate models to predict the spread of contagious infections. In a non-clinical setting, we study how to use causal knowledge to train models that are robust to distribution shifts in the context of image classification.
first_indexed	2024-09-23T10:01:40Z
format	Thesis
id	mit-1721.1/139131
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T10:01:40Z
publishDate	2022
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/1391312022-01-15T03:56:15Z Machine learning and causality: Building efficient, and reliable models for decision-making Makar, Maggie Guttag, John V. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science We explore relationships between machine learning (ML) and causal inference. We focus on improvements in each by borrowing ideas from one another. ML has been successfully applied to many problems, but the lack of strong theoretical guarantees has led to many unexpected failures. Models that perform well on the training distribution tend to break down when applied to different distributions; small perturbations can “fool” the trained model and drastically change its predictions; arbitrary choices in the training algorithm lead to vastly different models; and so forth. On the other hand, while there has been tremendous progress in developing causal inference methods with strong theoretical guarantees, existing methods typically do not apply in practice since they assume an abundance of data. Working at the intersection of ML and causal inference, we directly address the lack of robustness in ML, and improve the statistical efficiency of causal inference techniques. The motivation behind the work presented in this thesis is to improve methods for building predictive, and causal models that are used to guide decision making. Throughout, we focus mostly on decision making in the healthcare context. On the ML for causality side, we use ML tools and analysis techniques to develop statistically efficient causal models that can guide clinicians when choosing between two treatments. On the causality for ML side, we study how knowledge of the causal mechanisms that generate observed data can be used to efficiently regularize predictive models without introducing biases. In a clinical context, we show how causal knowledge can be used to build robust, and accurate models to predict the spread of contagious infections. In a non-clinical setting, we study how to use causal knowledge to train models that are robust to distribution shifts in the context of image classification. Ph.D. 2022-01-14T14:51:47Z 2022-01-14T14:51:47Z 2021-06 2021-06-23T19:38:35.762Z Thesis https://hdl.handle.net/1721.1/139131 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle	Makar, Maggie Machine learning and causality: Building efficient, and reliable models for decision-making
title	Machine learning and causality: Building efficient, and reliable models for decision-making
title_full	Machine learning and causality: Building efficient, and reliable models for decision-making
title_fullStr	Machine learning and causality: Building efficient, and reliable models for decision-making
title_full_unstemmed	Machine learning and causality: Building efficient, and reliable models for decision-making
title_short	Machine learning and causality: Building efficient, and reliable models for decision-making
title_sort	machine learning and causality building efficient and reliable models for decision making
url	https://hdl.handle.net/1721.1/139131
work_keys_str_mv	AT makarmaggie machinelearningandcausalitybuildingefficientandreliablemodelsfordecisionmaking

Machine learning and causality: Building efficient, and reliable models for decision-making

Similar Items