Machine learning and causality: Building efficient, and reliable models for decision-making

Bibliographic Details
Main Author: Makar, Maggie
Other Authors: Guttag, John V.
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access: https://hdl.handle.net/1721.1/139131
_version_ 1826194778868416512
author Makar, Maggie
author2 Guttag, John V.
collection MIT
description We explore relationships between machine learning (ML) and causal inference, focusing on how each can be improved by borrowing ideas from the other. ML has been successfully applied to many problems, but the lack of strong theoretical guarantees has led to many unexpected failures. Models that perform well on the training distribution tend to break down when applied to different distributions; small perturbations can “fool” the trained model and drastically change its predictions; arbitrary choices in the training algorithm lead to vastly different models; and so forth. On the other hand, while there has been tremendous progress in developing causal inference methods with strong theoretical guarantees, existing methods typically do not apply in practice because they assume an abundance of data. Working at the intersection of ML and causal inference, we directly address the lack of robustness in ML and improve the statistical efficiency of causal inference techniques. The motivation behind the work presented in this thesis is to improve methods for building predictive and causal models that are used to guide decision making. Throughout, we focus mostly on decision making in the healthcare context. On the ML for causality side, we use ML tools and analysis techniques to develop statistically efficient causal models that can guide clinicians when choosing between two treatments. On the causality for ML side, we study how knowledge of the causal mechanisms that generate observed data can be used to efficiently regularize predictive models without introducing biases. In a clinical context, we show how causal knowledge can be used to build robust and accurate models to predict the spread of contagious infections. In a non-clinical setting, we study how to use causal knowledge to train models that are robust to distribution shifts in the context of image classification.
first_indexed 2024-09-23T10:01:40Z
format Thesis
id mit-1721.1/139131
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T10:01:40Z
publishDate 2022
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/139131 2022-01-15T03:56:15Z Machine learning and causality: Building efficient, and reliable models for decision-making Makar, Maggie Guttag, John V. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Ph.D. 2022-01-14T14:51:47Z 2022-01-14T14:51:47Z 2021-06 2021-06-23T19:38:35.762Z Thesis https://hdl.handle.net/1721.1/139131 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
title Machine learning and causality: Building efficient, and reliable models for decision-making
url https://hdl.handle.net/1721.1/139131