Machine learning and causality: Building efficient, and reliable models for decision-making
Main Author: | Makar, Maggie |
---|---|
Other Authors: | Guttag, John V. |
Format: | Thesis |
Published: | Massachusetts Institute of Technology 2022 |
Online Access: | https://hdl.handle.net/1721.1/139131 |
_version_ | 1826194778868416512 |
---|---|
author | Makar, Maggie |
author2 | Guttag, John V. |
author_facet | Guttag, John V. Makar, Maggie |
author_sort | Makar, Maggie |
collection | MIT |
description | We explore relationships between machine learning (ML) and causal inference. We focus on improvements in each by borrowing ideas from one another.
ML has been successfully applied to many problems, but the lack of strong theoretical guarantees has led to many unexpected failures. Models that perform well on the training distribution tend to break down when applied to different distributions; small perturbations can “fool” the trained model and drastically change its predictions; arbitrary choices in the training algorithm lead to vastly different models; and so forth. On the other hand, while there has been tremendous progress in developing causal inference methods with strong theoretical guarantees, existing methods typically do not apply in practice because they assume an abundance of data. Working at the intersection of ML and causal inference, we directly address the lack of robustness in ML and improve the statistical efficiency of causal inference techniques.
The motivation behind the work presented in this thesis is to improve methods for building predictive and causal models that are used to guide decision making. Throughout, we focus mostly on decision making in the healthcare context. On the ML-for-causality side, we use ML tools and analysis techniques to develop statistically efficient causal models that can guide clinicians when choosing between two treatments. On the causality-for-ML side, we study how knowledge of the causal mechanisms that generate observed data can be used to efficiently regularize predictive models without introducing biases. In a clinical context, we show how causal knowledge can be used to build robust and accurate models to predict the spread of contagious infections. In a non-clinical setting, we study how to use causal knowledge to train models that are robust to distribution shifts in the context of image classification. |
first_indexed | 2024-09-23T10:01:40Z |
format | Thesis |
id | mit-1721.1/139131 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T10:01:40Z |
publishDate | 2022 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/139131 2022-01-15T03:56:15Z Machine learning and causality: Building efficient, and reliable models for decision-making Makar, Maggie Guttag, John V. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Ph.D. 2022-01-14T14:51:47Z 2022-01-14T14:51:47Z 2021-06 2021-06-23T19:38:35.762Z Thesis https://hdl.handle.net/1721.1/139131 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology |
spellingShingle | Makar, Maggie Machine learning and causality: Building efficient, and reliable models for decision-making |
title | Machine learning and causality: Building efficient, and reliable models for decision-making |
title_full | Machine learning and causality: Building efficient, and reliable models for decision-making |
title_fullStr | Machine learning and causality: Building efficient, and reliable models for decision-making |
title_full_unstemmed | Machine learning and causality: Building efficient, and reliable models for decision-making |
title_short | Machine learning and causality: Building efficient, and reliable models for decision-making |
title_sort | machine learning and causality building efficient and reliable models for decision making |
url | https://hdl.handle.net/1721.1/139131 |
work_keys_str_mv | AT makarmaggie machinelearningandcausalitybuildingefficientandreliablemodelsfordecisionmaking |