Machine Learning Approaches for Equitable Healthcare

Bibliographic Details
Main Author: Chen, Irene Y.
Other Authors: Sontag, David
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access: https://hdl.handle.net/1721.1/147451
_version_ 1826195186027331584
author Chen, Irene Y.
author2 Sontag, David
author_facet Sontag, David
Chen, Irene Y.
author_sort Chen, Irene Y.
collection MIT
description With the proliferation of clinical data and algorithms to improve clinical care, researchers are increasingly concerned about the equity and fairness of the resulting machine learning models. Because the observational data we collect can be noisy, incomplete, and biased, a seemingly straightforward application of existing methods, whether for clinical intervention or for better understanding of human knowledge, can lead to inaccurate and inequitable clinical algorithms. To begin to address these challenges, we need new tools to tackle the bias that can arise when modeling data. In this thesis, we present machine learning approaches for auditing, ameliorating, and preventing bias in the model development process of machine learning for healthcare. In particular, we focus on case studies that can provide actionable insights, and we recommend changes based on the results of the corresponding experiments. Questions of equity and bias can be thought of in terms of the different steps of the model development pipeline. We argue that these model development steps can be made more equitable and less biased when they 1) mitigate algorithmic bias that may arise from biased data collection or model development, and 2) address known systemic health disparities. We present four case studies of machine learning approaches towards equitable healthcare and demonstrate these approaches on real clinical tasks. First, we decompose the sources of discrimination and provide empirical estimation techniques. We present results from applying these techniques to the tasks of intensive care unit mortality prediction and salary prediction. Second, we consider the predictive analytics of health insurance providers, namely predicting the likelihood of hospitalization and the likelihood of high-risk pregnancy. We apply the same discrimination decomposition techniques towards practical steps for mitigating algorithmic discrimination. Third, we study the task of clustering interval-censored time-series data. We develop a deep generative model, called SubLign, to learn the latent delayed entry alignment value for each time series as well as the heterogeneous progression patterns across the population. We evaluate our model on synthetically generated data. We then study the task of disease subtyping to improve the understanding of disease progression. We present results on clustering patients with conditions including heart failure and Parkinson’s disease. Finally, we study an example of using machine learning on an understudied problem that affects underserved patients: early detection of intimate partner violence. We develop a model that predicts the likelihood of eventual intimate partner violence self-reporting and radiology injury labeling from radiology reports. We conclude with a discussion of how machine learning can continue to address equity and bias in healthcare.
first_indexed 2024-09-23T10:08:43Z
format Thesis
id mit-1721.1/147451
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T10:08:43Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/147451 2023-01-20T03:49:12Z Machine Learning Approaches for Equitable Healthcare Chen, Irene Y. Sontag, David Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Ph.D. 2023-01-19T19:51:26Z 2023-01-19T19:51:26Z 2022-09 2022-10-19T19:07:49.973Z Thesis https://hdl.handle.net/1721.1/147451 0000-0003-0173-9133 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Chen, Irene Y.
Machine Learning Approaches for Equitable Healthcare
title Machine Learning Approaches for Equitable Healthcare
title_full Machine Learning Approaches for Equitable Healthcare
title_fullStr Machine Learning Approaches for Equitable Healthcare
title_full_unstemmed Machine Learning Approaches for Equitable Healthcare
title_short Machine Learning Approaches for Equitable Healthcare
title_sort machine learning approaches for equitable healthcare
url https://hdl.handle.net/1721.1/147451
work_keys_str_mv AT chenireney machinelearningapproachesforequitablehealthcare