A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in context

Summary: Background: Machine learning (ML) predictions are becoming increasingly integrated into medical practice. One commonly used method, ℓ1-penalised logistic regression (LASSO), can estimate patient risk for disease outcomes but is limited by only providing point estimates. Instead, Bayesian l...

Full description

Bibliographic Details
Main Authors:	Claudio Fanconi, Anne de Hond, Dylan Peterson, Angelo Capodici, Tina Hernandez-Boussard
Format:	Article
Language:	English
Published:	Elsevier 2023-06-01
Series:	EBioMedicine
Subjects:	Bayesian logistic LASSO regression Predictive uncertainty Acute care utilization Chemotherapy
Online Access:	http://www.sciencedirect.com/science/article/pii/S2352396423001974

_version_	1797812841800007680
author	Claudio Fanconi Anne de Hond Dylan Peterson Angelo Capodici Tina Hernandez-Boussard
author_facet	Claudio Fanconi Anne de Hond Dylan Peterson Angelo Capodici Tina Hernandez-Boussard
author_sort	Claudio Fanconi
collection	DOAJ
description	Summary: Background: Machine learning (ML) predictions are becoming increasingly integrated into medical practice. One commonly used method, ℓ1-penalised logistic regression (LASSO), can estimate patient risk for disease outcomes but is limited by only providing point estimates. Instead, Bayesian logistic LASSO regression (BLLR) models provide distributions for risk predictions, giving clinicians a better understanding of predictive uncertainty, but they are not commonly implemented. Methods: This study evaluates the predictive performance of different BLLRs compared to standard logistic LASSO regression, using real-world, high-dimensional, structured electronic health record (EHR) data from cancer patients initiating chemotherapy at a comprehensive cancer centre. Multiple BLLR models were compared against a LASSO model using an 80–20 random split using 10-fold cross-validation to predict the risk of acute care utilization (ACU) after starting chemotherapy. Findings: This study included 8439 patients. The LASSO model predicted ACU with an area under the receiver operating characteristic curve (AUROC) of 0.806 (95% CI: 0.775–0.834). BLLR with a Horseshoe+ prior and a posterior approximated by Metropolis–Hastings sampling showed similar performance: 0.807 (95% CI: 0.780–0.834) and offers the advantage of uncertainty estimation for each prediction. In addition, BLLR could identify predictions too uncertain to be automatically classified. BLLR uncertainties were stratified by different patient subgroups, demonstrating that predictive uncertainties significantly differ across race, cancer type, and stage. Interpretation: BLLRs are a promising yet underutilised tool that increases explainability by providing risk estimates while offering a similar level of performance to standard LASSO-based models. Additionally, these models can identify patient subgroups with higher uncertainty, which can augment clinical decision-making. Funding: This work was supported in part by the National Library Of Medicine of the National Institutes of Health under Award Number R01LM013362. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
first_indexed	2024-03-13T07:43:21Z
format	Article
id	doaj.art-f87a79b718e04855b1ea287d10e290fa
institution	Directory Open Access Journal
issn	2352-3964
language	English
last_indexed	2024-03-13T07:43:21Z
publishDate	2023-06-01
publisher	Elsevier
record_format	Article
series	EBioMedicine
spelling	doaj.art-f87a79b718e04855b1ea287d10e290fa2023-06-03T04:22:17ZengElsevierEBioMedicine2352-39642023-06-0192104632A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in contextClaudio Fanconi0Anne de Hond1Dylan Peterson2Angelo Capodici3Tina Hernandez-Boussard4Department of Information Technology and Electrical Engineering, ETH Zürich, Zürich, Switzerland; Department of Medicine (Biomedical Informatics), Stanford University, Stanford, USADepartment of Medicine (Biomedical Informatics), Stanford University, Stanford, USA; Clinical AI Implementation and Research Lab, Leiden University Medical Centre, Leiden, the NetherlandsDepartment of Medicine (Biomedical Informatics), Stanford University, Stanford, USADepartment of Medicine (Biomedical Informatics), Stanford University, Stanford, USA; Department of Biomedical and Neuromotor Science, University of Bologna, Bologna, ItalyDepartment of Medicine (Biomedical Informatics), Stanford University, Stanford, USA; Corresponding author. Stanford University, 453 Quarry Road, Palo Alto, CA, 94304, USA.Summary: Background: Machine learning (ML) predictions are becoming increasingly integrated into medical practice. One commonly used method, ℓ1-penalised logistic regression (LASSO), can estimate patient risk for disease outcomes but is limited by only providing point estimates. Instead, Bayesian logistic LASSO regression (BLLR) models provide distributions for risk predictions, giving clinicians a better understanding of predictive uncertainty, but they are not commonly implemented. Methods: This study evaluates the predictive performance of different BLLRs compared to standard logistic LASSO regression, using real-world, high-dimensional, structured electronic health record (EHR) data from cancer patients initiating chemotherapy at a comprehensive cancer centre. Multiple BLLR models were compared against a LASSO model using an 80–20 random split using 10-fold cross-validation to predict the risk of acute care utilization (ACU) after starting chemotherapy. Findings: This study included 8439 patients. The LASSO model predicted ACU with an area under the receiver operating characteristic curve (AUROC) of 0.806 (95% CI: 0.775–0.834). BLLR with a Horseshoe+ prior and a posterior approximated by Metropolis–Hastings sampling showed similar performance: 0.807 (95% CI: 0.780–0.834) and offers the advantage of uncertainty estimation for each prediction. In addition, BLLR could identify predictions too uncertain to be automatically classified. BLLR uncertainties were stratified by different patient subgroups, demonstrating that predictive uncertainties significantly differ across race, cancer type, and stage. Interpretation: BLLRs are a promising yet underutilised tool that increases explainability by providing risk estimates while offering a similar level of performance to standard LASSO-based models. Additionally, these models can identify patient subgroups with higher uncertainty, which can augment clinical decision-making. Funding: This work was supported in part by the National Library Of Medicine of the National Institutes of Health under Award Number R01LM013362. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.http://www.sciencedirect.com/science/article/pii/S2352396423001974Bayesian logistic LASSO regressionPredictive uncertaintyAcute care utilizationChemotherapy
spellingShingle	Claudio Fanconi Anne de Hond Dylan Peterson Angelo Capodici Tina Hernandez-Boussard A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in context EBioMedicine Bayesian logistic LASSO regression Predictive uncertainty Acute care utilization Chemotherapy
title	A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in context
title_full	A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in context
title_fullStr	A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in context
title_full_unstemmed	A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in context
title_short	A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in context
title_sort	bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationresearch in context
topic	Bayesian logistic LASSO regression Predictive uncertainty Acute care utilization Chemotherapy
url	http://www.sciencedirect.com/science/article/pii/S2352396423001974
work_keys_str_mv	AT claudiofanconi abayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT annedehond abayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT dylanpeterson abayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT angelocapodici abayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT tinahernandezboussard abayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT claudiofanconi bayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT annedehond bayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT dylanpeterson bayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT angelocapodici bayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext AT tinahernandezboussard bayesianapproachtopredictiveuncertaintyinchemotherapypatientsatriskofacutecareutilizationresearchincontext

A Bayesian approach to predictive uncertainty in chemotherapy patients at risk of acute care utilizationResearch in context

Similar Items