A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals

<p><b>Background</b></p> Multicentre training could reduce biases in medical artificial intelligence (AI); however, ethical, legal, and technical considerations can constrain the ability of hospitals to share data. Federated learning enables institutions to participate in alg...

Description complète

Détails bibliographiques
Auteurs principaux:	Soltan, AS, Thakur, A, Yang, J, Chauhan, A, D'Cruz, LG, Dickson, P, Soltan, MA, Thickett, DR, Eyre, DW, Zhu, T, Clifton, DA
Format:	Journal article
Langue:	English
Publié:	Elsevier 2024

_version_	1826312002906095616
author	Soltan, AS Thakur, A Yang, J Chauhan, A D'Cruz, LG Dickson, P Soltan, MA Thickett, DR Eyre, DW Zhu, T Clifton, DA
author_facet	Soltan, AS Thakur, A Yang, J Chauhan, A D'Cruz, LG Dickson, P Soltan, MA Thickett, DR Eyre, DW Zhu, T Clifton, DA
author_sort	Soltan, AS
collection	OXFORD
description	<p><b>Background</b></p> Multicentre training could reduce biases in medical artificial intelligence (AI); however, ethical, legal, and technical considerations can constrain the ability of hospitals to share data. Federated learning enables institutions to participate in algorithm development while retaining custody of their data but uptake in hospitals has been limited, possibly as deployment requires specialist software and technical expertise at each site. We previously developed an artificial intelligence-driven screening test for COVID-19 in emergency departments, known as CURIAL-Lab, which uses vital signs and blood tests that are routinely available within 1 h of a patient's arrival. Here we aimed to federate our COVID-19 screening test by developing an easy-to-use embedded system—which we introduce as full-stack federated learning—to train and evaluate machine learning models across four UK hospital groups without centralising patient data. <p><b>Methods</b></p> We supplied a Raspberry Pi 4 Model B preloaded with our federated learning software pipeline to four National Health Service (NHS) hospital groups in the UK: Oxford University Hospitals NHS Foundation Trust (OUH; through the locally linked research University, University of Oxford), University Hospitals Birmingham NHS Foundation Trust (UHB), Bedfordshire Hospitals NHS Foundation Trust (BH), and Portsmouth Hospitals University NHS Trust (PUH). OUH, PUH, and UHB participated in federated training, training a deep neural network and logistic regressor over 150 rounds to form and calibrate a global model to predict COVID-19 status, using clinical data from patients admitted before the pandemic (COVID-19-negative) and testing positive for COVID-19 during the first wave of the pandemic. We conducted a federated evaluation of the global model for admissions during the second wave of the pandemic at OUH, PUH, and externally at BH. For OUH and PUH, we additionally performed local fine-tuning of the global model using the sites’ individual training data, forming a site-tuned model, and evaluated the resultant model for admissions during the second wave of the pandemic. This study included data collected between Dec 1, 2018, and March 1, 2021; the exact date ranges used varied by site. The primary outcome was overall model performance, measured as the area under the receiver operating characteristic curve (AUROC). Removable micro secure digital (microSD) storage was destroyed on study completion. <p><b>Findings</b></p> Clinical data from 130 941 patients (1772 COVID-19-positive), routinely collected across three hospital groups (OUH, PUH, and UHB), were included in federated training. The evaluation step included data from 32 986 patients (3549 COVID-19-positive) attending OUH, PUH, or BH during the second wave of the pandemic. Federated training of a global deep neural network classifier improved upon performance of models trained locally in terms of AUROC by a mean of 27·6% (SD 2·2): AUROC increased from 0·574 (95% CI 0·560–0·589) at OUH and 0·622 (0·608–0·637) at PUH using the locally trained models to 0·872 (0·862–0·882) at OUH and 0·876 (0·865–0·886) at PUH using the federated global model. Performance improvement was smaller for a logistic regression model, with a mean increase in AUROC of 13·9% (0·5%). During federated external evaluation at BH, AUROC for the global deep neural network model was 0·917 (0·893–0·942), with 89·7% sensitivity (83·6–93·6) and 76·6% specificity (73·9–79·1). Site-specific tuning of the global model did not significantly improve performance (change in AUROC <0·01). <p><b>Interpretation</b></p> We developed an embedded system for federated learning, using microcomputing to optimise for ease of deployment. We deployed full-stack federated learning across four UK hospital groups to develop a COVID-19 screening test without centralising patient data. Federation improved model performance, and the resultant global models were generalisable. Full-stack federated learning could enable hospitals to contribute to AI development at low cost and without specialist technical expertise at each site. <p><b>Funding</b></p> The Wellcome Trust, University of Oxford Medical and Life Sciences Translational Fund.
first_indexed	2024-03-07T08:19:38Z
format	Journal article
id	oxford-uuid:12e2da0c-e509-4d20-bc0d-07e5b5563eae
institution	University of Oxford
language	English
last_indexed	2024-03-07T08:19:38Z
publishDate	2024
publisher	Elsevier
record_format	dspace
spelling	oxford-uuid:12e2da0c-e509-4d20-bc0d-07e5b5563eae2024-01-26T08:46:56ZA scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitalsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:12e2da0c-e509-4d20-bc0d-07e5b5563eaeEnglishSymplectic ElementsElsevier2024Soltan, ASThakur, AYang, JChauhan, AD'Cruz, LGDickson, PSoltan, MAThickett, DREyre, DWZhu, TClifton, DA<p><b>Background</b></p> Multicentre training could reduce biases in medical artificial intelligence (AI); however, ethical, legal, and technical considerations can constrain the ability of hospitals to share data. Federated learning enables institutions to participate in algorithm development while retaining custody of their data but uptake in hospitals has been limited, possibly as deployment requires specialist software and technical expertise at each site. We previously developed an artificial intelligence-driven screening test for COVID-19 in emergency departments, known as CURIAL-Lab, which uses vital signs and blood tests that are routinely available within 1 h of a patient's arrival. Here we aimed to federate our COVID-19 screening test by developing an easy-to-use embedded system—which we introduce as full-stack federated learning—to train and evaluate machine learning models across four UK hospital groups without centralising patient data. <p><b>Methods</b></p> We supplied a Raspberry Pi 4 Model B preloaded with our federated learning software pipeline to four National Health Service (NHS) hospital groups in the UK: Oxford University Hospitals NHS Foundation Trust (OUH; through the locally linked research University, University of Oxford), University Hospitals Birmingham NHS Foundation Trust (UHB), Bedfordshire Hospitals NHS Foundation Trust (BH), and Portsmouth Hospitals University NHS Trust (PUH). OUH, PUH, and UHB participated in federated training, training a deep neural network and logistic regressor over 150 rounds to form and calibrate a global model to predict COVID-19 status, using clinical data from patients admitted before the pandemic (COVID-19-negative) and testing positive for COVID-19 during the first wave of the pandemic. We conducted a federated evaluation of the global model for admissions during the second wave of the pandemic at OUH, PUH, and externally at BH. For OUH and PUH, we additionally performed local fine-tuning of the global model using the sites’ individual training data, forming a site-tuned model, and evaluated the resultant model for admissions during the second wave of the pandemic. This study included data collected between Dec 1, 2018, and March 1, 2021; the exact date ranges used varied by site. The primary outcome was overall model performance, measured as the area under the receiver operating characteristic curve (AUROC). Removable micro secure digital (microSD) storage was destroyed on study completion. <p><b>Findings</b></p> Clinical data from 130 941 patients (1772 COVID-19-positive), routinely collected across three hospital groups (OUH, PUH, and UHB), were included in federated training. The evaluation step included data from 32 986 patients (3549 COVID-19-positive) attending OUH, PUH, or BH during the second wave of the pandemic. Federated training of a global deep neural network classifier improved upon performance of models trained locally in terms of AUROC by a mean of 27·6% (SD 2·2): AUROC increased from 0·574 (95% CI 0·560–0·589) at OUH and 0·622 (0·608–0·637) at PUH using the locally trained models to 0·872 (0·862–0·882) at OUH and 0·876 (0·865–0·886) at PUH using the federated global model. Performance improvement was smaller for a logistic regression model, with a mean increase in AUROC of 13·9% (0·5%). During federated external evaluation at BH, AUROC for the global deep neural network model was 0·917 (0·893–0·942), with 89·7% sensitivity (83·6–93·6) and 76·6% specificity (73·9–79·1). Site-specific tuning of the global model did not significantly improve performance (change in AUROC <0·01). <p><b>Interpretation</b></p> We developed an embedded system for federated learning, using microcomputing to optimise for ease of deployment. We deployed full-stack federated learning across four UK hospital groups to develop a COVID-19 screening test without centralising patient data. Federation improved model performance, and the resultant global models were generalisable. Full-stack federated learning could enable hospitals to contribute to AI development at low cost and without specialist technical expertise at each site. <p><b>Funding</b></p> The Wellcome Trust, University of Oxford Medical and Life Sciences Translational Fund.
spellingShingle	Soltan, AS Thakur, A Yang, J Chauhan, A D'Cruz, LG Dickson, P Soltan, MA Thickett, DR Eyre, DW Zhu, T Clifton, DA A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals
title	A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals
title_full	A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals
title_fullStr	A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals
title_full_unstemmed	A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals
title_short	A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals
title_sort	scalable federated learning solution for secondary care using low cost microcomputing privacy preserving development and evaluation of a covid 19 screening test in uk hospitals
work_keys_str_mv	AT soltanas ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT thakura ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT yangj ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT chauhana ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT dcruzlg ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT dicksonp ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT soltanma ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT thickettdr ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT eyredw ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT zhut ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT cliftonda ascalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT soltanas scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT thakura scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT yangj scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT chauhana scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT dcruzlg scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT dicksonp scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT soltanma scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT thickettdr scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT eyredw scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT zhut scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals AT cliftonda scalablefederatedlearningsolutionforsecondarycareusinglowcostmicrocomputingprivacypreservingdevelopmentandevaluationofacovid19screeningtestinukhospitals

A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals

Documents similaires