Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning
Introduction. The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities due to close living conditions, relatively low levels of vaccination, and reduced access to testing and treatment. While much progress has been made on describing and...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2024-01-01
|
Series: | MDM Policy & Practice |
Online Access: | https://doi.org/10.1177/23814683231222469 |
_version_ | 1827369238008954880 |
---|---|
author | Giovanni S. P. Malloy Lisa B. Puglisi Kristofer B. Bucklen Tyler D. Harvey Emily A. Wang Margaret L. Brandeau |
author_facet | Giovanni S. P. Malloy Lisa B. Puglisi Kristofer B. Bucklen Tyler D. Harvey Emily A. Wang Margaret L. Brandeau |
author_sort | Giovanni S. P. Malloy |
collection | DOAJ |
description | Introduction. The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities due to close living conditions, relatively low levels of vaccination, and reduced access to testing and treatment. While much progress has been made on describing and mitigating COVID-19 and other infectious disease risk in jails and prisons, there are open questions about which data can best predict future outbreaks. Methods. We used facility data and demographic and health data collected from 24 prison facilities in the Pennsylvania Department of Corrections from March 2020 to May 2021 to determine which sources of data best predict a coming COVID-19 outbreak in a prison facility. We used machine learning methods to cluster the prisons into groups based on similar facility-level characteristics, including size, rurality, and demographics of incarcerated people. We developed logistic regression classification models to predict for each cluster, before and after vaccine availability, whether there would be no cases, an outbreak defined as 2 or more cases, or a large outbreak, defined as 10 or more cases in the next 1, 2, and 3 d. We compared these predictions to data on outbreaks that occurred. Results. Facilities were divided into 8 clusters of sizes varying from 1 to 7 facilities per cluster. We trained 60 logistic regressions; 20 had test sets with between 35% and 65% of days with outbreaks detected. Of these, 8 logistic regressions correctly predicted the occurrence of an outbreak more than 55% of the time. The most common predictive feature was incident cases among the incarcerated population from 2 to 32 d prior. Other predictive features included the number of tests administered from 1 to 33 d prior, total population, test positivity rate, and county deaths, hospitalizations, and incident cases. Cumulative cases, vaccination rates, and race, ethnicity, or age statistics for incarcerated populations were generally not predictive. Conclusions. County-level measures of COVID-19, facility population, and test positivity rate appear as potential promising predictors of COVID-19 outbreaks in correctional facilities, suggesting that correctional facilities should monitor community transmission in addition to facility transmission to inform future outbreak response decisions. These efforts should not be limited to COVID-19 but should include any large-scale infectious disease outbreak that may involve institution-community transmission. Highlights The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities. We used machine learning methods with data collected from 24 prison facilities in the Pennsylvania Department of Corrections to determine which sources of data best predict a coming COVID-19 outbreak in a prison facility. Key predictors included county-level measures of COVID-19, facility population, and the test positivity rate in a facility. Fortifying correctional facilities with the ability to monitor local community rates of infection (e.g., though improved interagency collaboration and data sharing) along with continued testing of incarcerated people and staff can help correctional facilities better predict—and respond to—future infectious disease outbreaks. |
first_indexed | 2024-03-08T09:43:22Z |
format | Article |
id | doaj.art-071eec4240e84be1ae55f7507c561269 |
institution | Directory Open Access Journal |
issn | 2381-4683 |
language | English |
last_indexed | 2024-03-08T09:43:22Z |
publishDate | 2024-01-01 |
publisher | SAGE Publishing |
record_format | Article |
series | MDM Policy & Practice |
spelling | doaj.art-071eec4240e84be1ae55f7507c5612692024-01-29T16:03:21ZengSAGE PublishingMDM Policy & Practice2381-46832024-01-01910.1177/23814683231222469Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine LearningGiovanni S. P. MalloyLisa B. PuglisiKristofer B. BucklenTyler D. HarveyEmily A. WangMargaret L. BrandeauIntroduction. The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities due to close living conditions, relatively low levels of vaccination, and reduced access to testing and treatment. While much progress has been made on describing and mitigating COVID-19 and other infectious disease risk in jails and prisons, there are open questions about which data can best predict future outbreaks. Methods. We used facility data and demographic and health data collected from 24 prison facilities in the Pennsylvania Department of Corrections from March 2020 to May 2021 to determine which sources of data best predict a coming COVID-19 outbreak in a prison facility. We used machine learning methods to cluster the prisons into groups based on similar facility-level characteristics, including size, rurality, and demographics of incarcerated people. We developed logistic regression classification models to predict for each cluster, before and after vaccine availability, whether there would be no cases, an outbreak defined as 2 or more cases, or a large outbreak, defined as 10 or more cases in the next 1, 2, and 3 d. We compared these predictions to data on outbreaks that occurred. Results. Facilities were divided into 8 clusters of sizes varying from 1 to 7 facilities per cluster. We trained 60 logistic regressions; 20 had test sets with between 35% and 65% of days with outbreaks detected. Of these, 8 logistic regressions correctly predicted the occurrence of an outbreak more than 55% of the time. The most common predictive feature was incident cases among the incarcerated population from 2 to 32 d prior. Other predictive features included the number of tests administered from 1 to 33 d prior, total population, test positivity rate, and county deaths, hospitalizations, and incident cases. Cumulative cases, vaccination rates, and race, ethnicity, or age statistics for incarcerated populations were generally not predictive. Conclusions. County-level measures of COVID-19, facility population, and test positivity rate appear as potential promising predictors of COVID-19 outbreaks in correctional facilities, suggesting that correctional facilities should monitor community transmission in addition to facility transmission to inform future outbreak response decisions. These efforts should not be limited to COVID-19 but should include any large-scale infectious disease outbreak that may involve institution-community transmission. Highlights The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities. We used machine learning methods with data collected from 24 prison facilities in the Pennsylvania Department of Corrections to determine which sources of data best predict a coming COVID-19 outbreak in a prison facility. Key predictors included county-level measures of COVID-19, facility population, and the test positivity rate in a facility. Fortifying correctional facilities with the ability to monitor local community rates of infection (e.g., though improved interagency collaboration and data sharing) along with continued testing of incarcerated people and staff can help correctional facilities better predict—and respond to—future infectious disease outbreaks.https://doi.org/10.1177/23814683231222469 |
spellingShingle | Giovanni S. P. Malloy Lisa B. Puglisi Kristofer B. Bucklen Tyler D. Harvey Emily A. Wang Margaret L. Brandeau Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning MDM Policy & Practice |
title | Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning |
title_full | Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning |
title_fullStr | Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning |
title_full_unstemmed | Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning |
title_short | Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning |
title_sort | predicting covid 19 outbreaks in correctional facilities using machine learning |
url | https://doi.org/10.1177/23814683231222469 |
work_keys_str_mv | AT giovannispmalloy predictingcovid19outbreaksincorrectionalfacilitiesusingmachinelearning AT lisabpuglisi predictingcovid19outbreaksincorrectionalfacilitiesusingmachinelearning AT kristoferbbucklen predictingcovid19outbreaksincorrectionalfacilitiesusingmachinelearning AT tylerdharvey predictingcovid19outbreaksincorrectionalfacilitiesusingmachinelearning AT emilyawang predictingcovid19outbreaksincorrectionalfacilitiesusingmachinelearning AT margaretlbrandeau predictingcovid19outbreaksincorrectionalfacilitiesusingmachinelearning |