Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective

(1) Background: This study investigates influential risk factors for predicting 30-day readmission to hospital for Campylobacter infections (CI). (2) Methods: We linked general practitioner and hospital admission records of 13,006 patients with CI in Wales (1990–2015). An approach called TF-zR (term...

Full description

Bibliographic Details
Main Authors: Shang-Ming Zhou, Ronan A. Lyons, Muhammad A. Rahman, Alexander Holborow, Sinead Brophy
Format: Article
Language:English
Published: MDPI AG 2022-01-01
Series:Journal of Personalized Medicine
Subjects:
Online Access:https://www.mdpi.com/2075-4426/12/1/86
_version_ 1797492809450651648
author Shang-Ming Zhou
Ronan A. Lyons
Muhammad A. Rahman
Alexander Holborow
Sinead Brophy
author_facet Shang-Ming Zhou
Ronan A. Lyons
Muhammad A. Rahman
Alexander Holborow
Sinead Brophy
author_sort Shang-Ming Zhou
collection DOAJ
description (1) Background: This study investigates influential risk factors for predicting 30-day readmission to hospital for Campylobacter infections (CI). (2) Methods: We linked general practitioner and hospital admission records of 13,006 patients with CI in Wales (1990–2015). An approach called TF-zR (term frequency-zRelevance) technique was presented to evaluates how relevant a clinical term is to a patient in a cohort characterized by coded health records. The zR is a supervised term-weighting metric to assign weight to a term based on relative frequencies of the term across different classes. Cost-sensitive classifier with swarm optimization and weighted subset learning was integrated to identify influential clinical signals as predictors and optimal model for readmission prediction. (3) Results: From a pool of up to 17,506 variables, 33 most predictive factors were identified, including age, gender, Townsend deprivation quintiles, comorbidities, medications, and procedures. The predictive model predicted readmission with 73% sensitivity and 54% specificity. Variables associated with readmission included male gender, recurrent tonsillitis, non-healing open wounds, operation for in-gown toenails. Cystitis, paracetamol/codeine use, age (21–25), and heliclear triple pack use, were associated with a lower risk of readmission. (4) Conclusions: This study gives a profile of clustered variables that are predictive of readmission associated with campylobacteriosis.
first_indexed 2024-03-10T01:09:03Z
format Article
id doaj.art-238c3add27274cf9ae7959246fe56b6a
institution Directory Open Access Journal
issn 2075-4426
language English
last_indexed 2024-03-10T01:09:03Z
publishDate 2022-01-01
publisher MDPI AG
record_format Article
series Journal of Personalized Medicine
spelling doaj.art-238c3add27274cf9ae7959246fe56b6a2023-11-23T14:20:19ZengMDPI AGJournal of Personalized Medicine2075-44262022-01-011218610.3390/jpm12010086Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining PerspectiveShang-Ming Zhou0Ronan A. Lyons1Muhammad A. Rahman2Alexander Holborow3Sinead Brophy4Centre for Health Technology, Faculty of Health, University of Plymouth, Plymouth PL4 8AA, UKHealth Data Research UK, Swansea University Medical School, Swansea University, Swansea SA2 8PP, UKDepartment of Computer Science, Cardiff Metropolitan University, Cardiff CF5 2YB, UKSouth West Wales Cancer Centre, Singleton Hospital, Swansea SA2 8QA, UKHealth Data Research UK, Swansea University Medical School, Swansea University, Swansea SA2 8PP, UK(1) Background: This study investigates influential risk factors for predicting 30-day readmission to hospital for Campylobacter infections (CI). (2) Methods: We linked general practitioner and hospital admission records of 13,006 patients with CI in Wales (1990–2015). An approach called TF-zR (term frequency-zRelevance) technique was presented to evaluates how relevant a clinical term is to a patient in a cohort characterized by coded health records. The zR is a supervised term-weighting metric to assign weight to a term based on relative frequencies of the term across different classes. Cost-sensitive classifier with swarm optimization and weighted subset learning was integrated to identify influential clinical signals as predictors and optimal model for readmission prediction. (3) Results: From a pool of up to 17,506 variables, 33 most predictive factors were identified, including age, gender, Townsend deprivation quintiles, comorbidities, medications, and procedures. The predictive model predicted readmission with 73% sensitivity and 54% specificity. Variables associated with readmission included male gender, recurrent tonsillitis, non-healing open wounds, operation for in-gown toenails. Cystitis, paracetamol/codeine use, age (21–25), and heliclear triple pack use, were associated with a lower risk of readmission. (4) Conclusions: This study gives a profile of clustered variables that are predictive of readmission associated with campylobacteriosis.https://www.mdpi.com/2075-4426/12/1/86hospitalisationreadmission<i>Campylobacter</i> infectionsmachine learningtext miningfeature selection
spellingShingle Shang-Ming Zhou
Ronan A. Lyons
Muhammad A. Rahman
Alexander Holborow
Sinead Brophy
Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective
Journal of Personalized Medicine
hospitalisation
readmission
<i>Campylobacter</i> infections
machine learning
text mining
feature selection
title Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective
title_full Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective
title_fullStr Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective
title_full_unstemmed Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective
title_short Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective
title_sort predicting hospital readmission for campylobacteriosis from electronic health records a machine learning and text mining perspective
topic hospitalisation
readmission
<i>Campylobacter</i> infections
machine learning
text mining
feature selection
url https://www.mdpi.com/2075-4426/12/1/86
work_keys_str_mv AT shangmingzhou predictinghospitalreadmissionforcampylobacteriosisfromelectronichealthrecordsamachinelearningandtextminingperspective
AT ronanalyons predictinghospitalreadmissionforcampylobacteriosisfromelectronichealthrecordsamachinelearningandtextminingperspective
AT muhammadarahman predictinghospitalreadmissionforcampylobacteriosisfromelectronichealthrecordsamachinelearningandtextminingperspective
AT alexanderholborow predictinghospitalreadmissionforcampylobacteriosisfromelectronichealthrecordsamachinelearningandtextminingperspective
AT sineadbrophy predictinghospitalreadmissionforcampylobacteriosisfromelectronichealthrecordsamachinelearningandtextminingperspective