Exploring a global interpretation mechanism for deep learning networks when predicting sepsis

Abstract The purpose of this study is to identify additional clinical features for sepsis detection through the use of a novel mechanism for interpreting black-box machine learning models trained and to provide a suitable evaluation for the mechanism. We use the publicly available dataset from the 2...

Full description

Bibliographic Details
Main Authors: Ethan A. T. Strickler, Joshua Thomas, Johnson P. Thomas, Bruce Benjamin, Rittika Shamsuddin
Format: Article
Language:English
Published: Nature Portfolio 2023-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-30091-3
_version_ 1797864949766160384
author Ethan A. T. Strickler
Joshua Thomas
Johnson P. Thomas
Bruce Benjamin
Rittika Shamsuddin
author_facet Ethan A. T. Strickler
Joshua Thomas
Johnson P. Thomas
Bruce Benjamin
Rittika Shamsuddin
author_sort Ethan A. T. Strickler
collection DOAJ
description Abstract The purpose of this study is to identify additional clinical features for sepsis detection through the use of a novel mechanism for interpreting black-box machine learning models trained and to provide a suitable evaluation for the mechanism. We use the publicly available dataset from the 2019 PhysioNet Challenge. It has around 40,000 Intensive Care Unit (ICU) patients with 40 physiological variables. Using Long Short-Term Memory (LSTM) as the representative black-box machine learning model, we adapted the Multi-set Classifier to globally interpret the black-box model for concepts it learned about sepsis. To identify relevant features, the result is compared against: (i) features used by a computational sepsis expert, (ii) clinical features from clinical collaborators, (iii) academic features from literature, and (iv) significant features from statistical hypothesis testing. Random Forest was found to be the computational sepsis expert because it had high accuracies for solving both the detection and early detection, and a high degree of overlap with clinical and literature features. Using the proposed interpretation mechanism and the dataset, we identified 17 features that the LSTM used for sepsis classification, 11 of which overlaps with the top 20 features from the Random Forest model, 10 with academic features and 5 with clinical features. Clinical opinion suggests, 3 LSTM features have strong correlation with some clinical features that were not identified by the mechanism. We also found that age, chloride ion concentration, pH and oxygen saturation should be investigated further for connection with developing sepsis. Interpretation mechanisms can bolster the incorporation of state-of-the-art machine learning models into clinical decision support systems, and might help clinicians to address the issue of early sepsis detection. The promising results from this study warrants further investigation into creation of new and improvement of existing interpretation mechanisms for black-box models, and into clinical features that are currently not used in clinical assessment of sepsis.
first_indexed 2024-04-09T23:00:35Z
format Article
id doaj.art-4f6c94787f8c4fb58d99ed9a4d6f9722
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-04-09T23:00:35Z
publishDate 2023-02-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-4f6c94787f8c4fb58d99ed9a4d6f97222023-03-22T10:59:59ZengNature PortfolioScientific Reports2045-23222023-02-0113111710.1038/s41598-023-30091-3Exploring a global interpretation mechanism for deep learning networks when predicting sepsisEthan A. T. Strickler0Joshua Thomas1Johnson P. Thomas2Bruce Benjamin3Rittika Shamsuddin4Physics and Mathematics, East Central UniversityDepartment of Internal Medicine, Rush University Medical CenterOklahoma State UniversitySchool of Biomedical Sciences, Center for Health SciencesOklahoma State UniversityAbstract The purpose of this study is to identify additional clinical features for sepsis detection through the use of a novel mechanism for interpreting black-box machine learning models trained and to provide a suitable evaluation for the mechanism. We use the publicly available dataset from the 2019 PhysioNet Challenge. It has around 40,000 Intensive Care Unit (ICU) patients with 40 physiological variables. Using Long Short-Term Memory (LSTM) as the representative black-box machine learning model, we adapted the Multi-set Classifier to globally interpret the black-box model for concepts it learned about sepsis. To identify relevant features, the result is compared against: (i) features used by a computational sepsis expert, (ii) clinical features from clinical collaborators, (iii) academic features from literature, and (iv) significant features from statistical hypothesis testing. Random Forest was found to be the computational sepsis expert because it had high accuracies for solving both the detection and early detection, and a high degree of overlap with clinical and literature features. Using the proposed interpretation mechanism and the dataset, we identified 17 features that the LSTM used for sepsis classification, 11 of which overlaps with the top 20 features from the Random Forest model, 10 with academic features and 5 with clinical features. Clinical opinion suggests, 3 LSTM features have strong correlation with some clinical features that were not identified by the mechanism. We also found that age, chloride ion concentration, pH and oxygen saturation should be investigated further for connection with developing sepsis. Interpretation mechanisms can bolster the incorporation of state-of-the-art machine learning models into clinical decision support systems, and might help clinicians to address the issue of early sepsis detection. The promising results from this study warrants further investigation into creation of new and improvement of existing interpretation mechanisms for black-box models, and into clinical features that are currently not used in clinical assessment of sepsis.https://doi.org/10.1038/s41598-023-30091-3
spellingShingle Ethan A. T. Strickler
Joshua Thomas
Johnson P. Thomas
Bruce Benjamin
Rittika Shamsuddin
Exploring a global interpretation mechanism for deep learning networks when predicting sepsis
Scientific Reports
title Exploring a global interpretation mechanism for deep learning networks when predicting sepsis
title_full Exploring a global interpretation mechanism for deep learning networks when predicting sepsis
title_fullStr Exploring a global interpretation mechanism for deep learning networks when predicting sepsis
title_full_unstemmed Exploring a global interpretation mechanism for deep learning networks when predicting sepsis
title_short Exploring a global interpretation mechanism for deep learning networks when predicting sepsis
title_sort exploring a global interpretation mechanism for deep learning networks when predicting sepsis
url https://doi.org/10.1038/s41598-023-30091-3
work_keys_str_mv AT ethanatstrickler exploringaglobalinterpretationmechanismfordeeplearningnetworkswhenpredictingsepsis
AT joshuathomas exploringaglobalinterpretationmechanismfordeeplearningnetworkswhenpredictingsepsis
AT johnsonpthomas exploringaglobalinterpretationmechanismfordeeplearningnetworkswhenpredictingsepsis
AT brucebenjamin exploringaglobalinterpretationmechanismfordeeplearningnetworkswhenpredictingsepsis
AT rittikashamsuddin exploringaglobalinterpretationmechanismfordeeplearningnetworkswhenpredictingsepsis