Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network

Tuberculosis (TB), primarily affecting the lungs, is caused by the bacterium Mycobacterium tuberculosis and poses a significant health risk. Detecting acid-fast bacilli (AFB) in stained samples is critical for TB diagnosis. Whole Slide (WS) Imaging allows for digitally examining these stained sample...

Full description

Bibliographic Details
Main Authors: Vasundhara Acharya, Diana Choi, Bulent Yener, Gillian Beamer
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10416836/
_version_ 1797323889029677056
author Vasundhara Acharya
Diana Choi
Bulent Yener
Gillian Beamer
author_facet Vasundhara Acharya
Diana Choi
Bulent Yener
Gillian Beamer
author_sort Vasundhara Acharya
collection DOAJ
description Tuberculosis (TB), primarily affecting the lungs, is caused by the bacterium Mycobacterium tuberculosis and poses a significant health risk. Detecting acid-fast bacilli (AFB) in stained samples is critical for TB diagnosis. Whole Slide (WS) Imaging allows for digitally examining these stained samples. However, current deep-learning approaches to analyzing large-sized whole slide images (WSIs) often employ patch-wise analysis, potentially missing the complex spatial patterns observed in the granuloma essential for accurate TB classification. To address this limitation, we propose an approach that models cell characteristics and interactions as a graph, capturing both cell-level information and the overall tissue micro-architecture. This method differs from the strategies in related cell graph-based works that rely on edge thresholds based on sparsity/density in cell graph construction, emphasizing a biologically informed threshold determination instead. We introduce a cell graph-based jumping knowledge neural network (CG-JKNN) that operates on the cell graphs where the edge thresholds are selected based on the length of the mycobacteria’s cords and the activated macrophage nucleus’s size to reflect the actual biological interactions observed in the tissue. The primary process involves training a Convolutional Neural Network (CNN) to segment AFBs and macrophage nuclei, followed by converting large (42831*41159 pixels) lung histology images into cell graphs where an activated macrophage nucleus/AFB represents each node within the graph and their interactions are denoted as edges. To enhance the interpretability of our model, we employ Integrated Gradients and Shapely Additive Explanations (SHAP). Our analysis incorporated a combination of 33 graph metrics and 20 cell morphology features. In terms of traditional machine learning models, Extreme Gradient Boosting (XGBoost) was the best performer, achieving an F1 score of 0.9813 and an Area under the Precision-Recall Curve (AUPRC) of 0.9848 on the test set. Among graph-based models, our CG-JKNN was the top performer, attaining an F1 score of 0.9549 and an AUPRC of 0.9846 on the held-out test set. The integration of graph-based and morphological features proved highly effective, with CG-JKNN and XGBoost showing promising results in classifying instances into AFB and activated macrophage nucleus. The features identified as significant by our models closely align with the criteria used by pathologists in practice, highlighting the clinical applicability of our approach. Future work will explore knowledge distillation techniques and graph-level classification into distinct TB progression categories.
first_indexed 2024-03-08T05:34:37Z
format Article
id doaj.art-168aa713ee60461989ab1ab9d9ae738d
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-08T05:34:37Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-168aa713ee60461989ab1ab9d9ae738d2024-02-06T00:01:33ZengIEEEIEEE Access2169-35362024-01-0112171641719410.1109/ACCESS.2024.335998910416836Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural NetworkVasundhara Acharya0https://orcid.org/0000-0002-8720-633XDiana Choi1Bulent Yener2https://orcid.org/0000-0003-3989-6097Gillian Beamer3https://orcid.org/0000-0001-6782-4424Rensselaer Polytechnic Institute, Troy, NY, USACummings School of Veterinary Medicine, Tufts University, North Grafton, MA, USARensselaer Polytechnic Institute, Troy, NY, USAResearch Pathology, Aiforia Technologies, Cambridge, MA, USATuberculosis (TB), primarily affecting the lungs, is caused by the bacterium Mycobacterium tuberculosis and poses a significant health risk. Detecting acid-fast bacilli (AFB) in stained samples is critical for TB diagnosis. Whole Slide (WS) Imaging allows for digitally examining these stained samples. However, current deep-learning approaches to analyzing large-sized whole slide images (WSIs) often employ patch-wise analysis, potentially missing the complex spatial patterns observed in the granuloma essential for accurate TB classification. To address this limitation, we propose an approach that models cell characteristics and interactions as a graph, capturing both cell-level information and the overall tissue micro-architecture. This method differs from the strategies in related cell graph-based works that rely on edge thresholds based on sparsity/density in cell graph construction, emphasizing a biologically informed threshold determination instead. We introduce a cell graph-based jumping knowledge neural network (CG-JKNN) that operates on the cell graphs where the edge thresholds are selected based on the length of the mycobacteria’s cords and the activated macrophage nucleus’s size to reflect the actual biological interactions observed in the tissue. The primary process involves training a Convolutional Neural Network (CNN) to segment AFBs and macrophage nuclei, followed by converting large (42831*41159 pixels) lung histology images into cell graphs where an activated macrophage nucleus/AFB represents each node within the graph and their interactions are denoted as edges. To enhance the interpretability of our model, we employ Integrated Gradients and Shapely Additive Explanations (SHAP). Our analysis incorporated a combination of 33 graph metrics and 20 cell morphology features. In terms of traditional machine learning models, Extreme Gradient Boosting (XGBoost) was the best performer, achieving an F1 score of 0.9813 and an Area under the Precision-Recall Curve (AUPRC) of 0.9848 on the test set. Among graph-based models, our CG-JKNN was the top performer, attaining an F1 score of 0.9549 and an AUPRC of 0.9846 on the held-out test set. The integration of graph-based and morphological features proved highly effective, with CG-JKNN and XGBoost showing promising results in classifying instances into AFB and activated macrophage nucleus. The features identified as significant by our models closely align with the criteria used by pathologists in practice, highlighting the clinical applicability of our approach. Future work will explore knowledge distillation techniques and graph-level classification into distinct TB progression categories.https://ieeexplore.ieee.org/document/10416836/Acid-fast bacillicell graphsconvolutional neural networkgranulomajumping knowledge neural networkpulmonary tuberculosis
spellingShingle Vasundhara Acharya
Diana Choi
Bulent Yener
Gillian Beamer
Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network
IEEE Access
Acid-fast bacilli
cell graphs
convolutional neural network
granuloma
jumping knowledge neural network
pulmonary tuberculosis
title Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network
title_full Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network
title_fullStr Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network
title_full_unstemmed Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network
title_short Prediction of Tuberculosis From Lung Tissue Images of Diversity Outbred Mice Using Jump Knowledge Based Cell Graph Neural Network
title_sort prediction of tuberculosis from lung tissue images of diversity outbred mice using jump knowledge based cell graph neural network
topic Acid-fast bacilli
cell graphs
convolutional neural network
granuloma
jumping knowledge neural network
pulmonary tuberculosis
url https://ieeexplore.ieee.org/document/10416836/
work_keys_str_mv AT vasundharaacharya predictionoftuberculosisfromlungtissueimagesofdiversityoutbredmiceusingjumpknowledgebasedcellgraphneuralnetwork
AT dianachoi predictionoftuberculosisfromlungtissueimagesofdiversityoutbredmiceusingjumpknowledgebasedcellgraphneuralnetwork
AT bulentyener predictionoftuberculosisfromlungtissueimagesofdiversityoutbredmiceusingjumpknowledgebasedcellgraphneuralnetwork
AT gillianbeamer predictionoftuberculosisfromlungtissueimagesofdiversityoutbredmiceusingjumpknowledgebasedcellgraphneuralnetwork