Robust framework for COVID-19 identication from a multicenter dataset of chest CT scans.

The main objective of this study is to develop a robust deep learning-based framework to distinguish COVID-19, Community-Acquired Pneumonia (CAP), and Normal cases based on volumetric chest CT scans, which are acquired in different imaging centers using different scanners and technical settings. We...

Full description

Bibliographic Details
Main Authors: Sadaf Khademi, Shahin Heidarian, Parnian Afshar, Nastaran Enshaei, Farnoosh Naderkhani, Moezedin Javad Rafiee, Anastasia Oikonomou, Akbar Shafiee, Faranak Babaki Fard, Konstantinos N Plataniotis, Arash Mohammadi
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0282121
_version_ 1827963330674819072
author Sadaf Khademi
Shahin Heidarian
Parnian Afshar
Nastaran Enshaei
Farnoosh Naderkhani
Moezedin Javad Rafiee
Anastasia Oikonomou
Akbar Shafiee
Faranak Babaki Fard
Konstantinos N Plataniotis
Arash Mohammadi
author_facet Sadaf Khademi
Shahin Heidarian
Parnian Afshar
Nastaran Enshaei
Farnoosh Naderkhani
Moezedin Javad Rafiee
Anastasia Oikonomou
Akbar Shafiee
Faranak Babaki Fard
Konstantinos N Plataniotis
Arash Mohammadi
author_sort Sadaf Khademi
collection DOAJ
description The main objective of this study is to develop a robust deep learning-based framework to distinguish COVID-19, Community-Acquired Pneumonia (CAP), and Normal cases based on volumetric chest CT scans, which are acquired in different imaging centers using different scanners and technical settings. We demonstrated that while our proposed model is trained on a relatively small dataset acquired from only one imaging center using a specific scanning protocol, it performs well on heterogeneous test sets obtained by multiple scanners using different technical parameters. We also showed that the model can be updated via an unsupervised approach to cope with the data shift between the train and test sets and enhance the robustness of the model upon receiving a new external dataset from a different center. More specifically, we extracted the subset of the test images for which the model generated a confident prediction and used the extracted subset along with the training set to retrain and update the benchmark model (the model trained on the initial train set). Finally, we adopted an ensemble architecture to aggregate the predictions from multiple versions of the model. For initial training and development purposes, an in-house dataset of 171 COVID-19, 60 CAP, and 76 Normal cases was used, which contained volumetric CT scans acquired from one imaging center using a single scanning protocol and standard radiation dose. To evaluate the model, we collected four different test sets retrospectively to investigate the effects of the shifts in the data characteristics on the model's performance. Among the test cases, there were CT scans with similar characteristics as the train set as well as noisy low-dose and ultra-low-dose CT scans. In addition, some test CT scans were obtained from patients with a history of cardiovascular diseases or surgeries. This dataset is referred to as the "SPGC-COVID" dataset. The entire test dataset used in this study contains 51 COVID-19, 28 CAP, and 51 Normal cases. Experimental results indicate that our proposed framework performs well on all test sets achieving total accuracy of 96.15% (95%CI: [91.25-98.74]), COVID-19 sensitivity of 96.08% (95%CI: [86.54-99.5]), CAP sensitivity of 92.86% (95%CI: [76.50-99.19]), Normal sensitivity of 98.04% (95%CI: [89.55-99.95]) while the confidence intervals are obtained using the significance level of 0.05. The obtained AUC values (One class vs Others) are 0.993 (95%CI: [0.977-1]), 0.989 (95%CI: [0.962-1]), and 0.990 (95%CI: [0.971-1]) for COVID-19, CAP, and Normal classes, respectively. The experimental results also demonstrate the capability of the proposed unsupervised enhancement approach in improving the performance and robustness of the model when being evaluated on varied external test sets.
first_indexed 2024-04-09T16:57:26Z
format Article
id doaj.art-d214701340fc4ff0a0643df8cee84cff
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-09T16:57:26Z
publishDate 2023-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-d214701340fc4ff0a0643df8cee84cff2023-04-21T05:35:15ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-01183e028212110.1371/journal.pone.0282121Robust framework for COVID-19 identication from a multicenter dataset of chest CT scans.Sadaf KhademiShahin HeidarianParnian AfsharNastaran EnshaeiFarnoosh NaderkhaniMoezedin Javad RafieeAnastasia OikonomouAkbar ShafieeFaranak Babaki FardKonstantinos N PlataniotisArash MohammadiThe main objective of this study is to develop a robust deep learning-based framework to distinguish COVID-19, Community-Acquired Pneumonia (CAP), and Normal cases based on volumetric chest CT scans, which are acquired in different imaging centers using different scanners and technical settings. We demonstrated that while our proposed model is trained on a relatively small dataset acquired from only one imaging center using a specific scanning protocol, it performs well on heterogeneous test sets obtained by multiple scanners using different technical parameters. We also showed that the model can be updated via an unsupervised approach to cope with the data shift between the train and test sets and enhance the robustness of the model upon receiving a new external dataset from a different center. More specifically, we extracted the subset of the test images for which the model generated a confident prediction and used the extracted subset along with the training set to retrain and update the benchmark model (the model trained on the initial train set). Finally, we adopted an ensemble architecture to aggregate the predictions from multiple versions of the model. For initial training and development purposes, an in-house dataset of 171 COVID-19, 60 CAP, and 76 Normal cases was used, which contained volumetric CT scans acquired from one imaging center using a single scanning protocol and standard radiation dose. To evaluate the model, we collected four different test sets retrospectively to investigate the effects of the shifts in the data characteristics on the model's performance. Among the test cases, there were CT scans with similar characteristics as the train set as well as noisy low-dose and ultra-low-dose CT scans. In addition, some test CT scans were obtained from patients with a history of cardiovascular diseases or surgeries. This dataset is referred to as the "SPGC-COVID" dataset. The entire test dataset used in this study contains 51 COVID-19, 28 CAP, and 51 Normal cases. Experimental results indicate that our proposed framework performs well on all test sets achieving total accuracy of 96.15% (95%CI: [91.25-98.74]), COVID-19 sensitivity of 96.08% (95%CI: [86.54-99.5]), CAP sensitivity of 92.86% (95%CI: [76.50-99.19]), Normal sensitivity of 98.04% (95%CI: [89.55-99.95]) while the confidence intervals are obtained using the significance level of 0.05. The obtained AUC values (One class vs Others) are 0.993 (95%CI: [0.977-1]), 0.989 (95%CI: [0.962-1]), and 0.990 (95%CI: [0.971-1]) for COVID-19, CAP, and Normal classes, respectively. The experimental results also demonstrate the capability of the proposed unsupervised enhancement approach in improving the performance and robustness of the model when being evaluated on varied external test sets.https://doi.org/10.1371/journal.pone.0282121
spellingShingle Sadaf Khademi
Shahin Heidarian
Parnian Afshar
Nastaran Enshaei
Farnoosh Naderkhani
Moezedin Javad Rafiee
Anastasia Oikonomou
Akbar Shafiee
Faranak Babaki Fard
Konstantinos N Plataniotis
Arash Mohammadi
Robust framework for COVID-19 identication from a multicenter dataset of chest CT scans.
PLoS ONE
title Robust framework for COVID-19 identication from a multicenter dataset of chest CT scans.
title_full Robust framework for COVID-19 identication from a multicenter dataset of chest CT scans.
title_fullStr Robust framework for COVID-19 identication from a multicenter dataset of chest CT scans.
title_full_unstemmed Robust framework for COVID-19 identication from a multicenter dataset of chest CT scans.
title_short Robust framework for COVID-19 identication from a multicenter dataset of chest CT scans.
title_sort robust framework for covid 19 identication from a multicenter dataset of chest ct scans
url https://doi.org/10.1371/journal.pone.0282121
work_keys_str_mv AT sadafkhademi robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT shahinheidarian robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT parnianafshar robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT nastaranenshaei robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT farnooshnaderkhani robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT moezedinjavadrafiee robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT anastasiaoikonomou robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT akbarshafiee robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT faranakbabakifard robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT konstantinosnplataniotis robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans
AT arashmohammadi robustframeworkforcovid19identicationfromamulticenterdatasetofchestctscans