Ensemble Deep Learning for the Detection of COVID-19 in Unbalanced Chest X-ray Dataset

The ongoing COVID-19 pandemic has caused devastating effects on humanity worldwide. With practical advantages and wide accessibility, chest X-rays (CXRs) play vital roles in the diagnosis of COVID-19 and the evaluation of the extent of lung damages incurred by the virus. This study aimed to leverage...

Full description

Bibliographic Details
Main Authors: Khin Yadanar Win, Noppadol Maneerat, Syna Sreng, Kazuhiko Hamamoto
Format: Article
Language:English
Published: MDPI AG 2021-11-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/22/10528
_version_ 1797511346386894848
author Khin Yadanar Win
Noppadol Maneerat
Syna Sreng
Kazuhiko Hamamoto
author_facet Khin Yadanar Win
Noppadol Maneerat
Syna Sreng
Kazuhiko Hamamoto
author_sort Khin Yadanar Win
collection DOAJ
description The ongoing COVID-19 pandemic has caused devastating effects on humanity worldwide. With practical advantages and wide accessibility, chest X-rays (CXRs) play vital roles in the diagnosis of COVID-19 and the evaluation of the extent of lung damages incurred by the virus. This study aimed to leverage deep-learning-based methods toward the automated classification of COVID-19 from normal and viral pneumonia on CXRs, and the identification of indicative regions of COVID-19 biomarkers. Initially, we preprocessed and segmented the lung regions usingDeepLabV3+ method, and subsequently cropped the lung regions. The cropped lung regions were used as inputs to several deep convolutional neural networks (CNNs) for the prediction of COVID-19. The dataset was highly unbalanced; the vast majority were normal images, with a small number of COVID-19 and pneumonia images. To remedy the unbalanced distribution and to avoid biased classification results, we applied five different approaches: (i) balancing the class using weighted loss; (ii) image augmentation to add more images to minority cases; (iii) the undersampling of majority classes; (iv) the oversampling of minority classes; and (v) a hybrid resampling approach of oversampling and undersampling. The best-performing methods from each approach were combined as the ensemble classifier using two voting strategies. Finally, we used the saliency map of CNNs to identify the indicative regions of COVID-19 biomarkers which are deemed useful for interpretability. The algorithms were evaluated using the largest publicly available COVID-19 dataset. An ensemble of the top five CNNs with image augmentation achieved the highest accuracy of 99.23% and area under curve (AUC) of 99.97%, surpassing the results of previous studies.
first_indexed 2024-03-10T05:44:02Z
format Article
id doaj.art-4cb34389d3d5420d848a1865b585639c
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T05:44:02Z
publishDate 2021-11-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-4cb34389d3d5420d848a1865b585639c2023-11-22T22:15:05ZengMDPI AGApplied Sciences2076-34172021-11-0111221052810.3390/app112210528Ensemble Deep Learning for the Detection of COVID-19 in Unbalanced Chest X-ray DatasetKhin Yadanar Win0Noppadol Maneerat1Syna Sreng2Kazuhiko Hamamoto3King Mongkut’s Institute of Technology Ladkrabang, School of Engineering, Bangkok 10520, ThailandKing Mongkut’s Institute of Technology Ladkrabang, School of Engineering, Bangkok 10520, ThailandKing Mongkut’s Institute of Technology Ladkrabang, School of Engineering, Bangkok 10520, ThailandSchool of Information and Telecommunication Engineering, Tokai University, Tokyo 108-8619, JapanThe ongoing COVID-19 pandemic has caused devastating effects on humanity worldwide. With practical advantages and wide accessibility, chest X-rays (CXRs) play vital roles in the diagnosis of COVID-19 and the evaluation of the extent of lung damages incurred by the virus. This study aimed to leverage deep-learning-based methods toward the automated classification of COVID-19 from normal and viral pneumonia on CXRs, and the identification of indicative regions of COVID-19 biomarkers. Initially, we preprocessed and segmented the lung regions usingDeepLabV3+ method, and subsequently cropped the lung regions. The cropped lung regions were used as inputs to several deep convolutional neural networks (CNNs) for the prediction of COVID-19. The dataset was highly unbalanced; the vast majority were normal images, with a small number of COVID-19 and pneumonia images. To remedy the unbalanced distribution and to avoid biased classification results, we applied five different approaches: (i) balancing the class using weighted loss; (ii) image augmentation to add more images to minority cases; (iii) the undersampling of majority classes; (iv) the oversampling of minority classes; and (v) a hybrid resampling approach of oversampling and undersampling. The best-performing methods from each approach were combined as the ensemble classifier using two voting strategies. Finally, we used the saliency map of CNNs to identify the indicative regions of COVID-19 biomarkers which are deemed useful for interpretability. The algorithms were evaluated using the largest publicly available COVID-19 dataset. An ensemble of the top five CNNs with image augmentation achieved the highest accuracy of 99.23% and area under curve (AUC) of 99.97%, surpassing the results of previous studies.https://www.mdpi.com/2076-3417/11/22/10528COVID-19chest X-raysdeep learningensemble learningimage augmentationoversampling
spellingShingle Khin Yadanar Win
Noppadol Maneerat
Syna Sreng
Kazuhiko Hamamoto
Ensemble Deep Learning for the Detection of COVID-19 in Unbalanced Chest X-ray Dataset
Applied Sciences
COVID-19
chest X-rays
deep learning
ensemble learning
image augmentation
oversampling
title Ensemble Deep Learning for the Detection of COVID-19 in Unbalanced Chest X-ray Dataset
title_full Ensemble Deep Learning for the Detection of COVID-19 in Unbalanced Chest X-ray Dataset
title_fullStr Ensemble Deep Learning for the Detection of COVID-19 in Unbalanced Chest X-ray Dataset
title_full_unstemmed Ensemble Deep Learning for the Detection of COVID-19 in Unbalanced Chest X-ray Dataset
title_short Ensemble Deep Learning for the Detection of COVID-19 in Unbalanced Chest X-ray Dataset
title_sort ensemble deep learning for the detection of covid 19 in unbalanced chest x ray dataset
topic COVID-19
chest X-rays
deep learning
ensemble learning
image augmentation
oversampling
url https://www.mdpi.com/2076-3417/11/22/10528
work_keys_str_mv AT khinyadanarwin ensembledeeplearningforthedetectionofcovid19inunbalancedchestxraydataset
AT noppadolmaneerat ensembledeeplearningforthedetectionofcovid19inunbalancedchestxraydataset
AT synasreng ensembledeeplearningforthedetectionofcovid19inunbalancedchestxraydataset
AT kazuhikohamamoto ensembledeeplearningforthedetectionofcovid19inunbalancedchestxraydataset