Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images
Lung cancer is the leading cause of cancer death and morbidity worldwide. Many studies have shown machine learning models to be effective in detecting lung nodules from chest X-ray images. However, these techniques have yet to be embraced by the medical community due to several practical, ethical, a...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-10-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/21/19/6655 |
_version_ | 1797515743571476480 |
---|---|
author | Michael Horry Subrata Chakraborty Biswajeet Pradhan Manoranjan Paul Douglas Gomes Anwaar Ul-Haq Abdullah Alamri |
author_facet | Michael Horry Subrata Chakraborty Biswajeet Pradhan Manoranjan Paul Douglas Gomes Anwaar Ul-Haq Abdullah Alamri |
author_sort | Michael Horry |
collection | DOAJ |
description | Lung cancer is the leading cause of cancer death and morbidity worldwide. Many studies have shown machine learning models to be effective in detecting lung nodules from chest X-ray images. However, these techniques have yet to be embraced by the medical community due to several practical, ethical, and regulatory constraints stemming from the “black-box” nature of deep learning models. Additionally, most lung nodules visible on chest X-rays are benign; therefore, the narrow task of computer vision-based lung nodule detection cannot be equated to automated lung cancer detection. Addressing both concerns, this study introduces a novel hybrid deep learning and decision tree-based computer vision model, which presents lung cancer malignancy predictions as interpretable decision trees. The deep learning component of this process is trained using a large publicly available dataset on pathological biomarkers associated with lung cancer. These models are then used to inference biomarker scores for chest X-ray images from two independent data sets, for which malignancy metadata is available. Next, multi-variate predictive models were mined by fitting shallow decision trees to the malignancy stratified datasets and interrogating a range of metrics to determine the best model. The best decision tree model achieved sensitivity and specificity of 86.7% and 80.0%, respectively, with a positive predictive value of 92.9%. Decision trees mined using this method may be considered as a starting point for refinement into clinically useful multi-variate lung cancer malignancy models for implementation as a workflow augmentation tool to improve the efficiency of human radiologists. |
first_indexed | 2024-03-10T06:51:35Z |
format | Article |
id | doaj.art-e12f67114ac04e7f923f2fd7f13bfdef |
institution | Directory Open Access Journal |
issn | 1424-8220 |
language | English |
last_indexed | 2024-03-10T06:51:35Z |
publishDate | 2021-10-01 |
publisher | MDPI AG |
record_format | Article |
series | Sensors |
spelling | doaj.art-e12f67114ac04e7f923f2fd7f13bfdef2023-11-22T16:49:24ZengMDPI AGSensors1424-82202021-10-012119665510.3390/s21196655Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray ImagesMichael Horry0Subrata Chakraborty1Biswajeet Pradhan2Manoranjan Paul3Douglas Gomes4Anwaar Ul-Haq5Abdullah Alamri6Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW 2007, AustraliaCentre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW 2007, AustraliaCentre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW 2007, AustraliaMachine Vision and Digital Health (MaViDH), School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW 2795, AustraliaMachine Vision and Digital Health (MaViDH), School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW 2795, AustraliaMachine Vision and Digital Health (MaViDH), School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW 2795, AustraliaDepartment of Geology and Geophysics, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi ArabiaLung cancer is the leading cause of cancer death and morbidity worldwide. Many studies have shown machine learning models to be effective in detecting lung nodules from chest X-ray images. However, these techniques have yet to be embraced by the medical community due to several practical, ethical, and regulatory constraints stemming from the “black-box” nature of deep learning models. Additionally, most lung nodules visible on chest X-rays are benign; therefore, the narrow task of computer vision-based lung nodule detection cannot be equated to automated lung cancer detection. Addressing both concerns, this study introduces a novel hybrid deep learning and decision tree-based computer vision model, which presents lung cancer malignancy predictions as interpretable decision trees. The deep learning component of this process is trained using a large publicly available dataset on pathological biomarkers associated with lung cancer. These models are then used to inference biomarker scores for chest X-ray images from two independent data sets, for which malignancy metadata is available. Next, multi-variate predictive models were mined by fitting shallow decision trees to the malignancy stratified datasets and interrogating a range of metrics to determine the best model. The best decision tree model achieved sensitivity and specificity of 86.7% and 80.0%, respectively, with a positive predictive value of 92.9%. Decision trees mined using this method may be considered as a starting point for refinement into clinically useful multi-variate lung cancer malignancy models for implementation as a workflow augmentation tool to improve the efficiency of human radiologists.https://www.mdpi.com/1424-8220/21/19/6655lung cancerchest X-raymalignancy predictive modelsartificial intelligencemachine learningcomputer vision |
spellingShingle | Michael Horry Subrata Chakraborty Biswajeet Pradhan Manoranjan Paul Douglas Gomes Anwaar Ul-Haq Abdullah Alamri Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images Sensors lung cancer chest X-ray malignancy predictive models artificial intelligence machine learning computer vision |
title | Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images |
title_full | Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images |
title_fullStr | Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images |
title_full_unstemmed | Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images |
title_short | Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images |
title_sort | deep mining generation of lung cancer malignancy models from chest x ray images |
topic | lung cancer chest X-ray malignancy predictive models artificial intelligence machine learning computer vision |
url | https://www.mdpi.com/1424-8220/21/19/6655 |
work_keys_str_mv | AT michaelhorry deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages AT subratachakraborty deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages AT biswajeetpradhan deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages AT manoranjanpaul deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages AT douglasgomes deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages AT anwaarulhaq deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages AT abdullahalamri deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages |