Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images

Lung cancer is the leading cause of cancer death and morbidity worldwide. Many studies have shown machine learning models to be effective in detecting lung nodules from chest X-ray images. However, these techniques have yet to be embraced by the medical community due to several practical, ethical, a...

Full description

Bibliographic Details
Main Authors: Michael Horry, Subrata Chakraborty, Biswajeet Pradhan, Manoranjan Paul, Douglas Gomes, Anwaar Ul-Haq, Abdullah Alamri
Format: Article
Language:English
Published: MDPI AG 2021-10-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/21/19/6655
_version_ 1797515743571476480
author Michael Horry
Subrata Chakraborty
Biswajeet Pradhan
Manoranjan Paul
Douglas Gomes
Anwaar Ul-Haq
Abdullah Alamri
author_facet Michael Horry
Subrata Chakraborty
Biswajeet Pradhan
Manoranjan Paul
Douglas Gomes
Anwaar Ul-Haq
Abdullah Alamri
author_sort Michael Horry
collection DOAJ
description Lung cancer is the leading cause of cancer death and morbidity worldwide. Many studies have shown machine learning models to be effective in detecting lung nodules from chest X-ray images. However, these techniques have yet to be embraced by the medical community due to several practical, ethical, and regulatory constraints stemming from the “black-box” nature of deep learning models. Additionally, most lung nodules visible on chest X-rays are benign; therefore, the narrow task of computer vision-based lung nodule detection cannot be equated to automated lung cancer detection. Addressing both concerns, this study introduces a novel hybrid deep learning and decision tree-based computer vision model, which presents lung cancer malignancy predictions as interpretable decision trees. The deep learning component of this process is trained using a large publicly available dataset on pathological biomarkers associated with lung cancer. These models are then used to inference biomarker scores for chest X-ray images from two independent data sets, for which malignancy metadata is available. Next, multi-variate predictive models were mined by fitting shallow decision trees to the malignancy stratified datasets and interrogating a range of metrics to determine the best model. The best decision tree model achieved sensitivity and specificity of 86.7% and 80.0%, respectively, with a positive predictive value of 92.9%. Decision trees mined using this method may be considered as a starting point for refinement into clinically useful multi-variate lung cancer malignancy models for implementation as a workflow augmentation tool to improve the efficiency of human radiologists.
first_indexed 2024-03-10T06:51:35Z
format Article
id doaj.art-e12f67114ac04e7f923f2fd7f13bfdef
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T06:51:35Z
publishDate 2021-10-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-e12f67114ac04e7f923f2fd7f13bfdef2023-11-22T16:49:24ZengMDPI AGSensors1424-82202021-10-012119665510.3390/s21196655Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray ImagesMichael Horry0Subrata Chakraborty1Biswajeet Pradhan2Manoranjan Paul3Douglas Gomes4Anwaar Ul-Haq5Abdullah Alamri6Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW 2007, AustraliaCentre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW 2007, AustraliaCentre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW 2007, AustraliaMachine Vision and Digital Health (MaViDH), School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW 2795, AustraliaMachine Vision and Digital Health (MaViDH), School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW 2795, AustraliaMachine Vision and Digital Health (MaViDH), School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW 2795, AustraliaDepartment of Geology and Geophysics, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi ArabiaLung cancer is the leading cause of cancer death and morbidity worldwide. Many studies have shown machine learning models to be effective in detecting lung nodules from chest X-ray images. However, these techniques have yet to be embraced by the medical community due to several practical, ethical, and regulatory constraints stemming from the “black-box” nature of deep learning models. Additionally, most lung nodules visible on chest X-rays are benign; therefore, the narrow task of computer vision-based lung nodule detection cannot be equated to automated lung cancer detection. Addressing both concerns, this study introduces a novel hybrid deep learning and decision tree-based computer vision model, which presents lung cancer malignancy predictions as interpretable decision trees. The deep learning component of this process is trained using a large publicly available dataset on pathological biomarkers associated with lung cancer. These models are then used to inference biomarker scores for chest X-ray images from two independent data sets, for which malignancy metadata is available. Next, multi-variate predictive models were mined by fitting shallow decision trees to the malignancy stratified datasets and interrogating a range of metrics to determine the best model. The best decision tree model achieved sensitivity and specificity of 86.7% and 80.0%, respectively, with a positive predictive value of 92.9%. Decision trees mined using this method may be considered as a starting point for refinement into clinically useful multi-variate lung cancer malignancy models for implementation as a workflow augmentation tool to improve the efficiency of human radiologists.https://www.mdpi.com/1424-8220/21/19/6655lung cancerchest X-raymalignancy predictive modelsartificial intelligencemachine learningcomputer vision
spellingShingle Michael Horry
Subrata Chakraborty
Biswajeet Pradhan
Manoranjan Paul
Douglas Gomes
Anwaar Ul-Haq
Abdullah Alamri
Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images
Sensors
lung cancer
chest X-ray
malignancy predictive models
artificial intelligence
machine learning
computer vision
title Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images
title_full Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images
title_fullStr Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images
title_full_unstemmed Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images
title_short Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images
title_sort deep mining generation of lung cancer malignancy models from chest x ray images
topic lung cancer
chest X-ray
malignancy predictive models
artificial intelligence
machine learning
computer vision
url https://www.mdpi.com/1424-8220/21/19/6655
work_keys_str_mv AT michaelhorry deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages
AT subratachakraborty deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages
AT biswajeetpradhan deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages
AT manoranjanpaul deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages
AT douglasgomes deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages
AT anwaarulhaq deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages
AT abdullahalamri deepmininggenerationoflungcancermalignancymodelsfromchestxrayimages