Comparison of Machine-Learning and Deep-Learning Methods for the Prediction of Osteoradionecrosis Resulting From Head and Neck Cancer Radiation Therapy

Purpose: Deep-learning (DL) techniques have been successful in disease-prediction tasks and could improve the prediction of mandible osteoradionecrosis (ORN) resulting from head and neck cancer (HNC) radiation therapy. In this study, we retrospectively compared the performance of DL algorithms and t...


Bibliographic Details
Main Authors:	Brandon Reber, BS; Lisanne Van Dijk, PhD; Brian Anderson, PhD; Abdallah Sherif Radwan Mohamed, MD, PhD; Clifton Fuller, MD, PhD; Stephen Lai, MD, PhD; Kristy Brock, PhD
Format:	Article
Language:	English
Published:	Elsevier 2023-07-01
Series:	Advances in Radiation Oncology
Online Access:	http://www.sciencedirect.com/science/article/pii/S245210942200269X

author	Brandon Reber, BS; Lisanne Van Dijk, PhD; Brian Anderson, PhD; Abdallah Sherif Radwan Mohamed, MD, PhD; Clifton Fuller, MD, PhD; Stephen Lai, MD, PhD; Kristy Brock, PhD
author_sort	Brandon Reber, BS
collection	DOAJ
description	Purpose: Deep-learning (DL) techniques have been successful in disease-prediction tasks and could improve the prediction of mandible osteoradionecrosis (ORN) resulting from head and neck cancer (HNC) radiation therapy. In this study, we retrospectively compared the performance of DL algorithms and traditional machine-learning (ML) techniques to predict mandible ORN binary outcome in an extensive cohort of patients with HNC. Methods and Materials: Patients who received HNC radiation therapy at the University of Texas MD Anderson Cancer Center from 2005 to 2015 were identified for the ML (n = 1259) and DL (n = 1236) studies. The subjects were followed for ORN development for at least 12 months, with 173 developing ORN and 1086 having no evidence of ORN. The ML models used dose-volume histogram parameters to predict ORN development. These models included logistic regression, random forest, support vector machine, and a random classifier reference. The DL models were based on ResNet, DenseNet, and autoencoder-based architectures. The DL models used each participant's dose cropped to the mandible. The effect of increasing the amount of available training data on the DL models’ prediction performance was evaluated by training the DL models using increasing ratios of the original training data. Results: The F1 score for the logistic regression model, the best-performing ML model, was 0.3. The best-performing ResNet, DenseNet, and autoencoder-based models had F1 scores of 0.07, 0.14, and 0.23, respectively, whereas the random classifier's F1 score was 0.17. No performance increase was apparent when we increased the amount of training data available for DL model training. Conclusions: The ML models had superior performance to their DL counterparts. The lack of improvement in DL performance with increased training data suggests that either more data are needed for appropriate DL model construction or that the image features used in DL models are not suitable for this task.
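The F1 scores in the description are easier to read against the cohort's class imbalance (173 ORN cases out of 1259 patients in the ML study). The sketch below is an illustration only, not the authors' evaluation code: it computes F1 from confusion counts and the F1 a purely random labeler would reach at its expected counts. The function names and the two positive-labeling rates are assumptions made for this example. The record does not say how the random classifier reference was built or which data split the reported scores come from, so the values here are not expected to reproduce the reported 0.17.

```python
def f1_from_counts(tp: float, fp: float, fn: float) -> float:
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def random_f1_at_expected_counts(n_pos: int, n_neg: int, positive_rate: float) -> float:
    """F1 evaluated at the expected confusion counts of a classifier that
    labels each patient positive with probability `positive_rate`,
    independently of the true outcome (hypothetical reference model)."""
    tp = n_pos * positive_rate          # expected true positives
    fp = n_neg * positive_rate          # expected false positives
    fn = n_pos * (1.0 - positive_rate)  # expected false negatives
    return f1_from_counts(tp, fp, fn)


# Cohort sizes taken from the description (ML study).
N_ORN, N_NO_ORN = 173, 1086
prevalence = N_ORN / (N_ORN + N_NO_ORN)

# Two assumed ways of guessing at random: a fair coin, or matching prevalence.
coin_flip_f1 = random_f1_at_expected_counts(N_ORN, N_NO_ORN, 0.5)
prevalence_f1 = random_f1_at_expected_counts(N_ORN, N_NO_ORN, prevalence)

if __name__ == "__main__":
    print(f"prevalence: {prevalence:.3f}")
    print(f"coin-flip random F1: {coin_flip_f1:.3f}")
    print(f"prevalence-matched random F1: {prevalence_f1:.3f}")
```

With roughly 14% positive cases, random guessing alone yields a nonzero F1, which is why the reported model scores are compared against a random reference rather than against zero.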
first_indexed	2024-04-10T17:45:33Z
format	Article
id	doaj.art-8de3d5b32ac343cca629c88b5409c8d8
institution	Directory of Open Access Journals
issn	2452-1094
language	English
last_indexed	2024-04-10T17:45:33Z
publishDate	2023-07-01
publisher	Elsevier
record_format	Article
series	Advances in Radiation Oncology
spelling	doaj.art-8de3d5b32ac343cca629c88b5409c8d8 | 2023-02-03T05:00:38Z | eng | Elsevier | Advances in Radiation Oncology | 2452-1094 | 2023-07-01 | 8 | 4 | 101163 | Comparison of Machine-Learning and Deep-Learning Methods for the Prediction of Osteoradionecrosis Resulting From Head and Neck Cancer Radiation Therapy | Brandon Reber, BS (Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas; corresponding author); Lisanne Van Dijk, PhD (Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas; University of Groningen, Groningen, Netherlands); Brian Anderson, PhD (Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas; University of California, San Diego, San Diego, California); Abdallah Sherif Radwan Mohamed, MD, PhD (Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas); Clifton Fuller, MD, PhD (Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas); Stephen Lai, MD, PhD (Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas); Kristy Brock, PhD (Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas) | Purpose: Deep-learning (DL) techniques have been successful in disease-prediction tasks and could improve the prediction of mandible osteoradionecrosis (ORN) resulting from head and neck cancer (HNC) radiation therapy. In this study, we retrospectively compared the performance of DL algorithms and traditional machine-learning (ML) techniques to predict mandible ORN binary outcome in an extensive cohort of patients with HNC. Methods and Materials: Patients who received HNC radiation therapy at the University of Texas MD Anderson Cancer Center from 2005 to 2015 were identified for the ML (n = 1259) and DL (n = 1236) studies. The subjects were followed for ORN development for at least 12 months, with 173 developing ORN and 1086 having no evidence of ORN. The ML models used dose-volume histogram parameters to predict ORN development. These models included logistic regression, random forest, support vector machine, and a random classifier reference. The DL models were based on ResNet, DenseNet, and autoencoder-based architectures. The DL models used each participant's dose cropped to the mandible. The effect of increasing the amount of available training data on the DL models’ prediction performance was evaluated by training the DL models using increasing ratios of the original training data. Results: The F1 score for the logistic regression model, the best-performing ML model, was 0.3. The best-performing ResNet, DenseNet, and autoencoder-based models had F1 scores of 0.07, 0.14, and 0.23, respectively, whereas the random classifier's F1 score was 0.17. No performance increase was apparent when we increased the amount of training data available for DL model training. Conclusions: The ML models had superior performance to their DL counterparts. The lack of improvement in DL performance with increased training data suggests that either more data are needed for appropriate DL model construction or that the image features used in DL models are not suitable for this task. | http://www.sciencedirect.com/science/article/pii/S245210942200269X
title	Comparison of Machine-Learning and Deep-Learning Methods for the Prediction of Osteoradionecrosis Resulting From Head and Neck Cancer Radiation Therapy
url	http://www.sciencedirect.com/science/article/pii/S245210942200269X

Similar Items