Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patients

Background and purpose: Head and neck cancer (HNC) patients treated with radiotherapy often suffer from radiation-induced toxicities. Normal Tissue Complication Probability (NTCP) modeling can be used to determine the probability to develop these toxicities based on patient, tumor, treatment and dos...

Full description

Bibliographic Details
Main Authors: I. Spiero, E. Schuit, O.B. Wijers, F.J.P. Hoebers, J.A. Langendijk, A.M. Leeuwenberg
Format: Article
Language:English
Published: Elsevier 2023-11-01
Series:Clinical and Translational Radiation Oncology
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405630823001027
_version_ 1797642536410415104
author I. Spiero
E. Schuit
O.B. Wijers
F.J.P. Hoebers
J.A. Langendijk
A.M. Leeuwenberg
author_facet I. Spiero
E. Schuit
O.B. Wijers
F.J.P. Hoebers
J.A. Langendijk
A.M. Leeuwenberg
author_sort I. Spiero
collection DOAJ
description Background and purpose: Head and neck cancer (HNC) patients treated with radiotherapy often suffer from radiation-induced toxicities. Normal Tissue Complication Probability (NTCP) modeling can be used to determine the probability to develop these toxicities based on patient, tumor, treatment and dose characteristics. Since the currently used NTCP models are developed using supervised methods that discard unlabeled patient data, we assessed whether the addition of unlabeled patient data by using semi-supervised modeling would gain predictive performance. Materials and methods: The semi-supervised method of self-training was compared to supervised regression methods with and without prior multiple imputation by chained equation (MICE). The models were developed for the most common toxicity outcomes in HNC patients, xerostomia (dry mouth) and dysphagia (difficulty swallowing), measured at six months after treatment, in a development cohort of 750 HNC patients. The models were externally validated in a validation cohort of 395 HNC patients. Model performance was assessed by discrimination and calibration. Results: MICE and self-training did not improve performance in terms of discrimination or calibration at external validation compared to current regression models. In addition, the relative performance of the different models did not change upon a decrease in the amount of (labeled) data available for model development. Models using ridge regression outperformed the logistic models for the dysphagia outcome. Conclusion: Since there was no apparent gain in the addition of unlabeled patient data by using the semi-supervised method of self-training or MICE, the supervised regression models would still be preferred in current NTCP modeling for HNC patients.
first_indexed 2024-03-11T14:02:25Z
format Article
id doaj.art-d66df0edb9e449aab8711281e300be43
institution Directory Open Access Journal
issn 2405-6308
language English
last_indexed 2024-03-11T14:02:25Z
publishDate 2023-11-01
publisher Elsevier
record_format Article
series Clinical and Translational Radiation Oncology
spelling doaj.art-d66df0edb9e449aab8711281e300be432023-11-02T04:13:38ZengElsevierClinical and Translational Radiation Oncology2405-63082023-11-0143100677Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patientsI. Spiero0E. Schuit1O.B. Wijers2F.J.P. Hoebers3J.A. Langendijk4A.M. Leeuwenberg5Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands; Corresponding author at: Str. 5.128, Universiteitsweg 100, 3508 GA Utrecht, The Netherlands.Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the NetherlandsRadiotherapeutic Institute Friesland, Leeuwarden, the NetherlandsDepartment of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, the NetherlandsDepartment of Radiation Oncology, University of Groningen, University Medical Center Groningen, Groningen, the NetherlandsJulius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the NetherlandsBackground and purpose: Head and neck cancer (HNC) patients treated with radiotherapy often suffer from radiation-induced toxicities. Normal Tissue Complication Probability (NTCP) modeling can be used to determine the probability to develop these toxicities based on patient, tumor, treatment and dose characteristics. Since the currently used NTCP models are developed using supervised methods that discard unlabeled patient data, we assessed whether the addition of unlabeled patient data by using semi-supervised modeling would gain predictive performance. Materials and methods: The semi-supervised method of self-training was compared to supervised regression methods with and without prior multiple imputation by chained equation (MICE). The models were developed for the most common toxicity outcomes in HNC patients, xerostomia (dry mouth) and dysphagia (difficulty swallowing), measured at six months after treatment, in a development cohort of 750 HNC patients. The models were externally validated in a validation cohort of 395 HNC patients. Model performance was assessed by discrimination and calibration. Results: MICE and self-training did not improve performance in terms of discrimination or calibration at external validation compared to current regression models. In addition, the relative performance of the different models did not change upon a decrease in the amount of (labeled) data available for model development. Models using ridge regression outperformed the logistic models for the dysphagia outcome. Conclusion: Since there was no apparent gain in the addition of unlabeled patient data by using the semi-supervised method of self-training or MICE, the supervised regression models would still be preferred in current NTCP modeling for HNC patients.http://www.sciencedirect.com/science/article/pii/S2405630823001027Head and neck cancerRadiation-induced toxicityNTCP modelingSemi-supervised learning
spellingShingle I. Spiero
E. Schuit
O.B. Wijers
F.J.P. Hoebers
J.A. Langendijk
A.M. Leeuwenberg
Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patients
Clinical and Translational Radiation Oncology
Head and neck cancer
Radiation-induced toxicity
NTCP modeling
Semi-supervised learning
title Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patients
title_full Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patients
title_fullStr Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patients
title_full_unstemmed Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patients
title_short Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patients
title_sort comparing supervised and semi supervised machine learning approaches in ntcp modeling to predict complications in head and neck cancer patients
topic Head and neck cancer
Radiation-induced toxicity
NTCP modeling
Semi-supervised learning
url http://www.sciencedirect.com/science/article/pii/S2405630823001027
work_keys_str_mv AT ispiero comparingsupervisedandsemisupervisedmachinelearningapproachesinntcpmodelingtopredictcomplicationsinheadandneckcancerpatients
AT eschuit comparingsupervisedandsemisupervisedmachinelearningapproachesinntcpmodelingtopredictcomplicationsinheadandneckcancerpatients
AT obwijers comparingsupervisedandsemisupervisedmachinelearningapproachesinntcpmodelingtopredictcomplicationsinheadandneckcancerpatients
AT fjphoebers comparingsupervisedandsemisupervisedmachinelearningapproachesinntcpmodelingtopredictcomplicationsinheadandneckcancerpatients
AT jalangendijk comparingsupervisedandsemisupervisedmachinelearningapproachesinntcpmodelingtopredictcomplicationsinheadandneckcancerpatients
AT amleeuwenberg comparingsupervisedandsemisupervisedmachinelearningapproachesinntcpmodelingtopredictcomplicationsinheadandneckcancerpatients