A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction models

Summary: Background: Surveillance is universally recommended for non-small cell lung cancer (NSCLC) patients treated with curative-intent radiotherapy. High-quality evidence to inform optimal surveillance strategies is lacking. Machine learning demonstrates promise in accurate outcome prediction fo...

Full description

Bibliographic Details
Main Authors: Sumeet Hindocha, Thomas G. Charlton, Kristofer Linton-Reid, Benjamin Hunter, Charleen Chan, Merina Ahmed, Emily J. Robinson, Matthew Orton, Shahreen Ahmad, Fiona McDonald, Imogen Locke, Danielle Power, Matthew Blackledge, Richard W. Lee, Eric O. Aboagye
Format: Article
Language:English
Published: Elsevier 2022-03-01
Series:EBioMedicine
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352396422000950
_version_ 1818910106961575936
author Sumeet Hindocha
Thomas G. Charlton
Kristofer Linton-Reid
Benjamin Hunter
Charleen Chan
Merina Ahmed
Emily J. Robinson
Matthew Orton
Shahreen Ahmad
Fiona McDonald
Imogen Locke
Danielle Power
Matthew Blackledge
Richard W. Lee
Eric O. Aboagye
author_facet Sumeet Hindocha
Thomas G. Charlton
Kristofer Linton-Reid
Benjamin Hunter
Charleen Chan
Merina Ahmed
Emily J. Robinson
Matthew Orton
Shahreen Ahmad
Fiona McDonald
Imogen Locke
Danielle Power
Matthew Blackledge
Richard W. Lee
Eric O. Aboagye
author_sort Sumeet Hindocha
collection DOAJ
description Summary: Background: Surveillance is universally recommended for non-small cell lung cancer (NSCLC) patients treated with curative-intent radiotherapy. High-quality evidence to inform optimal surveillance strategies is lacking. Machine learning demonstrates promise in accurate outcome prediction for a variety of health conditions. The purpose of this study was to utilise readily available patient, tumour, and treatment data to develop, validate and externally test machine learning models for predicting recurrence, recurrence-free survival (RFS) and overall survival (OS) at 2 years from treatment. Methods: A retrospective, multicentre study of patients receiving curative-intent radiotherapy for NSCLC was undertaken. A total of 657 patients from 5 hospitals were eligible for inclusion. Data pre-processing derived 34 features for predictive modelling. Combinations of 8 feature reduction methods and 10 machine learning classification algorithms were compared, producing risk-stratification models for predicting recurrence, RFS and OS. Models were compared with 10-fold cross validation and an external test set and benchmarked against TNM-stage and performance status. Youden Index was derived from validation set ROC curves to distinguish high and low risk groups and Kaplan-Meier analyses performed. Findings: Median follow-up time was 852 days. Parameters were well matched across training-validation and external test sets: Mean age was 73 and 71 respectively, and recurrence, RFS and OS rates at 2 years were 43% vs 34%, 54% vs 47% and 54% vs 47% respectively. The respective validation and test set AUCs were as follows: 1) RFS: 0·682 (0·575–0·788) and 0·681 (0·597–0·766), 2) Recurrence: 0·687 (0·582–0·793) and 0·722 (0·635–0·81), and 3) OS: 0·759 (0·663–0·855) and 0·717 (0·634–0·8). Our models were superior to TNM stage and performance status in predicting recurrence and OS. Interpretation: This robust and ready to use machine learning method, validated and externally tested, sets the stage for future clinical trials entailing quantitative personalised risk-stratification and surveillance following curative-intent radiotherapy for NSCLC. Funding: A full list of funding bodies that contributed to this study can be found in the Acknowledgements section.
first_indexed 2024-12-19T22:37:32Z
format Article
id doaj.art-338d2d91deec420ba0ba2951aac1baea
institution Directory Open Access Journal
issn 2352-3964
language English
last_indexed 2024-12-19T22:37:32Z
publishDate 2022-03-01
publisher Elsevier
record_format Article
series EBioMedicine
spelling doaj.art-338d2d91deec420ba0ba2951aac1baea2022-12-21T20:03:09ZengElsevierEBioMedicine2352-39642022-03-0177103911A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction modelsSumeet Hindocha0Thomas G. Charlton1Kristofer Linton-Reid2Benjamin Hunter3Charleen Chan4Merina Ahmed5Emily J. Robinson6Matthew Orton7Shahreen Ahmad8Fiona McDonald9Imogen Locke10Danielle Power11Matthew Blackledge12Richard W. Lee13Eric O. Aboagye14Lung Unit, The Royal Marsden NHS Foundation Trust, Fulham Road, London SW36JJ, UK; AI for Healthcare Centre for Doctoral Training, Imperial College London, Exhibition Road, London SW7 2BX, UK; Department of Clinical Oncology, Institute of Cancer Research NIHR Biomedical Research Centre, London, UK; Cancer Imaging Centre, Department of Surgery and Cancer, Imperial College London, Du Cane Road, London W12 0NN, UK; Early Diagnosis and Detection Centre, National Institute for Health Research (NIHR) Biomedical Research Centre at The Royal Marsden NHS Foundation Trust and the Institute of Cancer Research, LondonGuy's Cancer Centre, Guy's and St Thomas’ NHS Foundation Trust, Great Maze Pond, London SE19RT UKCancer Imaging Centre, Department of Surgery and Cancer, Imperial College London, Du Cane Road, London W12 0NN, UKLung Unit, The Royal Marsden NHS Foundation Trust, Fulham Road, London SW36JJ, UK; Department of Clinical Oncology, Institute of Cancer Research NIHR Biomedical Research Centre, London, UK; Cancer Imaging Centre, Department of Surgery and Cancer, Imperial College London, Du Cane Road, London W12 0NN, UK; Early Diagnosis and Detection Centre, National Institute for Health Research (NIHR) Biomedical Research Centre at The Royal Marsden NHS Foundation Trust and the Institute of Cancer Research, LondonDepartment of Clinical Oncology, Institute of Cancer Research NIHR Biomedical Research Centre, London, UKLung Unit, The Royal Marsden NHS Foundation Trust, Downs Road, Sutton SM25PT, UKClinical Trials Unit, Royal Marsden NHS Foundation Trust, Downs Road, Sutton SM25PT, UKArtificial Intelligence Imaging Hub, Royal Marsden NHS Foundation Trust, Downs Road, Sutton SM25PT, UKGuy's Cancer Centre, Guy's and St Thomas’ NHS Foundation Trust, Great Maze Pond, London SE19RT UKLung Unit, The Royal Marsden NHS Foundation Trust, Fulham Road, London SW36JJ, UK; Department of Clinical Oncology, Institute of Cancer Research NIHR Biomedical Research Centre, London, UKLung Unit, The Royal Marsden NHS Foundation Trust, Downs Road, Sutton SM25PT, UKDepartment of Clinical Oncology, Charing Cross Hospital, Fulham Palace Road, London W6 8RF, UKRadiotherapy and Imaging, Institute of Cancer Research, 123 Old Brompton Road, London SW7 3RP, UKLung Unit, The Royal Marsden NHS Foundation Trust, Fulham Road, London SW36JJ, UK; Early Diagnosis and Detection Centre, National Institute for Health Research (NIHR) Biomedical Research Centre at The Royal Marsden NHS Foundation Trust and the Institute of Cancer Research, London; National Heart and Lung Institute, Imperial College, London, UK; Corresponding author at: Lung Unit, The Royal Marsden NHS Foundation Trust, Fulham Road, London SW36JJ, UK.Department of Clinical Oncology, Institute of Cancer Research NIHR Biomedical Research Centre, London, UK; Corresponding author at: Lung Unit, The Royal Marsden NHS Foundation Trust, Fulham Road, London SW36JJ, UK.Summary: Background: Surveillance is universally recommended for non-small cell lung cancer (NSCLC) patients treated with curative-intent radiotherapy. High-quality evidence to inform optimal surveillance strategies is lacking. Machine learning demonstrates promise in accurate outcome prediction for a variety of health conditions. The purpose of this study was to utilise readily available patient, tumour, and treatment data to develop, validate and externally test machine learning models for predicting recurrence, recurrence-free survival (RFS) and overall survival (OS) at 2 years from treatment. Methods: A retrospective, multicentre study of patients receiving curative-intent radiotherapy for NSCLC was undertaken. A total of 657 patients from 5 hospitals were eligible for inclusion. Data pre-processing derived 34 features for predictive modelling. Combinations of 8 feature reduction methods and 10 machine learning classification algorithms were compared, producing risk-stratification models for predicting recurrence, RFS and OS. Models were compared with 10-fold cross validation and an external test set and benchmarked against TNM-stage and performance status. Youden Index was derived from validation set ROC curves to distinguish high and low risk groups and Kaplan-Meier analyses performed. Findings: Median follow-up time was 852 days. Parameters were well matched across training-validation and external test sets: Mean age was 73 and 71 respectively, and recurrence, RFS and OS rates at 2 years were 43% vs 34%, 54% vs 47% and 54% vs 47% respectively. The respective validation and test set AUCs were as follows: 1) RFS: 0·682 (0·575–0·788) and 0·681 (0·597–0·766), 2) Recurrence: 0·687 (0·582–0·793) and 0·722 (0·635–0·81), and 3) OS: 0·759 (0·663–0·855) and 0·717 (0·634–0·8). Our models were superior to TNM stage and performance status in predicting recurrence and OS. Interpretation: This robust and ready to use machine learning method, validated and externally tested, sets the stage for future clinical trials entailing quantitative personalised risk-stratification and surveillance following curative-intent radiotherapy for NSCLC. Funding: A full list of funding bodies that contributed to this study can be found in the Acknowledgements section.http://www.sciencedirect.com/science/article/pii/S2352396422000950Non-small cell lung cancerRadiotherapyMachine learningRecurrenceOverall survivalPrediction
spellingShingle Sumeet Hindocha
Thomas G. Charlton
Kristofer Linton-Reid
Benjamin Hunter
Charleen Chan
Merina Ahmed
Emily J. Robinson
Matthew Orton
Shahreen Ahmad
Fiona McDonald
Imogen Locke
Danielle Power
Matthew Blackledge
Richard W. Lee
Eric O. Aboagye
A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction models
EBioMedicine
Non-small cell lung cancer
Radiotherapy
Machine learning
Recurrence
Overall survival
Prediction
title A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction models
title_full A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction models
title_fullStr A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction models
title_full_unstemmed A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction models
title_short A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction models
title_sort comparison of machine learning methods for predicting recurrence and death after curative intent radiotherapy for non small cell lung cancer development and validation of multivariable clinical prediction models
topic Non-small cell lung cancer
Radiotherapy
Machine learning
Recurrence
Overall survival
Prediction
url http://www.sciencedirect.com/science/article/pii/S2352396422000950
work_keys_str_mv AT sumeethindocha acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT thomasgcharlton acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT kristoferlintonreid acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT benjaminhunter acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT charleenchan acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT merinaahmed acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT emilyjrobinson acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT mattheworton acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT shahreenahmad acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT fionamcdonald acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT imogenlocke acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT daniellepower acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT matthewblackledge acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT richardwlee acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT ericoaboagye acomparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT sumeethindocha comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT thomasgcharlton comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT kristoferlintonreid comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT benjaminhunter comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT charleenchan comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT merinaahmed comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT emilyjrobinson comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT mattheworton comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT shahreenahmad comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT fionamcdonald comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT imogenlocke comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT daniellepower comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT matthewblackledge comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT richardwlee comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels
AT ericoaboagye comparisonofmachinelearningmethodsforpredictingrecurrenceanddeathaftercurativeintentradiotherapyfornonsmallcelllungcancerdevelopmentandvalidationofmultivariableclinicalpredictionmodels