Risk factors affecting patients survival with colorectal cancer in Morocco: survival analysis using an interpretable machine learning approach

Abstract The aim of our study was to assess the overall survival rates for colorectal cancer at 3 years and to identify associated strong prognostic factors among patients in Morocco through an interpretable machine learning approach. This approach is based on a fully non-parametric survival random...

Full description

Bibliographic Details
Main Authors: Imad El Badisy, Zineb Ben Brahim, Mohamed Khalis, Soukaina Elansari, Youssef ElHitmi, Fouad Abbass, Nawfal Mellas, Karima EL Rhazi
Format: Article
Language:English
Published: Nature Portfolio 2024-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-024-51304-3
_version_ 1797275053580091392
author Imad El Badisy
Zineb Ben Brahim
Mohamed Khalis
Soukaina Elansari
Youssef ElHitmi
Fouad Abbass
Nawfal Mellas
Karima EL Rhazi
author_facet Imad El Badisy
Zineb Ben Brahim
Mohamed Khalis
Soukaina Elansari
Youssef ElHitmi
Fouad Abbass
Nawfal Mellas
Karima EL Rhazi
author_sort Imad El Badisy
collection DOAJ
description Abstract The aim of our study was to assess the overall survival rates for colorectal cancer at 3 years and to identify associated strong prognostic factors among patients in Morocco through an interpretable machine learning approach. This approach is based on a fully non-parametric survival random forest (RSF), incorporating variable importance and partial dependence effects. The data was povided from a retrospective study of 343 patients diagnosed and followed at Hassan II University Hospital. Covariate selection was performed using the variable importance based on permutation and partial dependence plots were displayed to explore in depth the relationship between the estimated partial effect of a given predictor and survival rates. The predictive performance was measured by two metrics, the Concordance Index (C-index) and the Brier Score (BS). Overall survival rates at 1, 2 and 3 years were, respectively, 87% (SE = 0.02; CI-95% 0.84–0.91), 77% (SE = 0.02; CI-95% 0.73–0.82) and 60% (SE = 0.03; CI-95% 0.54–0.66). In the Cox model after adjustment for all covariates, sex, tumor differentiation had no significant effect on prognosis, but rather tumor site had a significant effect. The variable importance obtained from RSF strengthens that surgery, stage, insurance, residency, and age were the most important prognostic factors. The discriminative capacity of the Cox PH and RSF was, respectively, 0.771 and 0.798 for the C-index while the accuracy of the Cox PH and RSF was, respectively, 0.257 and 0.207 for the BS. This shows that RSF had both better discriminative capacity and predictive accuracy. Our results show that patients who are older than 70, living in rural areas, without health insurance, at a distant stage and who have not had surgery constitute a subgroup of patients with poor prognosis.
first_indexed 2024-03-07T15:08:49Z
format Article
id doaj.art-18c0d4ebce9c46ceadd145cea36d0e65
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-03-07T15:08:49Z
publishDate 2024-02-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-18c0d4ebce9c46ceadd145cea36d0e652024-03-05T18:46:06ZengNature PortfolioScientific Reports2045-23222024-02-0114111310.1038/s41598-024-51304-3Risk factors affecting patients survival with colorectal cancer in Morocco: survival analysis using an interpretable machine learning approachImad El Badisy0Zineb Ben Brahim1Mohamed Khalis2Soukaina Elansari3Youssef ElHitmi4Fouad Abbass5Nawfal Mellas6Karima EL Rhazi7Mohammed VI Center for Research and InnovationDepartment of Oncology, University Hospital Hassan II, Sidi Mohamed Ben Abdellah UniversityMohammed VI Center for Research and InnovationDepartment of Oncology, University Hospital Hassan II, Sidi Mohamed Ben Abdellah UniversityDepartment of Oncology, University Hospital Hassan II, Sidi Mohamed Ben Abdellah UniversityLaboratory of Epidemiology and Research in Health Sciences, Department of Epidemiology and Public Health, Faculty of Medicine of Fez, Sidi Mohamed Ben Abdillah UniversityDepartment of Oncology, University Hospital Hassan II, Sidi Mohamed Ben Abdellah UniversityLaboratory of Epidemiology and Research in Health Sciences, Department of Epidemiology and Public Health, Faculty of Medicine of Fez, Sidi Mohamed Ben Abdillah UniversityAbstract The aim of our study was to assess the overall survival rates for colorectal cancer at 3 years and to identify associated strong prognostic factors among patients in Morocco through an interpretable machine learning approach. This approach is based on a fully non-parametric survival random forest (RSF), incorporating variable importance and partial dependence effects. The data was povided from a retrospective study of 343 patients diagnosed and followed at Hassan II University Hospital. Covariate selection was performed using the variable importance based on permutation and partial dependence plots were displayed to explore in depth the relationship between the estimated partial effect of a given predictor and survival rates. The predictive performance was measured by two metrics, the Concordance Index (C-index) and the Brier Score (BS). Overall survival rates at 1, 2 and 3 years were, respectively, 87% (SE = 0.02; CI-95% 0.84–0.91), 77% (SE = 0.02; CI-95% 0.73–0.82) and 60% (SE = 0.03; CI-95% 0.54–0.66). In the Cox model after adjustment for all covariates, sex, tumor differentiation had no significant effect on prognosis, but rather tumor site had a significant effect. The variable importance obtained from RSF strengthens that surgery, stage, insurance, residency, and age were the most important prognostic factors. The discriminative capacity of the Cox PH and RSF was, respectively, 0.771 and 0.798 for the C-index while the accuracy of the Cox PH and RSF was, respectively, 0.257 and 0.207 for the BS. This shows that RSF had both better discriminative capacity and predictive accuracy. Our results show that patients who are older than 70, living in rural areas, without health insurance, at a distant stage and who have not had surgery constitute a subgroup of patients with poor prognosis.https://doi.org/10.1038/s41598-024-51304-3
spellingShingle Imad El Badisy
Zineb Ben Brahim
Mohamed Khalis
Soukaina Elansari
Youssef ElHitmi
Fouad Abbass
Nawfal Mellas
Karima EL Rhazi
Risk factors affecting patients survival with colorectal cancer in Morocco: survival analysis using an interpretable machine learning approach
Scientific Reports
title Risk factors affecting patients survival with colorectal cancer in Morocco: survival analysis using an interpretable machine learning approach
title_full Risk factors affecting patients survival with colorectal cancer in Morocco: survival analysis using an interpretable machine learning approach
title_fullStr Risk factors affecting patients survival with colorectal cancer in Morocco: survival analysis using an interpretable machine learning approach
title_full_unstemmed Risk factors affecting patients survival with colorectal cancer in Morocco: survival analysis using an interpretable machine learning approach
title_short Risk factors affecting patients survival with colorectal cancer in Morocco: survival analysis using an interpretable machine learning approach
title_sort risk factors affecting patients survival with colorectal cancer in morocco survival analysis using an interpretable machine learning approach
url https://doi.org/10.1038/s41598-024-51304-3
work_keys_str_mv AT imadelbadisy riskfactorsaffectingpatientssurvivalwithcolorectalcancerinmoroccosurvivalanalysisusinganinterpretablemachinelearningapproach
AT zinebbenbrahim riskfactorsaffectingpatientssurvivalwithcolorectalcancerinmoroccosurvivalanalysisusinganinterpretablemachinelearningapproach
AT mohamedkhalis riskfactorsaffectingpatientssurvivalwithcolorectalcancerinmoroccosurvivalanalysisusinganinterpretablemachinelearningapproach
AT soukainaelansari riskfactorsaffectingpatientssurvivalwithcolorectalcancerinmoroccosurvivalanalysisusinganinterpretablemachinelearningapproach
AT youssefelhitmi riskfactorsaffectingpatientssurvivalwithcolorectalcancerinmoroccosurvivalanalysisusinganinterpretablemachinelearningapproach
AT fouadabbass riskfactorsaffectingpatientssurvivalwithcolorectalcancerinmoroccosurvivalanalysisusinganinterpretablemachinelearningapproach
AT nawfalmellas riskfactorsaffectingpatientssurvivalwithcolorectalcancerinmoroccosurvivalanalysisusinganinterpretablemachinelearningapproach
AT karimaelrhazi riskfactorsaffectingpatientssurvivalwithcolorectalcancerinmoroccosurvivalanalysisusinganinterpretablemachinelearningapproach