Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study

<p><strong>Background</strong> Identifying female individuals at highest risk of developing life-threatening breast cancers could inform novel stratified early detection and prevention strategies to reduce breast cancer mortality, rather than only considering cancer incidence. We a...

Full description

Bibliographic Details
Main Authors: Clift, AK, Collins, GS, Lord, S, Petrou, S, Dodwell, D, Brady, M, Hippisley-Cox, J
Format: Journal article
Language:English
Published: Elsevier 2023
_version_ 1797110585775620096
author Clift, AK
Collins, GS
Lord, S
Petrou, S
Dodwell, D
Brady, M
Hippisley-Cox, J
author_facet Clift, AK
Collins, GS
Lord, S
Petrou, S
Dodwell, D
Brady, M
Hippisley-Cox, J
author_sort Clift, AK
collection OXFORD
description <p><strong>Background</strong> Identifying female individuals at highest risk of developing life-threatening breast cancers could inform novel stratified early detection and prevention strategies to reduce breast cancer mortality, rather than only considering cancer incidence. We aimed to develop a prognostic model that accurately predicts the 10-year risk of breast cancer mortality in female individuals without breast cancer at baseline.</p><br> <p><strong>Methods</strong> In this model development and validation study, we used an open cohort study from the QResearch primary care database, which was linked to secondary care and national cancer and mortality registers in England, UK. The data extracted were from female individuals aged 20–90 years without previous breast cancer or ductal carcinoma in situ who entered the cohort between Jan 1, 2000, and Dec 31, 2020. The primary outcome was breast cancer-related death, which was assessed in the full dataset. Cox proportional hazards, competing risks regression, XGBoost, and neural network modelling approaches were used to predict the risk of breast cancer death within 10 years using routinely collected health-care data. Death due to causes other than breast cancer was the competing risk. Internal–external validation was used to evaluate prognostic model performance (using Harrell's C, calibration slope, and calibration in the large), performance heterogeneity, and transportability. Internal–external validation involved dataset partitioning by time period and geographical region. Decision curve analysis was used to assess clinical utility.</p><br> <p><strong>Findings</strong> We identified data for 11 626 969 female individuals, with 70 095 574 person-years of follow-up. There were 142 712 (1·2%) diagnoses of breast cancer, 24 043 (0·2%) breast cancer-related deaths, and 696 106 (6·0%) deaths from other causes. Meta-analysis pooled estimates of Harrell's C were highest for the competing risks model (0·932, 95% CI 0·917–0·946). The competing risks model was well calibrated overall (slope 1·011, 95% CI 0·978–1·044), and across different ethnic groups. Decision curve analysis suggested favourable clinical utility across all age groups. The XGBoost and neural network models had variable performance across age and ethnic groups.</p><br> <p><strong>Interpretation</strong> A model that predicts the combined risk of developing and then dying from breast cancer at the population level could inform stratified screening or chemoprevention strategies. Further evaluation of the competing risks model should comprise effect and health economic assessment of model-informed strategies.</p><br> <p><strong>Funding</strong> Cancer Research UK.</p>
first_indexed 2024-03-07T07:56:52Z
format Journal article
id oxford-uuid:030bf5c9-119a-4c7d-9109-2aebd0b1f483
institution University of Oxford
language English
last_indexed 2024-03-07T07:56:52Z
publishDate 2023
publisher Elsevier
record_format dspace
spelling oxford-uuid:030bf5c9-119a-4c7d-9109-2aebd0b1f4832023-09-04T08:43:19ZPredicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation studyJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:030bf5c9-119a-4c7d-9109-2aebd0b1f483EnglishSymplectic ElementsElsevier2023Clift, AKCollins, GSLord, SPetrou, SDodwell, DBrady, MHippisley-Cox, J<p><strong>Background</strong> Identifying female individuals at highest risk of developing life-threatening breast cancers could inform novel stratified early detection and prevention strategies to reduce breast cancer mortality, rather than only considering cancer incidence. We aimed to develop a prognostic model that accurately predicts the 10-year risk of breast cancer mortality in female individuals without breast cancer at baseline.</p><br> <p><strong>Methods</strong> In this model development and validation study, we used an open cohort study from the QResearch primary care database, which was linked to secondary care and national cancer and mortality registers in England, UK. The data extracted were from female individuals aged 20–90 years without previous breast cancer or ductal carcinoma in situ who entered the cohort between Jan 1, 2000, and Dec 31, 2020. The primary outcome was breast cancer-related death, which was assessed in the full dataset. Cox proportional hazards, competing risks regression, XGBoost, and neural network modelling approaches were used to predict the risk of breast cancer death within 10 years using routinely collected health-care data. Death due to causes other than breast cancer was the competing risk. Internal–external validation was used to evaluate prognostic model performance (using Harrell's C, calibration slope, and calibration in the large), performance heterogeneity, and transportability. Internal–external validation involved dataset partitioning by time period and geographical region. Decision curve analysis was used to assess clinical utility.</p><br> <p><strong>Findings</strong> We identified data for 11 626 969 female individuals, with 70 095 574 person-years of follow-up. There were 142 712 (1·2%) diagnoses of breast cancer, 24 043 (0·2%) breast cancer-related deaths, and 696 106 (6·0%) deaths from other causes. Meta-analysis pooled estimates of Harrell's C were highest for the competing risks model (0·932, 95% CI 0·917–0·946). The competing risks model was well calibrated overall (slope 1·011, 95% CI 0·978–1·044), and across different ethnic groups. Decision curve analysis suggested favourable clinical utility across all age groups. The XGBoost and neural network models had variable performance across age and ethnic groups.</p><br> <p><strong>Interpretation</strong> A model that predicts the combined risk of developing and then dying from breast cancer at the population level could inform stratified screening or chemoprevention strategies. Further evaluation of the competing risks model should comprise effect and health economic assessment of model-informed strategies.</p><br> <p><strong>Funding</strong> Cancer Research UK.</p>
spellingShingle Clift, AK
Collins, GS
Lord, S
Petrou, S
Dodwell, D
Brady, M
Hippisley-Cox, J
Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study
title Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study
title_full Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study
title_fullStr Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study
title_full_unstemmed Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study
title_short Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study
title_sort predicting 10 year breast cancer mortality risk in the general female population in england a model development and validation study
work_keys_str_mv AT cliftak predicting10yearbreastcancermortalityriskinthegeneralfemalepopulationinenglandamodeldevelopmentandvalidationstudy
AT collinsgs predicting10yearbreastcancermortalityriskinthegeneralfemalepopulationinenglandamodeldevelopmentandvalidationstudy
AT lords predicting10yearbreastcancermortalityriskinthegeneralfemalepopulationinenglandamodeldevelopmentandvalidationstudy
AT petrous predicting10yearbreastcancermortalityriskinthegeneralfemalepopulationinenglandamodeldevelopmentandvalidationstudy
AT dodwelld predicting10yearbreastcancermortalityriskinthegeneralfemalepopulationinenglandamodeldevelopmentandvalidationstudy
AT bradym predicting10yearbreastcancermortalityriskinthegeneralfemalepopulationinenglandamodeldevelopmentandvalidationstudy
AT hippisleycoxj predicting10yearbreastcancermortalityriskinthegeneralfemalepopulationinenglandamodeldevelopmentandvalidationstudy