Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study

Lung cancer can be challenging to diagnose in the early stages, where treatment options are optimal. We aimed to develop 1-year prediction models for the individual risk of incident lung cancer for all individuals aged 40 or above living in Denmark on 1 January 2017. The study was conducted using po...

Full description

Bibliographic Details
Main Authors: Katrine H. Rubin, Peter F. Haastrup, Anne Nicolaisen, Sören Möller, Sonja Wehberg, Sanne Rasmussen, Kirubakaran Balasubramaniam, Jens Søndergaard, Dorte E. Jarbøl
Format: Article
Language:English
Published: MDPI AG 2023-01-01
Series:Cancers
Subjects:
Online Access:https://www.mdpi.com/2072-6694/15/2/487
_version_ 1797444811124375552
author Katrine H. Rubin
Peter F. Haastrup
Anne Nicolaisen
Sören Möller
Sonja Wehberg
Sanne Rasmussen
Kirubakaran Balasubramaniam
Jens Søndergaard
Dorte E. Jarbøl
author_facet Katrine H. Rubin
Peter F. Haastrup
Anne Nicolaisen
Sören Möller
Sonja Wehberg
Sanne Rasmussen
Kirubakaran Balasubramaniam
Jens Søndergaard
Dorte E. Jarbøl
author_sort Katrine H. Rubin
collection DOAJ
description Lung cancer can be challenging to diagnose in the early stages, where treatment options are optimal. We aimed to develop 1-year prediction models for the individual risk of incident lung cancer for all individuals aged 40 or above living in Denmark on 1 January 2017. The study was conducted using population-based registers on health and sociodemographics from 2007–2016. We applied backward selection on all variables by logistic regression to develop a risk model for lung cancer and applied the models to the validation cohort, calculated receiver-operating characteristic curves, and estimated the corresponding areas under the curve (AUC). In the populations without and with previously confirmed cancer, 4274/2,826,249 (0.15%) and 482/172,513 (0.3%) individuals received a lung cancer diagnosis in 2017, respectively. For both populations, older age was a relevant predictor, and the most complex models, containing variables related to diagnoses, medication, general practitioner, and specialist contacts, as well as baseline sociodemographic characteristics, had the highest AUC. These models achieved a positive predictive value (PPV) of 0.0127 (0.006) and a negative predictive value (NPV) of 0.989 (0.997) with a 1% cut-off in the population without (with) previous cancer. This corresponds to 1.2% of the screened population experiencing a positive prediction, of which 1.3% would be incident with lung cancer. We have developed and tested a prediction model with a reasonable potential to support clinicians and healthcare planners in identifying patients at risk of lung cancer.
first_indexed 2024-03-09T13:17:56Z
format Article
id doaj.art-e7631c0582c8417b80e2769185f3a35b
institution Directory Open Access Journal
issn 2072-6694
language English
last_indexed 2024-03-09T13:17:56Z
publishDate 2023-01-01
publisher MDPI AG
record_format Article
series Cancers
spelling doaj.art-e7631c0582c8417b80e2769185f3a35b2023-11-30T21:34:35ZengMDPI AGCancers2072-66942023-01-0115248710.3390/cancers15020487Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based StudyKatrine H. Rubin0Peter F. Haastrup1Anne Nicolaisen2Sören Möller3Sonja Wehberg4Sanne Rasmussen5Kirubakaran Balasubramaniam6Jens Søndergaard7Dorte E. Jarbøl8OPEN—Open Patient Data Explorative Network, Odense University Hospital, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkOPEN—Open Patient Data Explorative Network, Odense University Hospital, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkLung cancer can be challenging to diagnose in the early stages, where treatment options are optimal. We aimed to develop 1-year prediction models for the individual risk of incident lung cancer for all individuals aged 40 or above living in Denmark on 1 January 2017. The study was conducted using population-based registers on health and sociodemographics from 2007–2016. We applied backward selection on all variables by logistic regression to develop a risk model for lung cancer and applied the models to the validation cohort, calculated receiver-operating characteristic curves, and estimated the corresponding areas under the curve (AUC). In the populations without and with previously confirmed cancer, 4274/2,826,249 (0.15%) and 482/172,513 (0.3%) individuals received a lung cancer diagnosis in 2017, respectively. For both populations, older age was a relevant predictor, and the most complex models, containing variables related to diagnoses, medication, general practitioner, and specialist contacts, as well as baseline sociodemographic characteristics, had the highest AUC. These models achieved a positive predictive value (PPV) of 0.0127 (0.006) and a negative predictive value (NPV) of 0.989 (0.997) with a 1% cut-off in the population without (with) previous cancer. This corresponds to 1.2% of the screened population experiencing a positive prediction, of which 1.3% would be incident with lung cancer. We have developed and tested a prediction model with a reasonable potential to support clinicians and healthcare planners in identifying patients at risk of lung cancer.https://www.mdpi.com/2072-6694/15/2/487cancer diagnosisautomated risk calculationprediction modelsregister datalung cancersocioeconomic status
spellingShingle Katrine H. Rubin
Peter F. Haastrup
Anne Nicolaisen
Sören Möller
Sonja Wehberg
Sanne Rasmussen
Kirubakaran Balasubramaniam
Jens Søndergaard
Dorte E. Jarbøl
Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study
Cancers
cancer diagnosis
automated risk calculation
prediction models
register data
lung cancer
socioeconomic status
title Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study
title_full Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study
title_fullStr Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study
title_full_unstemmed Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study
title_short Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study
title_sort developing and validating a lung cancer risk prediction model a nationwide population based study
topic cancer diagnosis
automated risk calculation
prediction models
register data
lung cancer
socioeconomic status
url https://www.mdpi.com/2072-6694/15/2/487
work_keys_str_mv AT katrinehrubin developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy
AT peterfhaastrup developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy
AT annenicolaisen developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy
AT sorenmoller developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy
AT sonjawehberg developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy
AT sannerasmussen developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy
AT kirubakaranbalasubramaniam developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy
AT jenssøndergaard developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy
AT dorteejarbøl developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy