Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study
Lung cancer can be challenging to diagnose in the early stages, where treatment options are optimal. We aimed to develop 1-year prediction models for the individual risk of incident lung cancer for all individuals aged 40 or above living in Denmark on 1 January 2017. The study was conducted using po...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-01-01
|
Series: | Cancers |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-6694/15/2/487 |
_version_ | 1797444811124375552 |
---|---|
author | Katrine H. Rubin Peter F. Haastrup Anne Nicolaisen Sören Möller Sonja Wehberg Sanne Rasmussen Kirubakaran Balasubramaniam Jens Søndergaard Dorte E. Jarbøl |
author_facet | Katrine H. Rubin Peter F. Haastrup Anne Nicolaisen Sören Möller Sonja Wehberg Sanne Rasmussen Kirubakaran Balasubramaniam Jens Søndergaard Dorte E. Jarbøl |
author_sort | Katrine H. Rubin |
collection | DOAJ |
description | Lung cancer can be challenging to diagnose in the early stages, where treatment options are optimal. We aimed to develop 1-year prediction models for the individual risk of incident lung cancer for all individuals aged 40 or above living in Denmark on 1 January 2017. The study was conducted using population-based registers on health and sociodemographics from 2007–2016. We applied backward selection on all variables by logistic regression to develop a risk model for lung cancer and applied the models to the validation cohort, calculated receiver-operating characteristic curves, and estimated the corresponding areas under the curve (AUC). In the populations without and with previously confirmed cancer, 4274/2,826,249 (0.15%) and 482/172,513 (0.3%) individuals received a lung cancer diagnosis in 2017, respectively. For both populations, older age was a relevant predictor, and the most complex models, containing variables related to diagnoses, medication, general practitioner, and specialist contacts, as well as baseline sociodemographic characteristics, had the highest AUC. These models achieved a positive predictive value (PPV) of 0.0127 (0.006) and a negative predictive value (NPV) of 0.989 (0.997) with a 1% cut-off in the population without (with) previous cancer. This corresponds to 1.2% of the screened population experiencing a positive prediction, of which 1.3% would be incident with lung cancer. We have developed and tested a prediction model with a reasonable potential to support clinicians and healthcare planners in identifying patients at risk of lung cancer. |
first_indexed | 2024-03-09T13:17:56Z |
format | Article |
id | doaj.art-e7631c0582c8417b80e2769185f3a35b |
institution | Directory Open Access Journal |
issn | 2072-6694 |
language | English |
last_indexed | 2024-03-09T13:17:56Z |
publishDate | 2023-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Cancers |
spelling | doaj.art-e7631c0582c8417b80e2769185f3a35b2023-11-30T21:34:35ZengMDPI AGCancers2072-66942023-01-0115248710.3390/cancers15020487Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based StudyKatrine H. Rubin0Peter F. Haastrup1Anne Nicolaisen2Sören Möller3Sonja Wehberg4Sanne Rasmussen5Kirubakaran Balasubramaniam6Jens Søndergaard7Dorte E. Jarbøl8OPEN—Open Patient Data Explorative Network, Odense University Hospital, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkOPEN—Open Patient Data Explorative Network, Odense University Hospital, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkResearch Unit of General Practice, Department of Public Health, University of Southern Denmark, 5000 Odense, DenmarkLung cancer can be challenging to diagnose in the early stages, where treatment options are optimal. We aimed to develop 1-year prediction models for the individual risk of incident lung cancer for all individuals aged 40 or above living in Denmark on 1 January 2017. The study was conducted using population-based registers on health and sociodemographics from 2007–2016. We applied backward selection on all variables by logistic regression to develop a risk model for lung cancer and applied the models to the validation cohort, calculated receiver-operating characteristic curves, and estimated the corresponding areas under the curve (AUC). In the populations without and with previously confirmed cancer, 4274/2,826,249 (0.15%) and 482/172,513 (0.3%) individuals received a lung cancer diagnosis in 2017, respectively. For both populations, older age was a relevant predictor, and the most complex models, containing variables related to diagnoses, medication, general practitioner, and specialist contacts, as well as baseline sociodemographic characteristics, had the highest AUC. These models achieved a positive predictive value (PPV) of 0.0127 (0.006) and a negative predictive value (NPV) of 0.989 (0.997) with a 1% cut-off in the population without (with) previous cancer. This corresponds to 1.2% of the screened population experiencing a positive prediction, of which 1.3% would be incident with lung cancer. We have developed and tested a prediction model with a reasonable potential to support clinicians and healthcare planners in identifying patients at risk of lung cancer.https://www.mdpi.com/2072-6694/15/2/487cancer diagnosisautomated risk calculationprediction modelsregister datalung cancersocioeconomic status |
spellingShingle | Katrine H. Rubin Peter F. Haastrup Anne Nicolaisen Sören Möller Sonja Wehberg Sanne Rasmussen Kirubakaran Balasubramaniam Jens Søndergaard Dorte E. Jarbøl Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study Cancers cancer diagnosis automated risk calculation prediction models register data lung cancer socioeconomic status |
title | Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study |
title_full | Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study |
title_fullStr | Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study |
title_full_unstemmed | Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study |
title_short | Developing and Validating a Lung Cancer Risk Prediction Model: A Nationwide Population-Based Study |
title_sort | developing and validating a lung cancer risk prediction model a nationwide population based study |
topic | cancer diagnosis automated risk calculation prediction models register data lung cancer socioeconomic status |
url | https://www.mdpi.com/2072-6694/15/2/487 |
work_keys_str_mv | AT katrinehrubin developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy AT peterfhaastrup developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy AT annenicolaisen developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy AT sorenmoller developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy AT sonjawehberg developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy AT sannerasmussen developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy AT kirubakaranbalasubramaniam developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy AT jenssøndergaard developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy AT dorteejarbøl developingandvalidatingalungcancerriskpredictionmodelanationwidepopulationbasedstudy |