Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers.

<h4>Purpose</h4>The aim of the present study was to investigate the predictive ability for lung cancer of symptoms reported in an adaptive e-questionnaire, separately for never smokers, former smokers, and current smokers.<h4>Patients and methods</h4>Consecutive patients refe...

Full description

Bibliographic Details
Main Authors: Elinor Nemlander, Andreas Rosenblad, Eliya Abedi, Simon Ekman, Jan Hasselström, Lars E Eriksson, Axel C Carlsson
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2022-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0276703
_version_ 1811326215928152064
author Elinor Nemlander
Andreas Rosenblad
Eliya Abedi
Simon Ekman
Jan Hasselström
Lars E Eriksson
Axel C Carlsson
author_facet Elinor Nemlander
Andreas Rosenblad
Eliya Abedi
Simon Ekman
Jan Hasselström
Lars E Eriksson
Axel C Carlsson
author_sort Elinor Nemlander
collection DOAJ
description <h4>Purpose</h4>The aim of the present study was to investigate the predictive ability for lung cancer of symptoms reported in an adaptive e-questionnaire, separately for never smokers, former smokers, and current smokers.<h4>Patients and methods</h4>Consecutive patients referred for suspected lung cancer were recruited between September 2014 and November 2015 from the lung clinic at the Karolinska University Hospital, Stockholm, Sweden. A total of 504 patients were later diagnosed with lung cancer (n = 310) or no cancer (n = 194). All participants answered an adaptive e-questionnaire with a maximum of 342 items, covering background variables and symptoms/sensations suspected to be associated with lung cancer. Stochastic gradient boosting, stratified on smoking status, was used to train and test a model for predicting the presence of lung cancer.<h4>Results</h4>Among never smokers, 17 predictors contributed to predicting lung cancer with 82% of the patients being correctly classified, compared with 26 predictors with an accuracy of 77% among current smokers and 36 predictors with an accuracy of 63% among former smokers. Age, sex, and education level were the most important predictors in all models.<h4>Conclusion</h4>Methods or tools to assess the likelihood of lung cancer based on smoking status and to prioritize investigative and treatment measures among all patients seeking care with diffuse symptoms are much needed. Our study presents risk assessment models for patients with different smoking status that may be developed into clinical risk assessment tools that can help clinicians in assessing a patient's risk of having lung cancer.
first_indexed 2024-04-13T14:46:14Z
format Article
id doaj.art-9654b4dac27441989483bd9b9f97bd61
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-13T14:46:14Z
publishDate 2022-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-9654b4dac27441989483bd9b9f97bd612022-12-22T02:42:45ZengPublic Library of Science (PLoS)PLoS ONE1932-62032022-01-011710e027670310.1371/journal.pone.0276703Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers.Elinor NemlanderAndreas RosenbladEliya AbediSimon EkmanJan HasselströmLars E ErikssonAxel C Carlsson<h4>Purpose</h4>The aim of the present study was to investigate the predictive ability for lung cancer of symptoms reported in an adaptive e-questionnaire, separately for never smokers, former smokers, and current smokers.<h4>Patients and methods</h4>Consecutive patients referred for suspected lung cancer were recruited between September 2014 and November 2015 from the lung clinic at the Karolinska University Hospital, Stockholm, Sweden. A total of 504 patients were later diagnosed with lung cancer (n = 310) or no cancer (n = 194). All participants answered an adaptive e-questionnaire with a maximum of 342 items, covering background variables and symptoms/sensations suspected to be associated with lung cancer. Stochastic gradient boosting, stratified on smoking status, was used to train and test a model for predicting the presence of lung cancer.<h4>Results</h4>Among never smokers, 17 predictors contributed to predicting lung cancer with 82% of the patients being correctly classified, compared with 26 predictors with an accuracy of 77% among current smokers and 36 predictors with an accuracy of 63% among former smokers. Age, sex, and education level were the most important predictors in all models.<h4>Conclusion</h4>Methods or tools to assess the likelihood of lung cancer based on smoking status and to prioritize investigative and treatment measures among all patients seeking care with diffuse symptoms are much needed. Our study presents risk assessment models for patients with different smoking status that may be developed into clinical risk assessment tools that can help clinicians in assessing a patient's risk of having lung cancer.https://doi.org/10.1371/journal.pone.0276703
spellingShingle Elinor Nemlander
Andreas Rosenblad
Eliya Abedi
Simon Ekman
Jan Hasselström
Lars E Eriksson
Axel C Carlsson
Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers.
PLoS ONE
title Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers.
title_full Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers.
title_fullStr Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers.
title_full_unstemmed Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers.
title_short Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers.
title_sort lung cancer prediction using machine learning on data from a symptom e questionnaire for never smokers formers smokers and current smokers
url https://doi.org/10.1371/journal.pone.0276703
work_keys_str_mv AT elinornemlander lungcancerpredictionusingmachinelearningondatafromasymptomequestionnaireforneversmokersformerssmokersandcurrentsmokers
AT andreasrosenblad lungcancerpredictionusingmachinelearningondatafromasymptomequestionnaireforneversmokersformerssmokersandcurrentsmokers
AT eliyaabedi lungcancerpredictionusingmachinelearningondatafromasymptomequestionnaireforneversmokersformerssmokersandcurrentsmokers
AT simonekman lungcancerpredictionusingmachinelearningondatafromasymptomequestionnaireforneversmokersformerssmokersandcurrentsmokers
AT janhasselstrom lungcancerpredictionusingmachinelearningondatafromasymptomequestionnaireforneversmokersformerssmokersandcurrentsmokers
AT larseeriksson lungcancerpredictionusingmachinelearningondatafromasymptomequestionnaireforneversmokersformerssmokersandcurrentsmokers
AT axelccarlsson lungcancerpredictionusingmachinelearningondatafromasymptomequestionnaireforneversmokersformerssmokersandcurrentsmokers