Comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality: A retrospective study

Background: A decision tree is a popular predictive modeling technique used to classify observations based on their properties and can also be used for numeric or categorical predictions. A logistic regression model estimates the relationship between a dependent categorical variable and one or more...

Full description

Bibliographic Details
Main Authors: K Sevvanthi, Sachit Ganapathy, Prasanth Penumadu, K T Harichandrakumar
Format: Article
Language:English
Published: Wolters Kluwer Medknow Publications 2023-01-01
Series:Cancer Research, Statistics, and Treatment
Subjects:
Online Access:http://www.crstonline.com/article.asp?issn=2590-3233;year=2023;volume=6;issue=1;spage=103;epage=110;aulast=Sevvanthi
_version_ 1797824463615557632
author K Sevvanthi
Sachit Ganapathy
Prasanth Penumadu
K T Harichandrakumar
author_facet K Sevvanthi
Sachit Ganapathy
Prasanth Penumadu
K T Harichandrakumar
author_sort K Sevvanthi
collection DOAJ
description Background: A decision tree is a popular predictive modeling technique used to classify observations based on their properties and can also be used for numeric or categorical predictions. A logistic regression model estimates the relationship between a dependent categorical variable and one or more independent variables. Objective: To compare the performances of logistic regression and decision tree models for predicting mortality in patients with oral cavity cancer. Materials and Methods: This was a retrospective study on de-identified records of patients with oral cavity cancer who received treatment at Jawaharlal Institute of Post Graduate Medical Education and Research, a tertiary healthcare hospital in Puducherry, South India, from 2011 to 2017. The models were built by incorporating potential predictive variables such as stage of the cancer, tumor and node categories, and margin status. The performances of the classification models were compared using sensitivity, specificity, predictive accuracy, and by plotting the area under the receiver operating characteristic (ROC) curves. Results: Among the 427 patients who received treatment for oral cavity cancer between 2011 and 2017, we included 275 who had undergone surgery and whose histopathology details were available. There were 180 (65.5%) male patients. The median age was 55 years (range, 26-87). The tumor stage and margin status of the cancer were found to be significant predictors of mortality in a multiple logistic regression model. The predictive accuracies and the areas under the ROC curves of logistic regression and decision tree models were found to be similar. Conclusion: A decision tree method along with other machine learning algorithms could be an innovative alternative to statistical models for the prediction of outcomes when the assumptions of these models fail. The application of these machine learning algorithms can help risk stratify patients with oral cavity cancer and thus aid in their treatment planning.
first_indexed 2024-03-13T10:39:21Z
format Article
id doaj.art-c48c3434227b437999120a5bd88cf6c6
institution Directory Open Access Journal
issn 2590-3233
2590-3225
language English
last_indexed 2024-03-13T10:39:21Z
publishDate 2023-01-01
publisher Wolters Kluwer Medknow Publications
record_format Article
series Cancer Research, Statistics, and Treatment
spelling doaj.art-c48c3434227b437999120a5bd88cf6c62023-05-18T05:04:23ZengWolters Kluwer Medknow PublicationsCancer Research, Statistics, and Treatment2590-32332590-32252023-01-016110311010.4103/crst.crst_234_22Comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality: A retrospective studyK SevvanthiSachit GanapathyPrasanth PenumaduK T HarichandrakumarBackground: A decision tree is a popular predictive modeling technique used to classify observations based on their properties and can also be used for numeric or categorical predictions. A logistic regression model estimates the relationship between a dependent categorical variable and one or more independent variables. Objective: To compare the performances of logistic regression and decision tree models for predicting mortality in patients with oral cavity cancer. Materials and Methods: This was a retrospective study on de-identified records of patients with oral cavity cancer who received treatment at Jawaharlal Institute of Post Graduate Medical Education and Research, a tertiary healthcare hospital in Puducherry, South India, from 2011 to 2017. The models were built by incorporating potential predictive variables such as stage of the cancer, tumor and node categories, and margin status. The performances of the classification models were compared using sensitivity, specificity, predictive accuracy, and by plotting the area under the receiver operating characteristic (ROC) curves. Results: Among the 427 patients who received treatment for oral cavity cancer between 2011 and 2017, we included 275 who had undergone surgery and whose histopathology details were available. There were 180 (65.5%) male patients. The median age was 55 years (range, 26-87). The tumor stage and margin status of the cancer were found to be significant predictors of mortality in a multiple logistic regression model. The predictive accuracies and the areas under the ROC curves of logistic regression and decision tree models were found to be similar. Conclusion: A decision tree method along with other machine learning algorithms could be an innovative alternative to statistical models for the prediction of outcomes when the assumptions of these models fail. The application of these machine learning algorithms can help risk stratify patients with oral cavity cancer and thus aid in their treatment planning.http://www.crstonline.com/article.asp?issn=2590-3233;year=2023;volume=6;issue=1;spage=103;epage=110;aulast=Sevvanthidecision treelogistic regressionmortalitypredictive accuracy
spellingShingle K Sevvanthi
Sachit Ganapathy
Prasanth Penumadu
K T Harichandrakumar
Comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality: A retrospective study
Cancer Research, Statistics, and Treatment
decision tree
logistic regression
mortality
predictive accuracy
title Comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality: A retrospective study
title_full Comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality: A retrospective study
title_fullStr Comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality: A retrospective study
title_full_unstemmed Comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality: A retrospective study
title_short Comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality: A retrospective study
title_sort comparing the predictive performance of a decision tree with logistic regression for oral cavity cancer mortality a retrospective study
topic decision tree
logistic regression
mortality
predictive accuracy
url http://www.crstonline.com/article.asp?issn=2590-3233;year=2023;volume=6;issue=1;spage=103;epage=110;aulast=Sevvanthi
work_keys_str_mv AT ksevvanthi comparingthepredictiveperformanceofadecisiontreewithlogisticregressionfororalcavitycancermortalityaretrospectivestudy
AT sachitganapathy comparingthepredictiveperformanceofadecisiontreewithlogisticregressionfororalcavitycancermortalityaretrospectivestudy
AT prasanthpenumadu comparingthepredictiveperformanceofadecisiontreewithlogisticregressionfororalcavitycancermortalityaretrospectivestudy
AT ktharichandrakumar comparingthepredictiveperformanceofadecisiontreewithlogisticregressionfororalcavitycancermortalityaretrospectivestudy