Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network

Background: We aimed to assess the high-risk group for suicide using different classification methods includinglogistic regression (LR), decision tree (DT), artificial neural network (ANN), and support vector machine (SVM). Methods: We used the dataset of a study conducted to predict risk factors o...

Full description

Bibliographic Details
Main Authors: Payam AMINI, Hasan AHMADINIA, Jalal POOROLAJAL, Mohammad MOQADDASI AMIRI
Format: Article
Language:English
Published: Tehran University of Medical Sciences 2016-10-01
Series:Iranian Journal of Public Health
Subjects:
Online Access:https://ijph.tums.ac.ir/index.php/ijph/article/view/7869
_version_ 1818864637389570048
author Payam AMINI
Hasan AHMADINIA
Jalal POOROLAJAL
Mohammad MOQADDASI AMIRI
author_facet Payam AMINI
Hasan AHMADINIA
Jalal POOROLAJAL
Mohammad MOQADDASI AMIRI
author_sort Payam AMINI
collection DOAJ
description Background: We aimed to assess the high-risk group for suicide using different classification methods includinglogistic regression (LR), decision tree (DT), artificial neural network (ANN), and support vector machine (SVM). Methods: We used the dataset of a study conducted to predict risk factors of completed suicide in Hamadan Province, the west of Iran, in 2010. To evaluate the high-risk groups for suicide, LR, SVM, DT and ANNwere performed. The applied methods were compared using sensitivity, specificity, positive predicted value, negative predicted value, accuracy and the area under curve. Cochran-Q test was implied to check differences in proportion among methods. To assess the association between the observed and predicted values, Ø coefficient, contingency coefficient, and Kendall tau-b were calculated. Results: Gender, age, and job were the most important risk factors for fatal suicide attempts in common for four methods. SVM method showed the highest accuracy 0.68 and 0.67 for training and testing sample, respectively. However, this method resulted in the highest specificity (0.67 for training and 0.68 for testing sample) and the highest sensitivity for training sample (0.85), but the lowest sensitivity for the testing sample (0.53). Cochran-Q test resulted in differences between proportions in different methods (P<0.001). The association of SVM predictions and observed values, Ø coefficient, contingency coefficient, and Kendall tau-b were 0.239, 0.232 and 0.239, respectively. Conclusion: SVM had the best performance to classify fatal suicide attempts comparing to DT, LR and ANN.
first_indexed 2024-12-19T10:34:49Z
format Article
id doaj.art-c323996368634bed96cf5d9f298a8f8f
institution Directory Open Access Journal
issn 2251-6085
2251-6093
language English
last_indexed 2024-12-19T10:34:49Z
publishDate 2016-10-01
publisher Tehran University of Medical Sciences
record_format Article
series Iranian Journal of Public Health
spelling doaj.art-c323996368634bed96cf5d9f298a8f8f2022-12-21T20:25:39ZengTehran University of Medical SciencesIranian Journal of Public Health2251-60852251-60932016-10-014595018Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural NetworkPayam AMINI0Hasan AHMADINIA1Jalal POOROLAJAL2Mohammad MOQADDASI AMIRI3Dept. of Epidemiology & Reproductive Health, Reproductive Epidemiology Research Center, Royan Institute for Reproductive Biomedicine, ACECR, Tehran, IranDept. of Biostatistics & Epidemiology, Hamadan University of Medical Sciences, Hamadan, IranResearch Center for Health Sciences and Dept. of Biostatistics & Epidemiology, Hamadan University of Medical Sciences, Hamadan, IranDept. of Biostatistics & Epidemiology, Hamadan University of Medical Sciences, Hamadan, IranBackground: We aimed to assess the high-risk group for suicide using different classification methods includinglogistic regression (LR), decision tree (DT), artificial neural network (ANN), and support vector machine (SVM). Methods: We used the dataset of a study conducted to predict risk factors of completed suicide in Hamadan Province, the west of Iran, in 2010. To evaluate the high-risk groups for suicide, LR, SVM, DT and ANNwere performed. The applied methods were compared using sensitivity, specificity, positive predicted value, negative predicted value, accuracy and the area under curve. Cochran-Q test was implied to check differences in proportion among methods. To assess the association between the observed and predicted values, Ø coefficient, contingency coefficient, and Kendall tau-b were calculated. Results: Gender, age, and job were the most important risk factors for fatal suicide attempts in common for four methods. SVM method showed the highest accuracy 0.68 and 0.67 for training and testing sample, respectively. However, this method resulted in the highest specificity (0.67 for training and 0.68 for testing sample) and the highest sensitivity for training sample (0.85), but the lowest sensitivity for the testing sample (0.53). Cochran-Q test resulted in differences between proportions in different methods (P<0.001). The association of SVM predictions and observed values, Ø coefficient, contingency coefficient, and Kendall tau-b were 0.239, 0.232 and 0.239, respectively. Conclusion: SVM had the best performance to classify fatal suicide attempts comparing to DT, LR and ANN.https://ijph.tums.ac.ir/index.php/ijph/article/view/7869SuicideSupport vector machineNeuralnetworksLogistic regressionDecision treeClassification
spellingShingle Payam AMINI
Hasan AHMADINIA
Jalal POOROLAJAL
Mohammad MOQADDASI AMIRI
Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
Iranian Journal of Public Health
Suicide
Support vector machine
Neuralnetworks
Logistic regression
Decision tree
Classification
title Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_full Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_fullStr Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_full_unstemmed Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_short Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_sort evaluating the high risk groups for suicide a comparison of logistic regression support vector machine decision tree and artificial neural network
topic Suicide
Support vector machine
Neuralnetworks
Logistic regression
Decision tree
Classification
url https://ijph.tums.ac.ir/index.php/ijph/article/view/7869
work_keys_str_mv AT payamamini evaluatingthehighriskgroupsforsuicideacomparisonoflogisticregressionsupportvectormachinedecisiontreeandartificialneuralnetwork
AT hasanahmadinia evaluatingthehighriskgroupsforsuicideacomparisonoflogisticregressionsupportvectormachinedecisiontreeandartificialneuralnetwork
AT jalalpoorolajal evaluatingthehighriskgroupsforsuicideacomparisonoflogisticregressionsupportvectormachinedecisiontreeandartificialneuralnetwork
AT mohammadmoqaddasiamiri evaluatingthehighriskgroupsforsuicideacomparisonoflogisticregressionsupportvectormachinedecisiontreeandartificialneuralnetwork