Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications

Using a decision support system (DSS) that classifies various cancers provides support to the clinicians/researchers to make better decisions that can aid in early cancer diagnosis, thereby reducing chances of incorrect disease diagnosis. Thus, this work aimed at designing a classification model tha...

Full description

Bibliographic Details
Main Authors: K Chandrashekar, Anagha S Setlur, Adithya Sabhapathi C, Satyam Suresh Raiker, Satyam Singh, Vidya Niranjan
Format: Article
Language:English
Published: SAGE Publishing 2023-01-01
Series:Cancer Informatics
Online Access:https://doi.org/10.1177/11769351221147244
_version_ 1797944160451297280
author K Chandrashekar
Anagha S Setlur
Adithya Sabhapathi C
Satyam Suresh Raiker
Satyam Singh
Vidya Niranjan
author_facet K Chandrashekar
Anagha S Setlur
Adithya Sabhapathi C
Satyam Suresh Raiker
Satyam Singh
Vidya Niranjan
author_sort K Chandrashekar
collection DOAJ
description Using a decision support system (DSS) that classifies various cancers provides support to the clinicians/researchers to make better decisions that can aid in early cancer diagnosis, thereby reducing chances of incorrect disease diagnosis. Thus, this work aimed at designing a classification model that can predict accurately for 5 different cancer types comprising of 20 cancer exomes, using the mutations identified from whole exome cancer analysis. Initially, a basic model was designed using supervised machine learning classification algorithms such as K-nearest neighbor (KNN), support vector machine (SVM), decision tree, naïve bayes and random forest (RF), among which decision tree and random forest performed better in terms of preliminary model accuracy. However, output predictions were incorrect due to less training scores. Thus, 16 essential features were then selected for model improvement using 2 approaches. All imbalanced datasets were balanced using SMOTE. In the first approach, all features from 20 cancer exome datasets were trained and models were designed using decision tree and random forest. Balanced datasets for decision tree model showed an accuracy of 77%, while with the RF model, the accuracy improved to 82% where all 5 cancer types were predicted correctly. Area under the curve for RF model was closer to 1, than decision tree model. In the second approach, all 15 datasets were trained, while 5 were tested. However, only 2 cancer types were predicted correctly. To cross validate RF model, Matthew’s correlation co-efficient (MCC) test was performed. For method 1, the MCC test and MCC cross validation was found to be 0.7796 and 0.9356 respectively. Likewise, for second approach, MCC was observed to be 0.9365, corroborating the accuracy of the designed model. The model was successfully deployed using Streamlit as a web application for easy use. This study presents insights for allowing easy cancer classifications.
first_indexed 2024-04-10T20:35:17Z
format Article
id doaj.art-b8a1d03eb0ae4eb7acd613684692cfce
institution Directory Open Access Journal
issn 1176-9351
language English
last_indexed 2024-04-10T20:35:17Z
publishDate 2023-01-01
publisher SAGE Publishing
record_format Article
series Cancer Informatics
spelling doaj.art-b8a1d03eb0ae4eb7acd613684692cfce2023-01-24T22:45:54ZengSAGE PublishingCancer Informatics1176-93512023-01-012210.1177/11769351221147244Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer ClassificationsK ChandrashekarAnagha S SetlurAdithya Sabhapathi CSatyam Suresh RaikerSatyam SinghVidya NiranjanUsing a decision support system (DSS) that classifies various cancers provides support to the clinicians/researchers to make better decisions that can aid in early cancer diagnosis, thereby reducing chances of incorrect disease diagnosis. Thus, this work aimed at designing a classification model that can predict accurately for 5 different cancer types comprising of 20 cancer exomes, using the mutations identified from whole exome cancer analysis. Initially, a basic model was designed using supervised machine learning classification algorithms such as K-nearest neighbor (KNN), support vector machine (SVM), decision tree, naïve bayes and random forest (RF), among which decision tree and random forest performed better in terms of preliminary model accuracy. However, output predictions were incorrect due to less training scores. Thus, 16 essential features were then selected for model improvement using 2 approaches. All imbalanced datasets were balanced using SMOTE. In the first approach, all features from 20 cancer exome datasets were trained and models were designed using decision tree and random forest. Balanced datasets for decision tree model showed an accuracy of 77%, while with the RF model, the accuracy improved to 82% where all 5 cancer types were predicted correctly. Area under the curve for RF model was closer to 1, than decision tree model. In the second approach, all 15 datasets were trained, while 5 were tested. However, only 2 cancer types were predicted correctly. To cross validate RF model, Matthew’s correlation co-efficient (MCC) test was performed. For method 1, the MCC test and MCC cross validation was found to be 0.7796 and 0.9356 respectively. Likewise, for second approach, MCC was observed to be 0.9365, corroborating the accuracy of the designed model. The model was successfully deployed using Streamlit as a web application for easy use. This study presents insights for allowing easy cancer classifications.https://doi.org/10.1177/11769351221147244
spellingShingle K Chandrashekar
Anagha S Setlur
Adithya Sabhapathi C
Satyam Suresh Raiker
Satyam Singh
Vidya Niranjan
Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
Cancer Informatics
title Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_full Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_fullStr Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_full_unstemmed Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_short Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_sort decision support system and web application using supervised machine learning algorithms for easy cancer classifications
url https://doi.org/10.1177/11769351221147244
work_keys_str_mv AT kchandrashekar decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT anaghassetlur decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT adithyasabhapathic decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT satyamsureshraiker decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT satyamsingh decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT vidyaniranjan decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications