Application of support vector machine algorithm for early differential diagnosis of prostate cancer

Prostate cancer (PCa) symptoms are commonly confused with benign prostate hyperplasia (BPH), particularly in the early stages due to similarities between symptoms, and in some instances, underdiagnoses. Clinical methods have been utilized to diagnose PCa; however, at the full-blown stage, clinical m...

Full description

Bibliographic Details
Main Authors: Boluwaji A. Akinnuwesi, Kehinde A. Olayanju, Benjamin S. Aribisala, Stephen G. Fashoto, Elliot Mbunge, Moses Okpeku, Patrick Owate
Format: Article
Language:English
Published: KeAi Communications Co. Ltd. 2023-03-01
Series:Data Science and Management
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666764922000443
_version_ 1797807180528746496
author Boluwaji A. Akinnuwesi
Kehinde A. Olayanju
Benjamin S. Aribisala
Stephen G. Fashoto
Elliot Mbunge
Moses Okpeku
Patrick Owate
author_facet Boluwaji A. Akinnuwesi
Kehinde A. Olayanju
Benjamin S. Aribisala
Stephen G. Fashoto
Elliot Mbunge
Moses Okpeku
Patrick Owate
author_sort Boluwaji A. Akinnuwesi
collection DOAJ
description Prostate cancer (PCa) symptoms are commonly confused with benign prostate hyperplasia (BPH), particularly in the early stages due to similarities between symptoms, and in some instances, underdiagnoses. Clinical methods have been utilized to diagnose PCa; however, at the full-blown stage, clinical methods usually present high risks of complicated side effects. Therefore, we proposed the use of support vector machine for early differential diagnosis of PCa (SVM-PCa-EDD). SVM was used to classify persons with and without PCa. We used the PCa dataset from the Kaggle Healthcare repository to develop and validate SVM model for classification. The PCa dataset consisted of 250 features and one class of features. Attributes considered in this study were age, body mass index (BMI), race, family history, obesity, trouble urinating, urine stream force, blood in semen, bone pain, and erectile dysfunction. The SVM-PCa-EDD was used for preprocessing the PCa dataset, specifically dealing with class imbalance, and for dimensionality reduction. After eliminating class imbalance, the area under the receiver operating characteristic (ROC) curve (AUC) of the logistic regression (LR) model trained with the downsampled dataset was 58.4%, whereas that of the AUC-ROC of LR trained with the class imbalance dataset was 54.3%. The SVM-PCa-EDD achieved 90% accuracy, 80% sensitivity, and 80% specificity. The validation of SVM-PCa-EDD using random forest and LR showed that SVM-PCa-EDD performed better in early differential diagnosis of PCa. The proposed model can assist medical experts in early diagnosis of PCa, particularly in resource-constrained healthcare settings and making further recommendations for PCa testing and treatment.
first_indexed 2024-03-13T06:18:41Z
format Article
id doaj.art-6d54b4c028a148b383b27137b387dc03
institution Directory Open Access Journal
issn 2666-7649
language English
last_indexed 2024-03-13T06:18:41Z
publishDate 2023-03-01
publisher KeAi Communications Co. Ltd.
record_format Article
series Data Science and Management
spelling doaj.art-6d54b4c028a148b383b27137b387dc032023-06-10T04:28:41ZengKeAi Communications Co. Ltd.Data Science and Management2666-76492023-03-0161112Application of support vector machine algorithm for early differential diagnosis of prostate cancerBoluwaji A. Akinnuwesi0Kehinde A. Olayanju1Benjamin S. Aribisala2Stephen G. Fashoto3Elliot Mbunge4Moses Okpeku5Patrick Owate6Department of Computer Science, Faculty of Science and Engineering, University of Eswatini, Kwaluseni, M201, Swaziland; Corresponding author.Department of Computer Science Education, Federal College of Education (Technology), Akoka, Lagos State, 100213, NigeriaDepartment of Computer Science, Faculty of Science, Lagos State University, Ojo, Lagos State, 102101, NigeriaDepartment of Computer Science, Faculty of Science and Engineering, University of Eswatini, Kwaluseni, M201, SwazilandDepartment of Computer Science, Faculty of Science and Engineering, University of Eswatini, Kwaluseni, M201, SwazilandDepartment of Genetics, University of KwaZulu-Natal, Durban, 4041, South AfricaDepartment of Computer Science, Faculty of Science, Lagos State University, Ojo, Lagos State, 102101, NigeriaProstate cancer (PCa) symptoms are commonly confused with benign prostate hyperplasia (BPH), particularly in the early stages due to similarities between symptoms, and in some instances, underdiagnoses. Clinical methods have been utilized to diagnose PCa; however, at the full-blown stage, clinical methods usually present high risks of complicated side effects. Therefore, we proposed the use of support vector machine for early differential diagnosis of PCa (SVM-PCa-EDD). SVM was used to classify persons with and without PCa. We used the PCa dataset from the Kaggle Healthcare repository to develop and validate SVM model for classification. The PCa dataset consisted of 250 features and one class of features. Attributes considered in this study were age, body mass index (BMI), race, family history, obesity, trouble urinating, urine stream force, blood in semen, bone pain, and erectile dysfunction. The SVM-PCa-EDD was used for preprocessing the PCa dataset, specifically dealing with class imbalance, and for dimensionality reduction. After eliminating class imbalance, the area under the receiver operating characteristic (ROC) curve (AUC) of the logistic regression (LR) model trained with the downsampled dataset was 58.4%, whereas that of the AUC-ROC of LR trained with the class imbalance dataset was 54.3%. The SVM-PCa-EDD achieved 90% accuracy, 80% sensitivity, and 80% specificity. The validation of SVM-PCa-EDD using random forest and LR showed that SVM-PCa-EDD performed better in early differential diagnosis of PCa. The proposed model can assist medical experts in early diagnosis of PCa, particularly in resource-constrained healthcare settings and making further recommendations for PCa testing and treatment.http://www.sciencedirect.com/science/article/pii/S2666764922000443Confusable diseasesComputational intelligenceEarly differential diagnosisLogistic regressionProstate cancerSupport vector machine
spellingShingle Boluwaji A. Akinnuwesi
Kehinde A. Olayanju
Benjamin S. Aribisala
Stephen G. Fashoto
Elliot Mbunge
Moses Okpeku
Patrick Owate
Application of support vector machine algorithm for early differential diagnosis of prostate cancer
Data Science and Management
Confusable diseases
Computational intelligence
Early differential diagnosis
Logistic regression
Prostate cancer
Support vector machine
title Application of support vector machine algorithm for early differential diagnosis of prostate cancer
title_full Application of support vector machine algorithm for early differential diagnosis of prostate cancer
title_fullStr Application of support vector machine algorithm for early differential diagnosis of prostate cancer
title_full_unstemmed Application of support vector machine algorithm for early differential diagnosis of prostate cancer
title_short Application of support vector machine algorithm for early differential diagnosis of prostate cancer
title_sort application of support vector machine algorithm for early differential diagnosis of prostate cancer
topic Confusable diseases
Computational intelligence
Early differential diagnosis
Logistic regression
Prostate cancer
Support vector machine
url http://www.sciencedirect.com/science/article/pii/S2666764922000443
work_keys_str_mv AT boluwajiaakinnuwesi applicationofsupportvectormachinealgorithmforearlydifferentialdiagnosisofprostatecancer
AT kehindeaolayanju applicationofsupportvectormachinealgorithmforearlydifferentialdiagnosisofprostatecancer
AT benjaminsaribisala applicationofsupportvectormachinealgorithmforearlydifferentialdiagnosisofprostatecancer
AT stephengfashoto applicationofsupportvectormachinealgorithmforearlydifferentialdiagnosisofprostatecancer
AT elliotmbunge applicationofsupportvectormachinealgorithmforearlydifferentialdiagnosisofprostatecancer
AT mosesokpeku applicationofsupportvectormachinealgorithmforearlydifferentialdiagnosisofprostatecancer
AT patrickowate applicationofsupportvectormachinealgorithmforearlydifferentialdiagnosisofprostatecancer