Diagnosing Breast Cancer by Machine Learning

Background and Aim: Cancer and in particular Breast cancer are among the diseases that have the highest mortality rate in Iran after heart disease. The accurate prognosis for Breast cancer is important, and the presence of various symptoms and features of this disease makes it difficult for doctors...

Full description

Bibliographic Details
Main Authors: Kasra Dolatkhahi, Adel Azar, Tooraj Karimi, Mohammad Hadizadeh
Format: Article
Language:fas
Published: Tehran University of Medical Sciences 2021-10-01
Series:پیاورد سلامت
Subjects:
Online Access:http://payavard.tums.ac.ir/article-1-7139-en.html
_version_ 1819279258323779584
author Kasra Dolatkhahi
Adel Azar
Tooraj Karimi
Mohammad Hadizadeh
author_facet Kasra Dolatkhahi
Adel Azar
Tooraj Karimi
Mohammad Hadizadeh
author_sort Kasra Dolatkhahi
collection DOAJ
description Background and Aim: Cancer and in particular Breast cancer are among the diseases that have the highest mortality rate in Iran after heart disease. The accurate prognosis for Breast cancer is important, and the presence of various symptoms and features of this disease makes it difficult for doctors to diagnose. This study aimed to identify the factors affecting Breast cancer, modeling and ultimately diagnosing the risk of Breast cancer. Materials and Methods: In the present study, first, by content analysis and library studies, the effective factors in Breast cancer were identified, then with the help of a team of experts consisting of physicians and subspecialists in Breast oncology and Breast surgery; With the help of the Delphi method, the factors were adjusted and 26 final factors that were numerically correct and string based on local and climatic conditions were approved. Then, according to the final factors and based on the medical records of 5208 patients in the Cancer Research Center of Shahid Beheshti University of medical sciences, to diagnose cancer, Decision Tree, Random Forest, and Support Vector Machine methods were used as machine learning methods. Results: In the first step, by content analysis method, 29 effective factors in Breast cancer were identified. Then, taking into account the indigenous and climatic conditions and using the Delphi method and also using the opinions of 18 Experts during three years, 26 factors were finalized. In the final step, using the medical records of the patients and the results obtained from the three methods mentioned, random forest, had the highest accuracy of 94.75% and precision of 97.26% in diagnosing Breast cancer. It has been noted that, compared to other similar studies, indigenous databases have been exploited, the accuracy obtained has been very close to previous studies, and in many cases much better. Conclusion: Using the random forest method and taking advantage of the factors affecting Breast cancer, the ability to diagnose cancer has been provided with greatest accuracy.
first_indexed 2024-12-24T00:25:03Z
format Article
id doaj.art-7243015686a84838ad8dc6e1a3f2bfd7
institution Directory Open Access Journal
issn 1735-8132
2008-2665
language fas
last_indexed 2024-12-24T00:25:03Z
publishDate 2021-10-01
publisher Tehran University of Medical Sciences
record_format Article
series پیاورد سلامت
spelling doaj.art-7243015686a84838ad8dc6e1a3f2bfd72022-12-21T17:24:28ZfasTehran University of Medical Sciencesپیاورد سلامت1735-81322008-26652021-10-01154340352Diagnosing Breast Cancer by Machine LearningKasra Dolatkhahi0Adel Azar1Tooraj Karimi2Mohammad Hadizadeh3 Ph.D. Candidate in Industrial Management Operations Research Orientation, Faculty of Management and Accounting, College of Farabi University of Tehran, Tehran, Iran Professor, Department of Industrial Management, Faculty of Management and Economics, Tarbiat Modares University, Tehran, Iran Assistant Professor, Department of Industrial and Technology Management, Faculty of Management and Accounting, College of Farabi University of Tehran, Iran Assistant Professor, Cancer Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran Background and Aim: Cancer and in particular Breast cancer are among the diseases that have the highest mortality rate in Iran after heart disease. The accurate prognosis for Breast cancer is important, and the presence of various symptoms and features of this disease makes it difficult for doctors to diagnose. This study aimed to identify the factors affecting Breast cancer, modeling and ultimately diagnosing the risk of Breast cancer. Materials and Methods: In the present study, first, by content analysis and library studies, the effective factors in Breast cancer were identified, then with the help of a team of experts consisting of physicians and subspecialists in Breast oncology and Breast surgery; With the help of the Delphi method, the factors were adjusted and 26 final factors that were numerically correct and string based on local and climatic conditions were approved. Then, according to the final factors and based on the medical records of 5208 patients in the Cancer Research Center of Shahid Beheshti University of medical sciences, to diagnose cancer, Decision Tree, Random Forest, and Support Vector Machine methods were used as machine learning methods. Results: In the first step, by content analysis method, 29 effective factors in Breast cancer were identified. Then, taking into account the indigenous and climatic conditions and using the Delphi method and also using the opinions of 18 Experts during three years, 26 factors were finalized. In the final step, using the medical records of the patients and the results obtained from the three methods mentioned, random forest, had the highest accuracy of 94.75% and precision of 97.26% in diagnosing Breast cancer. It has been noted that, compared to other similar studies, indigenous databases have been exploited, the accuracy obtained has been very close to previous studies, and in many cases much better. Conclusion: Using the random forest method and taking advantage of the factors affecting Breast cancer, the ability to diagnose cancer has been provided with greatest accuracy.http://payavard.tums.ac.ir/article-1-7139-en.htmlbreast cancercontent analysisdelphi methodrandom forestdecision treesupport vector machine
spellingShingle Kasra Dolatkhahi
Adel Azar
Tooraj Karimi
Mohammad Hadizadeh
Diagnosing Breast Cancer by Machine Learning
پیاورد سلامت
breast cancer
content analysis
delphi method
random forest
decision tree
support vector machine
title Diagnosing Breast Cancer by Machine Learning
title_full Diagnosing Breast Cancer by Machine Learning
title_fullStr Diagnosing Breast Cancer by Machine Learning
title_full_unstemmed Diagnosing Breast Cancer by Machine Learning
title_short Diagnosing Breast Cancer by Machine Learning
title_sort diagnosing breast cancer by machine learning
topic breast cancer
content analysis
delphi method
random forest
decision tree
support vector machine
url http://payavard.tums.ac.ir/article-1-7139-en.html
work_keys_str_mv AT kasradolatkhahi diagnosingbreastcancerbymachinelearning
AT adelazar diagnosingbreastcancerbymachinelearning
AT toorajkarimi diagnosingbreastcancerbymachinelearning
AT mohammadhadizadeh diagnosingbreastcancerbymachinelearning