A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence

Introduction: Breast cancer is one of the most common cancers, and also it is the most common type of malignancy in Iranian women that has been growing in recent years. The risk of recurrence is usual in patient. Many factors may increase or decrease the recurrence rate. Data mining methods have bee...

Full description

Bibliographic Details
Main Authors: Behzad Kiani, Alireza Atashi
Format: Article
Language:fas
Published: Kerman University of Medical Sciences 2014-12-01
Series:مجله انفورماتیک سلامت و زیست پزشکی
Subjects:
Online Access:http://jhbmi.ir/article-1-65-en.html
_version_ 1828052205312147456
author Behzad Kiani
Alireza Atashi
author_facet Behzad Kiani
Alireza Atashi
author_sort Behzad Kiani
collection DOAJ
description Introduction: Breast cancer is one of the most common cancers, and also it is the most common type of malignancy in Iranian women that has been growing in recent years. The risk of recurrence is usual in patient. Many factors may increase or decrease the recurrence rate. Data mining methods have been used to diagnose or predict cancer and one of the most application of data mining approaches is prediction of breast cancer recurrence Method: This is a retrospective study. Collected data on 809 patients with breast cancer with 18 fields for each patient were used. Due to excessive missing data only about 665 cases have been used. Since the number of fields in the remaining records with null values have been observed, as a preprocessing and data preparation phases, these values have been estimated by the EM algorithm and using SPSS.v20 software. In this study, a model for prognosis of breast cancer recurrence among patients using J48 tree has been developed. Results: The specificity and sensitivity of the developed model are 53% and 85%, respectively. Moreover, only 14% of patients who have relapsed are known as false negative with developed model.  Conclusion: Creating a predictive model with appropriate specificity and sensitivity can warn patients about recurrence and timely preventive measures to prevent progression of the cancer. The False Negative rate is very important in medical prediction models that can make serious results/consequences. In present study this rate is about 14% that seems reasonable amount in term of modeling.
first_indexed 2024-04-10T19:50:01Z
format Article
id doaj.art-d9510865adc54e9bac3fa62f6387e05a
institution Directory Open Access Journal
issn 2423-3870
2423-3498
language fas
last_indexed 2024-04-10T19:50:01Z
publishDate 2014-12-01
publisher Kerman University of Medical Sciences
record_format Article
series مجله انفورماتیک سلامت و زیست پزشکی
spelling doaj.art-d9510865adc54e9bac3fa62f6387e05a2023-01-28T10:50:15ZfasKerman University of Medical Sciencesمجله انفورماتیک سلامت و زیست پزشکی2423-38702423-34982014-12-01112631A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer RecurrenceBehzad Kiani0Alireza Atashi1 Cancer Informatics Department, Breast Cancer Research Center, ACECR, Iran Introduction: Breast cancer is one of the most common cancers, and also it is the most common type of malignancy in Iranian women that has been growing in recent years. The risk of recurrence is usual in patient. Many factors may increase or decrease the recurrence rate. Data mining methods have been used to diagnose or predict cancer and one of the most application of data mining approaches is prediction of breast cancer recurrence Method: This is a retrospective study. Collected data on 809 patients with breast cancer with 18 fields for each patient were used. Due to excessive missing data only about 665 cases have been used. Since the number of fields in the remaining records with null values have been observed, as a preprocessing and data preparation phases, these values have been estimated by the EM algorithm and using SPSS.v20 software. In this study, a model for prognosis of breast cancer recurrence among patients using J48 tree has been developed. Results: The specificity and sensitivity of the developed model are 53% and 85%, respectively. Moreover, only 14% of patients who have relapsed are known as false negative with developed model.  Conclusion: Creating a predictive model with appropriate specificity and sensitivity can warn patients about recurrence and timely preventive measures to prevent progression of the cancer. The False Negative rate is very important in medical prediction models that can make serious results/consequences. In present study this rate is about 14% that seems reasonable amount in term of modeling.http://jhbmi.ir/article-1-65-en.htmlbreast cancerdata miningprognostic model
spellingShingle Behzad Kiani
Alireza Atashi
A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence
مجله انفورماتیک سلامت و زیست پزشکی
breast cancer
data mining
prognostic model
title A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence
title_full A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence
title_fullStr A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence
title_full_unstemmed A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence
title_short A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence
title_sort prognostic model based on data mining techniques to predict breast cancer recurrence
topic breast cancer
data mining
prognostic model
url http://jhbmi.ir/article-1-65-en.html
work_keys_str_mv AT behzadkiani aprognosticmodelbasedondataminingtechniquestopredictbreastcancerrecurrence
AT alirezaatashi aprognosticmodelbasedondataminingtechniquestopredictbreastcancerrecurrence
AT behzadkiani prognosticmodelbasedondataminingtechniquestopredictbreastcancerrecurrence
AT alirezaatashi prognosticmodelbasedondataminingtechniquestopredictbreastcancerrecurrence