A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence
Introduction: Breast cancer is one of the most common cancers, and also it is the most common type of malignancy in Iranian women that has been growing in recent years. The risk of recurrence is usual in patient. Many factors may increase or decrease the recurrence rate. Data mining methods have bee...
Main Authors: | , |
---|---|
Format: | Article |
Language: | fas |
Published: |
Kerman University of Medical Sciences
2014-12-01
|
Series: | مجله انفورماتیک سلامت و زیست پزشکی |
Subjects: | |
Online Access: | http://jhbmi.ir/article-1-65-en.html |
_version_ | 1828052205312147456 |
---|---|
author | Behzad Kiani Alireza Atashi |
author_facet | Behzad Kiani Alireza Atashi |
author_sort | Behzad Kiani |
collection | DOAJ |
description | Introduction: Breast cancer is one of the most common cancers, and also it is the most common type of malignancy in Iranian women that has been growing in recent years. The risk of recurrence is usual in patient. Many factors may increase or decrease the recurrence rate. Data mining methods have been used to diagnose or predict cancer and one of the most application of data mining approaches is prediction of breast cancer recurrence
Method: This is a retrospective study. Collected data on 809 patients with breast cancer with 18 fields for each patient were used. Due to excessive missing data only about 665 cases have been used. Since the number of fields in the remaining records with null values have been observed, as a preprocessing and data preparation phases, these values have been estimated by the EM algorithm and using SPSS.v20 software. In this study, a model for prognosis of breast cancer recurrence among patients using J48 tree has been developed. Results: The specificity and sensitivity of the developed model are 53% and 85%, respectively. Moreover, only 14% of patients who have relapsed are known as false negative with developed model.
Conclusion: Creating a predictive model with appropriate specificity and sensitivity can warn patients about recurrence and timely preventive measures to prevent progression of the cancer. The False Negative rate is very important in medical prediction models that can make serious results/consequences. In present study this rate is about 14% that seems reasonable amount in term of modeling. |
first_indexed | 2024-04-10T19:50:01Z |
format | Article |
id | doaj.art-d9510865adc54e9bac3fa62f6387e05a |
institution | Directory Open Access Journal |
issn | 2423-3870 2423-3498 |
language | fas |
last_indexed | 2024-04-10T19:50:01Z |
publishDate | 2014-12-01 |
publisher | Kerman University of Medical Sciences |
record_format | Article |
series | مجله انفورماتیک سلامت و زیست پزشکی |
spelling | doaj.art-d9510865adc54e9bac3fa62f6387e05a2023-01-28T10:50:15ZfasKerman University of Medical Sciencesمجله انفورماتیک سلامت و زیست پزشکی2423-38702423-34982014-12-01112631A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer RecurrenceBehzad Kiani0Alireza Atashi1 Cancer Informatics Department, Breast Cancer Research Center, ACECR, Iran Introduction: Breast cancer is one of the most common cancers, and also it is the most common type of malignancy in Iranian women that has been growing in recent years. The risk of recurrence is usual in patient. Many factors may increase or decrease the recurrence rate. Data mining methods have been used to diagnose or predict cancer and one of the most application of data mining approaches is prediction of breast cancer recurrence Method: This is a retrospective study. Collected data on 809 patients with breast cancer with 18 fields for each patient were used. Due to excessive missing data only about 665 cases have been used. Since the number of fields in the remaining records with null values have been observed, as a preprocessing and data preparation phases, these values have been estimated by the EM algorithm and using SPSS.v20 software. In this study, a model for prognosis of breast cancer recurrence among patients using J48 tree has been developed. Results: The specificity and sensitivity of the developed model are 53% and 85%, respectively. Moreover, only 14% of patients who have relapsed are known as false negative with developed model. Conclusion: Creating a predictive model with appropriate specificity and sensitivity can warn patients about recurrence and timely preventive measures to prevent progression of the cancer. The False Negative rate is very important in medical prediction models that can make serious results/consequences. In present study this rate is about 14% that seems reasonable amount in term of modeling.http://jhbmi.ir/article-1-65-en.htmlbreast cancerdata miningprognostic model |
spellingShingle | Behzad Kiani Alireza Atashi A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence مجله انفورماتیک سلامت و زیست پزشکی breast cancer data mining prognostic model |
title | A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence |
title_full | A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence |
title_fullStr | A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence |
title_full_unstemmed | A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence |
title_short | A Prognostic Model Based on Data Mining Techniques to Predict Breast Cancer Recurrence |
title_sort | prognostic model based on data mining techniques to predict breast cancer recurrence |
topic | breast cancer data mining prognostic model |
url | http://jhbmi.ir/article-1-65-en.html |
work_keys_str_mv | AT behzadkiani aprognosticmodelbasedondataminingtechniquestopredictbreastcancerrecurrence AT alirezaatashi aprognosticmodelbasedondataminingtechniquestopredictbreastcancerrecurrence AT behzadkiani prognosticmodelbasedondataminingtechniquestopredictbreastcancerrecurrence AT alirezaatashi prognosticmodelbasedondataminingtechniquestopredictbreastcancerrecurrence |