The Performance of Different Artificial Intelligence Models in Predicting Breast Cancer among Individuals Having Type 2 Diabetes Mellitus

<i>Objective:</i> Early reports indicate that individuals with type 2 diabetes mellitus (T2DM) may have a greater incidence of breast malignancy than patients without T2DM. The aim of this study was to investigate the effectiveness of three different models for predicting risk of breast...

Full description

Bibliographic Details
Main Authors: Meng-Hsuen Hsieh, Li-Min Sun, Cheng-Li Lin, Meng-Ju Hsieh, Chung Y. Hsu, Chia-Hung Kao
Format: Article
Language:English
Published: MDPI AG 2019-11-01
Series:Cancers
Subjects:
Online Access:https://www.mdpi.com/2072-6694/11/11/1751
_version_ 1797713587671662592
author Meng-Hsuen Hsieh
Li-Min Sun
Cheng-Li Lin
Meng-Ju Hsieh
Chung Y. Hsu
Chia-Hung Kao
author_facet Meng-Hsuen Hsieh
Li-Min Sun
Cheng-Li Lin
Meng-Ju Hsieh
Chung Y. Hsu
Chia-Hung Kao
author_sort Meng-Hsuen Hsieh
collection DOAJ
description <i>Objective:</i> Early reports indicate that individuals with type 2 diabetes mellitus (T2DM) may have a greater incidence of breast malignancy than patients without T2DM. The aim of this study was to investigate the effectiveness of three different models for predicting risk of breast cancer in patients with T2DM of different characteristics. <i>Study design and methodology:</i> From 2000 to 2012, data on 636,111 newly diagnosed female T2DM patients were available in the Taiwan’s National Health Insurance Research Database. By applying their data, a risk prediction model of breast cancer in patients with T2DM was created. We also collected data on potential predictors of breast cancer so that adjustments for their effect could be made in the analysis. Synthetic Minority Oversampling Technology (SMOTE) was utilized to increase data for small population samples. Each datum was randomly assigned based on a ratio of about 39:1 into the training and test sets. Logistic Regression (LR), Artificial Neural Network (ANN) and Random Forest (RF) models were determined using recall, accuracy, F<sub>1</sub> score and area under the receiver operating characteristic curve (AUC). <i>Results:</i> The AUC of the LR (0.834), ANN (0.865), and RF (0.959) models were found. The largest AUC among the three models was seen in the RF model. <i>Conclusions:</i> Although the LR, ANN, and RF models all showed high accuracy predicting the risk of breast cancer in Taiwanese with T2DM, the RF model performed best.
first_indexed 2024-03-12T07:38:49Z
format Article
id doaj.art-deb747b3d9f648dcabe9c2f7dd93afe9
institution Directory Open Access Journal
issn 2072-6694
language English
last_indexed 2024-03-12T07:38:49Z
publishDate 2019-11-01
publisher MDPI AG
record_format Article
series Cancers
spelling doaj.art-deb747b3d9f648dcabe9c2f7dd93afe92023-09-02T21:26:37ZengMDPI AGCancers2072-66942019-11-011111175110.3390/cancers11111751cancers11111751The Performance of Different Artificial Intelligence Models in Predicting Breast Cancer among Individuals Having Type 2 Diabetes MellitusMeng-Hsuen Hsieh0Li-Min Sun1Cheng-Li Lin2Meng-Ju Hsieh3Chung Y. Hsu4Chia-Hung Kao5Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USADepartment of Radiation Oncology, Zuoying Branch of Kaohsiung Armed Forces General Hospital, Kaohsiung 813, TaiwanManagement Office for Health Data, China Medical University Hospital, Taichung 404, TaiwanDepartment of Medicine, Poznan University of Medical Sciences, 60965 Poznan, PolandGraduate Institute of Biomedical Sciences, China Medical University, Taichung 404, TaiwanGraduate Institute of Biomedical Sciences, China Medical University, Taichung 404, Taiwan<i>Objective:</i> Early reports indicate that individuals with type 2 diabetes mellitus (T2DM) may have a greater incidence of breast malignancy than patients without T2DM. The aim of this study was to investigate the effectiveness of three different models for predicting risk of breast cancer in patients with T2DM of different characteristics. <i>Study design and methodology:</i> From 2000 to 2012, data on 636,111 newly diagnosed female T2DM patients were available in the Taiwan’s National Health Insurance Research Database. By applying their data, a risk prediction model of breast cancer in patients with T2DM was created. We also collected data on potential predictors of breast cancer so that adjustments for their effect could be made in the analysis. Synthetic Minority Oversampling Technology (SMOTE) was utilized to increase data for small population samples. Each datum was randomly assigned based on a ratio of about 39:1 into the training and test sets. Logistic Regression (LR), Artificial Neural Network (ANN) and Random Forest (RF) models were determined using recall, accuracy, F<sub>1</sub> score and area under the receiver operating characteristic curve (AUC). <i>Results:</i> The AUC of the LR (0.834), ANN (0.865), and RF (0.959) models were found. The largest AUC among the three models was seen in the RF model. <i>Conclusions:</i> Although the LR, ANN, and RF models all showed high accuracy predicting the risk of breast cancer in Taiwanese with T2DM, the RF model performed best.https://www.mdpi.com/2072-6694/11/11/1751type ii diabetes mellitusbreast cancerartificial neural networklogistic regressionrandom forest
spellingShingle Meng-Hsuen Hsieh
Li-Min Sun
Cheng-Li Lin
Meng-Ju Hsieh
Chung Y. Hsu
Chia-Hung Kao
The Performance of Different Artificial Intelligence Models in Predicting Breast Cancer among Individuals Having Type 2 Diabetes Mellitus
Cancers
type ii diabetes mellitus
breast cancer
artificial neural network
logistic regression
random forest
title The Performance of Different Artificial Intelligence Models in Predicting Breast Cancer among Individuals Having Type 2 Diabetes Mellitus
title_full The Performance of Different Artificial Intelligence Models in Predicting Breast Cancer among Individuals Having Type 2 Diabetes Mellitus
title_fullStr The Performance of Different Artificial Intelligence Models in Predicting Breast Cancer among Individuals Having Type 2 Diabetes Mellitus
title_full_unstemmed The Performance of Different Artificial Intelligence Models in Predicting Breast Cancer among Individuals Having Type 2 Diabetes Mellitus
title_short The Performance of Different Artificial Intelligence Models in Predicting Breast Cancer among Individuals Having Type 2 Diabetes Mellitus
title_sort performance of different artificial intelligence models in predicting breast cancer among individuals having type 2 diabetes mellitus
topic type ii diabetes mellitus
breast cancer
artificial neural network
logistic regression
random forest
url https://www.mdpi.com/2072-6694/11/11/1751
work_keys_str_mv AT menghsuenhsieh theperformanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT liminsun theperformanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT chenglilin theperformanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT mengjuhsieh theperformanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT chungyhsu theperformanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT chiahungkao theperformanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT menghsuenhsieh performanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT liminsun performanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT chenglilin performanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT mengjuhsieh performanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT chungyhsu performanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus
AT chiahungkao performanceofdifferentartificialintelligencemodelsinpredictingbreastcanceramongindividualshavingtype2diabetesmellitus