Comparison of Breast Cancer Classification Using the Decision Tree ID3 Algorithm and K-Nearest Neighbors Algorithm

One of the leading causes of death is cancer. The most common cancer in women is breast cancer. Breast cancer (Carcinoma mammae) is a malignant neoplasm originating from the parenchyma. Breast cancer ranks first in terms of the highest number of cancers in Indonesia and is among the first contributo...

Full description

Bibliographic Details
Main Authors: Zyhan Faradilla Daldiri, Desti Fitriati
Format: Article
Language:English
Published: Kresnamedia Publisher 2023-03-01
Series:Jurnal Riset Informatika
Subjects:
Online Access:https://ejournal.kresnamediapublisher.com/index.php/jri/article/view/406
_version_ 1797854298255654912
author Zyhan Faradilla Daldiri
Desti Fitriati
author_facet Zyhan Faradilla Daldiri
Desti Fitriati
author_sort Zyhan Faradilla Daldiri
collection DOAJ
description One of the leading causes of death is cancer. The most common cancer in women is breast cancer. Breast cancer (Carcinoma mammae) is a malignant neoplasm originating from the parenchyma. Breast cancer ranks first in terms of the highest number of cancers in Indonesia and is among the first contributors to cancer deaths. Globocan data in 2020 shows that the number of new breast cancer cases reached 68,858 (16.6%) of the total 396,914 new cancer cases in Indonesia. Meanwhile, deaths reached more than 22 thousand cases (Romkom, 2022). This death rate is increasing due to insufficient information about breast cancer’s early symptoms and dangers. Of this lack of information, a system is needed that can provide information about breast cancer, such as early diagnosis. Several parameters and classification data mining techniques can predict which patients will develop breast cancer and which do not. In this study, a comparison of the classification of breast cancer using the Decision Tree ID3 algorithm and the K-Nearest Neighbors algorithm will be carried out. Attribute data consists of Menopause, Tumor-Size, Node-Caps, Deg-Malig, Breast-Squad,, and Irradiant. The main objective of this study is to improve classification performance in breast cancer diagnosis by applying feature selection to several classification algorithms. The Decision Tree ID3 algorithm has an accuracy rate of 93.333%, and the K-Nearest Neighbors algorithm has an accuracy rate of 76.6667%.
first_indexed 2024-04-09T20:04:45Z
format Article
id doaj.art-d5392a9ecfc64a26b7c698834310a90c
institution Directory Open Access Journal
issn 2656-1743
2656-1735
language English
last_indexed 2024-04-09T20:04:45Z
publishDate 2023-03-01
publisher Kresnamedia Publisher
record_format Article
series Jurnal Riset Informatika
spelling doaj.art-d5392a9ecfc64a26b7c698834310a90c2023-04-02T06:25:50ZengKresnamedia PublisherJurnal Riset Informatika2656-17432656-17352023-03-015217718610.34288/jri.v5i2.406406Comparison of Breast Cancer Classification Using the Decision Tree ID3 Algorithm and K-Nearest Neighbors AlgorithmZyhan Faradilla Daldiri0Desti Fitriati1Universitas PancasilaUniversitas PancasilaOne of the leading causes of death is cancer. The most common cancer in women is breast cancer. Breast cancer (Carcinoma mammae) is a malignant neoplasm originating from the parenchyma. Breast cancer ranks first in terms of the highest number of cancers in Indonesia and is among the first contributors to cancer deaths. Globocan data in 2020 shows that the number of new breast cancer cases reached 68,858 (16.6%) of the total 396,914 new cancer cases in Indonesia. Meanwhile, deaths reached more than 22 thousand cases (Romkom, 2022). This death rate is increasing due to insufficient information about breast cancer’s early symptoms and dangers. Of this lack of information, a system is needed that can provide information about breast cancer, such as early diagnosis. Several parameters and classification data mining techniques can predict which patients will develop breast cancer and which do not. In this study, a comparison of the classification of breast cancer using the Decision Tree ID3 algorithm and the K-Nearest Neighbors algorithm will be carried out. Attribute data consists of Menopause, Tumor-Size, Node-Caps, Deg-Malig, Breast-Squad,, and Irradiant. The main objective of this study is to improve classification performance in breast cancer diagnosis by applying feature selection to several classification algorithms. The Decision Tree ID3 algorithm has an accuracy rate of 93.333%, and the K-Nearest Neighbors algorithm has an accuracy rate of 76.6667%.https://ejournal.kresnamediapublisher.com/index.php/jri/article/view/406breast cancerwomanclassificationcomparisondecision tree algorithmk-nearest neighbors algorithm
spellingShingle Zyhan Faradilla Daldiri
Desti Fitriati
Comparison of Breast Cancer Classification Using the Decision Tree ID3 Algorithm and K-Nearest Neighbors Algorithm
Jurnal Riset Informatika
breast cancer
woman
classification
comparison
decision tree algorithm
k-nearest neighbors algorithm
title Comparison of Breast Cancer Classification Using the Decision Tree ID3 Algorithm and K-Nearest Neighbors Algorithm
title_full Comparison of Breast Cancer Classification Using the Decision Tree ID3 Algorithm and K-Nearest Neighbors Algorithm
title_fullStr Comparison of Breast Cancer Classification Using the Decision Tree ID3 Algorithm and K-Nearest Neighbors Algorithm
title_full_unstemmed Comparison of Breast Cancer Classification Using the Decision Tree ID3 Algorithm and K-Nearest Neighbors Algorithm
title_short Comparison of Breast Cancer Classification Using the Decision Tree ID3 Algorithm and K-Nearest Neighbors Algorithm
title_sort comparison of breast cancer classification using the decision tree id3 algorithm and k nearest neighbors algorithm
topic breast cancer
woman
classification
comparison
decision tree algorithm
k-nearest neighbors algorithm
url https://ejournal.kresnamediapublisher.com/index.php/jri/article/view/406
work_keys_str_mv AT zyhanfaradilladaldiri comparisonofbreastcancerclassificationusingthedecisiontreeid3algorithmandknearestneighborsalgorithm
AT destifitriati comparisonofbreastcancerclassificationusingthedecisiontreeid3algorithmandknearestneighborsalgorithm