THE EFFECTIVENESS ANALYSIS OF RANDOM FOREST ALGORITHMS WITH SMOTE TECHNIQUE IN PREDICTING LUNG CANCER RISK

When compared with other types of cancer, most of the population with cancer die from lung cancer.A person needs to do a screening test through X-rays, CT scans, and MRI to detect the disease. However, before carrying out the process, the doctor will ordinarily investigate a medical history and phys...

Full description

Bibliographic Details
Main Authors: Ita Yulianti, Ami Rahmawati, Tati Mardiana
Format: Article
Language:English
Published: Kresnamedia Publisher 2022-03-01
Series:Jurnal Riset Informatika
Subjects:
Online Access:https://ejournal.kresnamediapublisher.com/index.php/jri/article/view/385
_version_ 1798020444770533376
author Ita Yulianti
Ami Rahmawati
Tati Mardiana
author_facet Ita Yulianti
Ami Rahmawati
Tati Mardiana
author_sort Ita Yulianti
collection DOAJ
description When compared with other types of cancer, most of the population with cancer die from lung cancer.A person needs to do a screening test through X-rays, CT scans, and MRI to detect the disease. However, before carrying out the process, the doctor will ordinarily investigate a medical history and physical examination first to study the symptoms and possible risk factors for lung cancer. The lung cancer data set has a class imbalance that affects the performance of the random forest algorithm in predicting the risk of lung cancer. This study aims to employ the SMOTE technique to the random forest algorithm to increase accuracy in predicting lung cancer risk. In this research, data processing and analysis use the Python programming language. The test results show an accuracy value of 88% with an AUC value of 0.93. When employing the random forest method to forecast lung cancer risk, the SMOTE technique is useful in dealing with class imbalances in the data set.
first_indexed 2024-04-11T16:57:36Z
format Article
id doaj.art-286c67c43b27450db4a9a88ccfa56d9f
institution Directory Open Access Journal
issn 2656-1743
2656-1735
language English
last_indexed 2024-04-11T16:57:36Z
publishDate 2022-03-01
publisher Kresnamedia Publisher
record_format Article
series Jurnal Riset Informatika
spelling doaj.art-286c67c43b27450db4a9a88ccfa56d9f2022-12-22T04:13:14ZengKresnamedia PublisherJurnal Riset Informatika2656-17432656-17352022-03-014220721410.34288/jri.v4i2.385385THE EFFECTIVENESS ANALYSIS OF RANDOM FOREST ALGORITHMS WITH SMOTE TECHNIQUE IN PREDICTING LUNG CANCER RISKIta Yulianti0Ami Rahmawati1Tati MardianaUniversitas Bina Sarana InformatikaUniversitas Nusa MandiriWhen compared with other types of cancer, most of the population with cancer die from lung cancer.A person needs to do a screening test through X-rays, CT scans, and MRI to detect the disease. However, before carrying out the process, the doctor will ordinarily investigate a medical history and physical examination first to study the symptoms and possible risk factors for lung cancer. The lung cancer data set has a class imbalance that affects the performance of the random forest algorithm in predicting the risk of lung cancer. This study aims to employ the SMOTE technique to the random forest algorithm to increase accuracy in predicting lung cancer risk. In this research, data processing and analysis use the Python programming language. The test results show an accuracy value of 88% with an AUC value of 0.93. When employing the random forest method to forecast lung cancer risk, the SMOTE technique is useful in dealing with class imbalances in the data set.https://ejournal.kresnamediapublisher.com/index.php/jri/article/view/385lung cancerpythonrandom forestsmote
spellingShingle Ita Yulianti
Ami Rahmawati
Tati Mardiana
THE EFFECTIVENESS ANALYSIS OF RANDOM FOREST ALGORITHMS WITH SMOTE TECHNIQUE IN PREDICTING LUNG CANCER RISK
Jurnal Riset Informatika
lung cancer
python
random forest
smote
title THE EFFECTIVENESS ANALYSIS OF RANDOM FOREST ALGORITHMS WITH SMOTE TECHNIQUE IN PREDICTING LUNG CANCER RISK
title_full THE EFFECTIVENESS ANALYSIS OF RANDOM FOREST ALGORITHMS WITH SMOTE TECHNIQUE IN PREDICTING LUNG CANCER RISK
title_fullStr THE EFFECTIVENESS ANALYSIS OF RANDOM FOREST ALGORITHMS WITH SMOTE TECHNIQUE IN PREDICTING LUNG CANCER RISK
title_full_unstemmed THE EFFECTIVENESS ANALYSIS OF RANDOM FOREST ALGORITHMS WITH SMOTE TECHNIQUE IN PREDICTING LUNG CANCER RISK
title_short THE EFFECTIVENESS ANALYSIS OF RANDOM FOREST ALGORITHMS WITH SMOTE TECHNIQUE IN PREDICTING LUNG CANCER RISK
title_sort effectiveness analysis of random forest algorithms with smote technique in predicting lung cancer risk
topic lung cancer
python
random forest
smote
url https://ejournal.kresnamediapublisher.com/index.php/jri/article/view/385
work_keys_str_mv AT itayulianti theeffectivenessanalysisofrandomforestalgorithmswithsmotetechniqueinpredictinglungcancerrisk
AT amirahmawati theeffectivenessanalysisofrandomforestalgorithmswithsmotetechniqueinpredictinglungcancerrisk
AT tatimardiana theeffectivenessanalysisofrandomforestalgorithmswithsmotetechniqueinpredictinglungcancerrisk
AT itayulianti effectivenessanalysisofrandomforestalgorithmswithsmotetechniqueinpredictinglungcancerrisk
AT amirahmawati effectivenessanalysisofrandomforestalgorithmswithsmotetechniqueinpredictinglungcancerrisk
AT tatimardiana effectivenessanalysisofrandomforestalgorithmswithsmotetechniqueinpredictinglungcancerrisk