Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease

Lumpy Skin Disease (LSD), which infects livestock, is increasingly widespread in various parts of the world. Early detection of the disease’s spread is necessary to limit the economic losses caused by LSD. The use of machine learning algorithms to predict the presence of a d...


Bibliographic Details
Main Authors: Suparyati Suparyati, Emma Utami, Alva Hendi Muhammad
Format: Article
Language: English
Published: Ikatan Ahli Informatika Indonesia 2022-08-01
Series: Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects: genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest
Online Access: http://jurnal.iaii.or.id/index.php/RESTI/article/view/4147
author Suparyati Suparyati
Emma Utami
Alva Hendi Muhammad
collection DOAJ
description Lumpy Skin Disease (LSD), which infects livestock, is increasingly widespread in various parts of the world. Early detection of the disease’s spread is necessary to limit the economic losses caused by LSD. Machine learning algorithms have already been used to predict the presence of disease, including in the field of animal health. This study aims to predict the presence of LSD in an area using the LSD dataset obtained from Mendeley Data. The number of lumpy-infected cases is so low that the data are imbalanced, which poses a challenge in training machine learning models. The imbalance is handled with two sampling techniques: Random Undersampling and the Synthetic Minority Oversampling Technique (SMOTE). A Random Forest classification model was trained on the sampled data to predict cases of lumpy infection. The Random Forest classifier performs very well on both the undersampled and the oversampled data. Performance metrics show that SMOTE scores 1-2% higher than Random Undersampling. Furthermore, recall, the metric we want to maximize when identifying lumpy cases, is higher with SMOTE, which also has slightly better precision than Random Undersampling. This research focuses only on balancing the imbalanced classes, so the model has not yet been optimized, which leaves room for further research.
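The two resampling strategies named in the description can be sketched with the Python standard library alone. This is a minimal illustration, not the authors' implementation: the record does not say which library or parameters were used, the function names `random_undersample` and `smote` are hypothetical, the toy data are invented, and Random Forest training is left out.

```python
import random
from collections import Counter


def random_undersample(X, y, seed=0):
    """Randomly drop rows until every class is as small as the rarest class."""
    rng = random.Random(seed)
    by_class = {}
    for features, label in zip(X, y):
        by_class.setdefault(label, []).append(features)
    target = min(len(rows) for rows in by_class.values())
    X_out, y_out = [], []
    for label, rows in by_class.items():
        for features in rng.sample(rows, target):
            X_out.append(features)
            y_out.append(label)
    return X_out, y_out


def smote(X, y, minority_label, k=5, seed=0):
    """Add synthetic minority rows until the minority matches the largest class.

    Each synthetic row lies on the line segment between a random minority row
    and one of its k nearest minority neighbours (needs >= 2 minority rows).
    """
    rng = random.Random(seed)
    minority = [f for f, label in zip(X, y) if label == minority_label]
    n_new = max(Counter(y).values()) - len(minority)
    X_out, y_out = list(X), list(y)
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority rows by squared Euclidean distance to base.
        neighbours = sorted(
            (row for row in minority if row is not base),
            key=lambda row: sum((a - b) ** 2 for a, b in zip(base, row)),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        X_out.append([a + gap * (b - a) for a, b in zip(base, other)])
        y_out.append(minority_label)
    return X_out, y_out


# Invented toy data: six "not infected" rows (0) and two "infected" rows (1).
X = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [0.3, 0.3],
     [0.1, 0.0], [0.2, 0.2], [2.0, 2.0], [2.2, 2.1]]
y = [0, 0, 0, 0, 0, 0, 1, 1]

X_under, y_under = random_undersample(X, y)
X_over, y_over = smote(X, y, minority_label=1, k=1)
print(Counter(y_under), Counter(y_over))
```

Undersampling discards majority rows, so it shrinks the training set, while SMOTE keeps every original row and adds interpolated minority rows. In practice either resampler would be applied to the training split only, so that the evaluation data keep their original class ratio.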
first_indexed 2024-03-08T08:15:03Z
format Article
id doaj.art-046889d61ef04914b705a8d0e50bf5f4
institution Directory Open Access Journal
issn 2580-0760
language English
last_indexed 2024-03-08T08:15:03Z
publishDate 2022-08-01
publisher Ikatan Ahli Informatika Indonesia
record_format Article
series Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
affiliation Universitas Amikom Yogyakarta (all three authors)
volume 6
issue 4
pages 555-562
doi 10.29207/resti.v6i4.4147
title Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease
topic genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest
url http://jurnal.iaii.or.id/index.php/RESTI/article/view/4147