Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease

The spread of Lumpy Skin Disease (LSD) that infects livestock is increasingly widespread in various parts of the world. Early detection of the disease’s spread is necessary so that the economic losses caused by LSD are not higher. The use of machine learning algorithms to predict the presence of a d...

Full description

Bibliographic Details
Main Authors:	Suparyati Suparyati, Emma Utami, Alva Hendi Muhammad
Format:	Article
Language:	English
Published:	Ikatan Ahli Informatika Indonesia 2022-08-01
Series:	Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:	genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest
Online Access:	http://jurnal.iaii.or.id/index.php/RESTI/article/view/4147

_version_	1797334050530131968
author	Suparyati Suparyati Emma Utami Alva Hendi Muhammad
author_facet	Suparyati Suparyati Emma Utami Alva Hendi Muhammad
author_sort	Suparyati Suparyati
collection	DOAJ
description	The spread of Lumpy Skin Disease (LSD) that infects livestock is increasingly widespread in various parts of the world. Early detection of the disease’s spread is necessary so that the economic losses caused by LSD are not higher. The use of machine learning algorithms to predict the presence of a disease has been carried out, including in the field of animal health. The study aims to predict the presence of LSD in an area by utilizing the LSD dataset obtained from Mendeley Data. The number of lumpy infected cases is so low that it creates imbalanced data, posing a challenge in training machine learning models. Handling the unbalanced data is performed by sampling technique using the Random Under-sampling technique and Synthetic Minority Oversampling Technique (SMOTE). The Random Forest classification model was trained on sample data to predict cases of lumpy infection. The Random Forest classifier performs very well on both under-sampling and oversampling data. Measurement of performance metrics shows that SMOTE has a superior score of 1-2% compared to the use of Random Undersampling. Furthermore, Re-call rate, which is the metric we want to maximize in identifying lumpy cases, is superior when using SMOTE and has slightly better precision than Random Undersampling. This research only focuses on how to balance unbalanced data classes so that the optimization of the model has not been implemented, which creates opportunities for further research in the future.
first_indexed	2024-03-08T08:15:03Z
format	Article
id	doaj.art-046889d61ef04914b705a8d0e50bf5f4
institution	Directory Open Access Journal
issn	2580-0760
language	English
last_indexed	2024-03-08T08:15:03Z
publishDate	2022-08-01
publisher	Ikatan Ahli Informatika Indonesia
record_format	Article
series	Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling	doaj.art-046889d61ef04914b705a8d0e50bf5f42024-02-02T07:42:33ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602022-08-016455556210.29207/resti.v6i4.41474147Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin DiseaseSuparyati Suparyati0Emma Utami1Alva Hendi Muhammad2Universitas Amikom YogyakartaUniversitas Amikom YogyakartaUniversitas Amikom YogyakartaThe spread of Lumpy Skin Disease (LSD) that infects livestock is increasingly widespread in various parts of the world. Early detection of the disease’s spread is necessary so that the economic losses caused by LSD are not higher. The use of machine learning algorithms to predict the presence of a disease has been carried out, including in the field of animal health. The study aims to predict the presence of LSD in an area by utilizing the LSD dataset obtained from Mendeley Data. The number of lumpy infected cases is so low that it creates imbalanced data, posing a challenge in training machine learning models. Handling the unbalanced data is performed by sampling technique using the Random Under-sampling technique and Synthetic Minority Oversampling Technique (SMOTE). The Random Forest classification model was trained on sample data to predict cases of lumpy infection. The Random Forest classifier performs very well on both under-sampling and oversampling data. Measurement of performance metrics shows that SMOTE has a superior score of 1-2% compared to the use of Random Undersampling. Furthermore, Re-call rate, which is the metric we want to maximize in identifying lumpy cases, is superior when using SMOTE and has slightly better precision than Random Undersampling. This research only focuses on how to balance unbalanced data classes so that the optimization of the model has not been implemented, which creates opportunities for further research in the future.http://jurnal.iaii.or.id/index.php/RESTI/article/view/4147genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest
spellingShingle	Suparyati Suparyati Emma Utami Alva Hendi Muhammad Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest
title	Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease
title_full	Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease
title_fullStr	Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease
title_full_unstemmed	Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease
title_short	Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease
title_sort	applying different resampling strategies in random forest algorithm to predict lumpy skin disease
topic	genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest
url	http://jurnal.iaii.or.id/index.php/RESTI/article/view/4147
work_keys_str_mv	AT suparyatisuparyati applyingdifferentresamplingstrategiesinrandomforestalgorithmtopredictlumpyskindisease AT emmautami applyingdifferentresamplingstrategiesinrandomforestalgorithmtopredictlumpyskindisease AT alvahendimuhammad applyingdifferentresamplingstrategiesinrandomforestalgorithmtopredictlumpyskindisease

Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease

Similar Items