Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease
Lumpy Skin Disease (LSD), which infects livestock, is spreading in various parts of the world. Early detection of its spread is needed to limit the economic losses it causes. The use of machine learning algorithms to predict the presence of a d...
Main Authors: | Suparyati Suparyati, Emma Utami, Alva Hendi Muhammad |
---|---|
Format: | Article |
Language: | English |
Published: | Ikatan Ahli Informatika Indonesia, 2022-08-01 |
Series: | Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) |
Subjects: | genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest |
Online Access: | http://jurnal.iaii.or.id/index.php/RESTI/article/view/4147 |
_version_ | 1797334050530131968 |
---|---|
author | Suparyati Suparyati; Emma Utami; Alva Hendi Muhammad |
author_facet | Suparyati Suparyati; Emma Utami; Alva Hendi Muhammad |
author_sort | Suparyati Suparyati |
collection | DOAJ |
description | Lumpy Skin Disease (LSD), which infects livestock, is spreading in various parts of the world. Early detection of its spread is needed to limit the economic losses it causes. Machine learning algorithms have been used to predict the presence of disease, including in the field of animal health. This study aims to predict the presence of LSD in an area using the LSD dataset obtained from Mendeley Data. The number of lumpy-infected cases is so low that the data are imbalanced, which poses a challenge in training machine learning models. The imbalance is handled with two sampling techniques: Random Under-sampling and the Synthetic Minority Oversampling Technique (SMOTE). A Random Forest classification model was trained on the resampled data to predict cases of lumpy infection. The Random Forest classifier performs very well on both the under-sampled and the oversampled data. The performance metrics show that SMOTE scores 1-2% higher than Random Under-sampling. Furthermore, recall, which is the metric we want to maximize in identifying lumpy cases, is higher with SMOTE, which also has slightly better precision than Random Under-sampling. This research focuses only on balancing the imbalanced classes, so model optimization has not been implemented, which leaves an opportunity for further research. |
first_indexed | 2024-03-08T08:15:03Z |
format | Article |
id | doaj.art-046889d61ef04914b705a8d0e50bf5f4 |
institution | Directory Open Access Journal |
issn | 2580-0760 |
language | English |
last_indexed | 2024-03-08T08:15:03Z |
publishDate | 2022-08-01 |
publisher | Ikatan Ahli Informatika Indonesia |
record_format | Article |
series | Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) |
spelling | doaj.art-046889d61ef04914b705a8d0e50bf5f4; 2024-02-02T07:42:33Z; eng; Ikatan Ahli Informatika Indonesia; Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi); 2580-0760; 2022-08-01; 6; 4; 555; 562; 10.29207/resti.v6i4.4147; 4147; Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease; Suparyati Suparyati 0; Emma Utami 1; Alva Hendi Muhammad 2; Universitas Amikom Yogyakarta; Universitas Amikom Yogyakarta; Universitas Amikom Yogyakarta; http://jurnal.iaii.or.id/index.php/RESTI/article/view/4147; genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest |
spellingShingle | Suparyati Suparyati Emma Utami Alva Hendi Muhammad Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest |
title | Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease |
title_full | Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease |
title_fullStr | Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease |
title_full_unstemmed | Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease |
title_short | Applying Different Resampling Strategies In Random Forest Algorithm To Predict Lumpy Skin Disease |
title_sort | applying different resampling strategies in random forest algorithm to predict lumpy skin disease |
topic | genetic algorithm, hyperparameter tuning, lumpy skin disease, machine learning, random forest |
url | http://jurnal.iaii.or.id/index.php/RESTI/article/view/4147 |
work_keys_str_mv | AT suparyatisuparyati applyingdifferentresamplingstrategiesinrandomforestalgorithmtopredictlumpyskindisease AT emmautami applyingdifferentresamplingstrategiesinrandomforestalgorithmtopredictlumpyskindisease AT alvahendimuhammad applyingdifferentresamplingstrategiesinrandomforestalgorithmtopredictlumpyskindisease |
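
The two resampling strategies named in the description can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`random_undersample`, `smote_like_oversample`, `precision_recall`), the toy data, and the parameter `k` are all assumptions made for illustration, and the oversampler is only a simplified SMOTE-style interpolation between minority neighbours. The Random Forest training step is omitted; the last function shows how precision and recall, the metrics compared in the description, are computed from predictions.

```python
import random
from collections import Counter


def random_undersample(X, y, seed=0):
    """Randomly keep only as many rows per class as the smallest class has."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(X, y):
        by_class.setdefault(label, []).append(row)
    n_min = min(len(rows) for rows in by_class.values())
    X_out, y_out = [], []
    for label, rows in by_class.items():
        for row in rng.sample(rows, n_min):
            X_out.append(row)
            y_out.append(label)
    return X_out, y_out


def smote_like_oversample(X, y, minority, k=3, seed=0):
    """Add synthetic minority rows until the minority matches the largest class.

    Each synthetic row lies on the segment between a minority row and one of
    its k nearest minority neighbours (needs at least two minority rows).
    """
    rng = random.Random(seed)
    minority_rows = [row for row, label in zip(X, y) if label == minority]
    n_needed = max(Counter(y).values()) - len(minority_rows)
    X_out, y_out = list(X), list(y)
    for _ in range(n_needed):
        base = rng.choice(minority_rows)
        # Nearest minority neighbours by squared Euclidean distance.
        neighbours = sorted(
            (r for r in minority_rows if r is not base),
            key=lambda r: sum((a - b) ** 2 for a, b in zip(base, r)),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        X_out.append([a + gap * (b - a) for a, b in zip(base, other)])
        y_out.append(minority)
    return X_out, y_out


def precision_recall(y_true, y_pred, positive=1):
    """Precision and recall for the positive (here: infected) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical toy data: 4 positive rows and 16 negative rows.
X = [[float(i), float(i % 5)] for i in range(20)]
y = [1 if i < 4 else 0 for i in range(20)]

X_us, y_us = random_undersample(X, y)
X_sm, y_sm = smote_like_oversample(X, y, minority=1)
print(Counter(y_us), Counter(y_sm))
```

Under-sampling discards majority rows, so the balanced set here has 4 rows per class, while the SMOTE-style variant keeps every original row and grows the minority class to 16. In practice, resampling would be applied to the training split only, so that the test data keep the original class distribution.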