The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis
The COVID-19 pandemic has significantly impacted Indonesia, necessitating a deeper understanding of public sentiment towards the crisis. This study investigates the performance of three prominent machine learning models: Bernoulli Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression,...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universitas Kristen Satya Wacana
2024-04-01
|
Series: | Techne |
Subjects: | |
Online Access: | https://ojs.jurnaltechne.org/index.php/techne/article/view/446 |
_version_ | 1797200799765364736 |
---|---|
author | Wahyu Dirgantara Fairuz Iqbal Maulana Subairi Rahman Arifuddin |
author_facet | Wahyu Dirgantara Fairuz Iqbal Maulana Subairi Rahman Arifuddin |
author_sort | Wahyu Dirgantara |
collection | DOAJ |
description |
The COVID-19 pandemic has significantly impacted Indonesia, necessitating a deeper understanding of public sentiment towards the crisis. This study investigates the performance of three prominent machine learning models: Bernoulli Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression, in analyzing sentiments related to COVID-19 in Indonesia. Utilizing a dataset comprising social media posts, the research aims to classify sentiments into positive, and negative categories, providing insights into the public's perception of the pandemic and associated measures. Sentiment analysis serves as a powerful tool to capture the collective emotions and opinions of the populace, which are pivotal in shaping public health responses and policies. The accuracy of LR and SVM is 99%, whereas Bayesian has an accuracy of 98%. We conclude that Logistic Regression and Support Vector Machine are the best model for the above dataset. This research evaluates these models' accuracy and reliability in the context of the Indonesian language, which influence sentiment interpretation. The findings of this study will contribute to the fields of natural language processing and public health by highlighting the efficacy of machine learning models in sentiment analysis during a health crisis. Moreover, the results will assist policymakers and health officials in understanding public sentiment, enabling them to tailor communication and interventions more effectively.
|
first_indexed | 2024-04-24T07:37:24Z |
format | Article |
id | doaj.art-69c3e191d75844c2879e7a7af18caca5 |
institution | Directory Open Access Journal |
issn | 1412-8292 2615-7772 |
language | English |
last_indexed | 2024-04-24T07:37:24Z |
publishDate | 2024-04-01 |
publisher | Universitas Kristen Satya Wacana |
record_format | Article |
series | Techne |
spelling | doaj.art-69c3e191d75844c2879e7a7af18caca52024-04-20T05:15:19ZengUniversitas Kristen Satya WacanaTechne1412-82922615-77722024-04-0123110.31358/techne.v23i1.446The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment AnalysisWahyu Dirgantara0Fairuz Iqbal Maulana1Subairi2Rahman Arifuddin3Universitas Merdeka MalangBina Nusantara UniversityUniversitas Merdeka MalangUniversitas Merdeka Malang The COVID-19 pandemic has significantly impacted Indonesia, necessitating a deeper understanding of public sentiment towards the crisis. This study investigates the performance of three prominent machine learning models: Bernoulli Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression, in analyzing sentiments related to COVID-19 in Indonesia. Utilizing a dataset comprising social media posts, the research aims to classify sentiments into positive, and negative categories, providing insights into the public's perception of the pandemic and associated measures. Sentiment analysis serves as a powerful tool to capture the collective emotions and opinions of the populace, which are pivotal in shaping public health responses and policies. The accuracy of LR and SVM is 99%, whereas Bayesian has an accuracy of 98%. We conclude that Logistic Regression and Support Vector Machine are the best model for the above dataset. This research evaluates these models' accuracy and reliability in the context of the Indonesian language, which influence sentiment interpretation. The findings of this study will contribute to the fields of natural language processing and public health by highlighting the efficacy of machine learning models in sentiment analysis during a health crisis. Moreover, the results will assist policymakers and health officials in understanding public sentiment, enabling them to tailor communication and interventions more effectively. https://ojs.jurnaltechne.org/index.php/techne/article/view/446COVID-19Machine LearningBernoulli Naïve BayesSupport Vector MachineLogistic RegressionSentiment Analysis |
spellingShingle | Wahyu Dirgantara Fairuz Iqbal Maulana Subairi Rahman Arifuddin The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis Techne COVID-19 Machine Learning Bernoulli Naïve Bayes Support Vector Machine Logistic Regression Sentiment Analysis |
title | The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis |
title_full | The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis |
title_fullStr | The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis |
title_full_unstemmed | The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis |
title_short | The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis |
title_sort | performance of machine learning model bernoulli naive bayes support vector machine and logistic regression on covid 19 in indonesia using sentiment analysis |
topic | COVID-19 Machine Learning Bernoulli Naïve Bayes Support Vector Machine Logistic Regression Sentiment Analysis |
url | https://ojs.jurnaltechne.org/index.php/techne/article/view/446 |
work_keys_str_mv | AT wahyudirgantara theperformanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis AT fairuziqbalmaulana theperformanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis AT subairi theperformanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis AT rahmanarifuddin theperformanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis AT wahyudirgantara performanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis AT fairuziqbalmaulana performanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis AT subairi performanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis AT rahmanarifuddin performanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis |