The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis

The COVID-19 pandemic has significantly impacted Indonesia, necessitating a deeper understanding of public sentiment towards the crisis. This study investigates the performance of three prominent machine learning models: Bernoulli Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression,...

Full description

Bibliographic Details
Main Authors: Wahyu Dirgantara, Fairuz Iqbal Maulana, Subairi, Rahman Arifuddin
Format: Article
Language:English
Published: Universitas Kristen Satya Wacana 2024-04-01
Series:Techne
Subjects:
Online Access:https://ojs.jurnaltechne.org/index.php/techne/article/view/446
_version_ 1797200799765364736
author Wahyu Dirgantara
Fairuz Iqbal Maulana
Subairi
Rahman Arifuddin
author_facet Wahyu Dirgantara
Fairuz Iqbal Maulana
Subairi
Rahman Arifuddin
author_sort Wahyu Dirgantara
collection DOAJ
description The COVID-19 pandemic has significantly impacted Indonesia, necessitating a deeper understanding of public sentiment towards the crisis. This study investigates the performance of three prominent machine learning models: Bernoulli Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression, in analyzing sentiments related to COVID-19 in Indonesia. Utilizing a dataset comprising social media posts, the research aims to classify sentiments into positive, and negative categories, providing insights into the public's perception of the pandemic and associated measures. Sentiment analysis serves as a powerful tool to capture the collective emotions and opinions of the populace, which are pivotal in shaping public health responses and policies. The accuracy of LR and SVM is 99%, whereas Bayesian has an accuracy of 98%. We conclude that Logistic Regression and Support Vector Machine are the best model for the above dataset. This research evaluates these models' accuracy and reliability in the context of the Indonesian language, which influence sentiment interpretation. The findings of this study will contribute to the fields of natural language processing and public health by highlighting the efficacy of machine learning models in sentiment analysis during a health crisis. Moreover, the results will assist policymakers and health officials in understanding public sentiment, enabling them to tailor communication and interventions more effectively.
first_indexed 2024-04-24T07:37:24Z
format Article
id doaj.art-69c3e191d75844c2879e7a7af18caca5
institution Directory Open Access Journal
issn 1412-8292
2615-7772
language English
last_indexed 2024-04-24T07:37:24Z
publishDate 2024-04-01
publisher Universitas Kristen Satya Wacana
record_format Article
series Techne
spelling doaj.art-69c3e191d75844c2879e7a7af18caca52024-04-20T05:15:19ZengUniversitas Kristen Satya WacanaTechne1412-82922615-77722024-04-0123110.31358/techne.v23i1.446The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment AnalysisWahyu Dirgantara0Fairuz Iqbal Maulana1Subairi2Rahman Arifuddin3Universitas Merdeka MalangBina Nusantara UniversityUniversitas Merdeka MalangUniversitas Merdeka Malang The COVID-19 pandemic has significantly impacted Indonesia, necessitating a deeper understanding of public sentiment towards the crisis. This study investigates the performance of three prominent machine learning models: Bernoulli Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression, in analyzing sentiments related to COVID-19 in Indonesia. Utilizing a dataset comprising social media posts, the research aims to classify sentiments into positive, and negative categories, providing insights into the public's perception of the pandemic and associated measures. Sentiment analysis serves as a powerful tool to capture the collective emotions and opinions of the populace, which are pivotal in shaping public health responses and policies. The accuracy of LR and SVM is 99%, whereas Bayesian has an accuracy of 98%. We conclude that Logistic Regression and Support Vector Machine are the best model for the above dataset. This research evaluates these models' accuracy and reliability in the context of the Indonesian language, which influence sentiment interpretation. The findings of this study will contribute to the fields of natural language processing and public health by highlighting the efficacy of machine learning models in sentiment analysis during a health crisis. Moreover, the results will assist policymakers and health officials in understanding public sentiment, enabling them to tailor communication and interventions more effectively. https://ojs.jurnaltechne.org/index.php/techne/article/view/446COVID-19Machine LearningBernoulli Naïve BayesSupport Vector MachineLogistic RegressionSentiment Analysis
spellingShingle Wahyu Dirgantara
Fairuz Iqbal Maulana
Subairi
Rahman Arifuddin
The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis
Techne
COVID-19
Machine Learning
Bernoulli Naïve Bayes
Support Vector Machine
Logistic Regression
Sentiment Analysis
title The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis
title_full The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis
title_fullStr The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis
title_full_unstemmed The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis
title_short The Performance of Machine Learning Model Bernoulli Naïve Bayes, Support Vector Machine, and Logistic Regression on COVID-19 in Indonesia using Sentiment Analysis
title_sort performance of machine learning model bernoulli naive bayes support vector machine and logistic regression on covid 19 in indonesia using sentiment analysis
topic COVID-19
Machine Learning
Bernoulli Naïve Bayes
Support Vector Machine
Logistic Regression
Sentiment Analysis
url https://ojs.jurnaltechne.org/index.php/techne/article/view/446
work_keys_str_mv AT wahyudirgantara theperformanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis
AT fairuziqbalmaulana theperformanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis
AT subairi theperformanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis
AT rahmanarifuddin theperformanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis
AT wahyudirgantara performanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis
AT fairuziqbalmaulana performanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis
AT subairi performanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis
AT rahmanarifuddin performanceofmachinelearningmodelbernoullinaivebayessupportvectormachineandlogisticregressiononcovid19inindonesiausingsentimentanalysis