A Combination of Lexicon-based and Distributional Representations for Classification of Indonesian Vaccine Acceptance Rates

When the COVID-19 pandemic hit, the use of vaccines was advertised as the end of the pandemic by the entire world. However, the chances of vaccination depended on the sentiments of society and individuals about the vaccine. People's acceptance of vaccines can change depending on conditions and...

Full description

Bibliographic Details
Main Authors: Katon Suwida, Muhammad Yusuf Kardawi, Diana Purwitasari, Fahril Mabahist
Format: Article
Language:English
Published: Politeknik Elektronika Negeri Surabaya 2023-06-01
Series:Emitter: International Journal of Engineering Technology
Subjects:
Online Access:https://emitter.pens.ac.id/index.php/emitter/article/view/768
Description
Summary:When the COVID-19 pandemic hit, the use of vaccines was advertised as the end of the pandemic by the entire world. However, the chances of vaccination depended on the sentiments of society and individuals about the vaccine. People's acceptance of vaccines can change depending on conditions and events. Social media platforms such as Twitter can be used as a source of information to find out the conditions and attitudes of the community toward the program. By implementing a machine learning technique on the COVID-19 vaccine dataset, we hope to impact the classification result with text. This study suggests three distinct machine learning models for classifying texts of the COVID-19 vaccination, namely a model based on the first lexicon using the feature extraction method; second, using the word insertion technique to utilize distribution representation; and third, a combination model of distribution representation and feature extraction based on the lexicon. From the evaluation that has been carried out, we found that a combination of lexicon-based and distributional representation methods succeeded in giving the best results for classifying the level of acceptance of the COVID-19 vaccine in Indonesia with an accuracy score of 71.44% and an F1-score of 71.43%.
ISSN:2355-391X
2443-1168