The Influence Of Applying Stopword Removal And Smote On Indonesian Sentiment Classification

Information, like public opinions or responses, can be obtained through Twitter tweets. These opinions can expressed as a sentiment. Sentiments can be positive, neutral, or negative. Sentiment analysis (opinion mining) on a text can performed through text classification. This research aims to determ...

Full description

Bibliographic Details
Main Author: Arif Bijaksana Putra Negara
Format: Article
Language:English
Published: Udayana University, Institute for Research and Community Services 2023-12-01
Series:Lontar Komputer
Online Access:https://ojs.unud.ac.id/index.php/lontar/article/view/106786
_version_ 1797372352634290176
author Arif Bijaksana Putra Negara
author_facet Arif Bijaksana Putra Negara
author_sort Arif Bijaksana Putra Negara
collection DOAJ
description Information, like public opinions or responses, can be obtained through Twitter tweets. These opinions can expressed as a sentiment. Sentiments can be positive, neutral, or negative. Sentiment analysis (opinion mining) on a text can performed through text classification. This research aims to determine the influence of implementing Stopword Removal and SMOTE on the sentiment classification model for Indonesian tweets. The algorithms used in this research are Logistic Regression and Random Forest. Based on the evaluation, the best classification model in this research was achieved by implementing the Random Forest algorithm along with SMOTE, with an f1-score value of 75.03%. Meanwhile, implementing the Random Forest algorithm and Stopword Removal achieved the worst classification model, with an f1-score value of 68.09%. Implementing Stopword Removal in both algorithms has a negative impact in the form of a decrease in the resulting f1-score. Meanwhile, the performance of SMOTE provides a positive impact in the form of an increase in the resulting f1-score. This happened since Stopword Removal could reduce information and alter the meaning of processed tweets, causing the tweet to lose its sentiment.
first_indexed 2024-03-08T18:33:30Z
format Article
id doaj.art-40fcc3e5c93c46f5bab3ea1920c98850
institution Directory Open Access Journal
issn 2088-1541
2541-5832
language English
last_indexed 2024-03-08T18:33:30Z
publishDate 2023-12-01
publisher Udayana University, Institute for Research and Community Services
record_format Article
series Lontar Komputer
spelling doaj.art-40fcc3e5c93c46f5bab3ea1920c988502023-12-29T16:03:05ZengUdayana University, Institute for Research and Community ServicesLontar Komputer2088-15412541-58322023-12-0114317218510.24843/LKJITI.2023.v14.i03.p05106786The Influence Of Applying Stopword Removal And Smote On Indonesian Sentiment ClassificationArif Bijaksana Putra Negara0Universitas TanjungpuraInformation, like public opinions or responses, can be obtained through Twitter tweets. These opinions can expressed as a sentiment. Sentiments can be positive, neutral, or negative. Sentiment analysis (opinion mining) on a text can performed through text classification. This research aims to determine the influence of implementing Stopword Removal and SMOTE on the sentiment classification model for Indonesian tweets. The algorithms used in this research are Logistic Regression and Random Forest. Based on the evaluation, the best classification model in this research was achieved by implementing the Random Forest algorithm along with SMOTE, with an f1-score value of 75.03%. Meanwhile, implementing the Random Forest algorithm and Stopword Removal achieved the worst classification model, with an f1-score value of 68.09%. Implementing Stopword Removal in both algorithms has a negative impact in the form of a decrease in the resulting f1-score. Meanwhile, the performance of SMOTE provides a positive impact in the form of an increase in the resulting f1-score. This happened since Stopword Removal could reduce information and alter the meaning of processed tweets, causing the tweet to lose its sentiment.https://ojs.unud.ac.id/index.php/lontar/article/view/106786
spellingShingle Arif Bijaksana Putra Negara
The Influence Of Applying Stopword Removal And Smote On Indonesian Sentiment Classification
Lontar Komputer
title The Influence Of Applying Stopword Removal And Smote On Indonesian Sentiment Classification
title_full The Influence Of Applying Stopword Removal And Smote On Indonesian Sentiment Classification
title_fullStr The Influence Of Applying Stopword Removal And Smote On Indonesian Sentiment Classification
title_full_unstemmed The Influence Of Applying Stopword Removal And Smote On Indonesian Sentiment Classification
title_short The Influence Of Applying Stopword Removal And Smote On Indonesian Sentiment Classification
title_sort influence of applying stopword removal and smote on indonesian sentiment classification
url https://ojs.unud.ac.id/index.php/lontar/article/view/106786
work_keys_str_mv AT arifbijaksanaputranegara theinfluenceofapplyingstopwordremovalandsmoteonindonesiansentimentclassification
AT arifbijaksanaputranegara influenceofapplyingstopwordremovalandsmoteonindonesiansentimentclassification