Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion

Social media has recently been widely used by users, especially Indonesians, as a place to express themselves in sentences, pictures, sounds, or videos. Twitter is one of the social media favored by people of diverse ages. Twitter is a social media that provides features like social media in general...

Full description

Bibliographic Details
Main Authors: Hanif Reangga Alhakiem, Erwin Budi Setiawan
Format: Article
Language:English
Published: Ikatan Ahli Informatika Indonesia 2022-11-01
Series:Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:
Online Access:http://jurnal.iaii.or.id/index.php/RESTI/article/view/4429
_version_ 1797334323272089600
author Hanif Reangga Alhakiem
Erwin Budi Setiawan
author_facet Hanif Reangga Alhakiem
Erwin Budi Setiawan
author_sort Hanif Reangga Alhakiem
collection DOAJ
description Social media has recently been widely used by users, especially Indonesians, as a place to express themselves in sentences, pictures, sounds, or videos. Twitter is one of the social media favored by people of diverse ages. Twitter is a social media that provides features like social media in general. However, Twitter has a unique feature where users can send or read text messages limited to only a few characters. Therefore, user tweets with topics related to a particular product can be utilized by companies to become input in the development of these products. This research was conducted using tweet data on the topic of Telkomsel, which is divided into two aspects, namely signal and service. Aspect-based sentiment analysis of Telkomsel was carried out using Logistic Regression with FastText feature expansion to reduce vocabulary mismatch in tweets so that the classification stage can be performed optimally. In addition, the Synthetic Minority Oversampling Technique (SMOTE) sampling method was applied to overcome data imbalance. The test results prove that feature expansion can improve F1-Score values for signal and service aspects. For the signal aspect, F1-Score increased by 3.33% from the baseline with a value of 96.48%. While for the service aspect, F1-Score increased by 12.91% from the baseline with a value of 95.57%.
first_indexed 2024-03-08T08:18:12Z
format Article
id doaj.art-34cca55dabe2434aa87d2288d160d267
institution Directory Open Access Journal
issn 2580-0760
language English
last_indexed 2024-03-08T08:18:12Z
publishDate 2022-11-01
publisher Ikatan Ahli Informatika Indonesia
record_format Article
series Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling doaj.art-34cca55dabe2434aa87d2288d160d2672024-02-02T06:34:38ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602022-11-016584084610.29207/resti.v6i5.44294429Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature ExpansionHanif Reangga Alhakiem0Erwin Budi SetiawanTelkom UniversitySocial media has recently been widely used by users, especially Indonesians, as a place to express themselves in sentences, pictures, sounds, or videos. Twitter is one of the social media favored by people of diverse ages. Twitter is a social media that provides features like social media in general. However, Twitter has a unique feature where users can send or read text messages limited to only a few characters. Therefore, user tweets with topics related to a particular product can be utilized by companies to become input in the development of these products. This research was conducted using tweet data on the topic of Telkomsel, which is divided into two aspects, namely signal and service. Aspect-based sentiment analysis of Telkomsel was carried out using Logistic Regression with FastText feature expansion to reduce vocabulary mismatch in tweets so that the classification stage can be performed optimally. In addition, the Synthetic Minority Oversampling Technique (SMOTE) sampling method was applied to overcome data imbalance. The test results prove that feature expansion can improve F1-Score values for signal and service aspects. For the signal aspect, F1-Score increased by 3.33% from the baseline with a value of 96.48%. While for the service aspect, F1-Score increased by 12.91% from the baseline with a value of 95.57%.http://jurnal.iaii.or.id/index.php/RESTI/article/view/4429aspect-based sentiment analysislogistic regressionfasttextfeature expansiontwitter
spellingShingle Hanif Reangga Alhakiem
Erwin Budi Setiawan
Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
aspect-based sentiment analysis
logistic regression
fasttext
feature expansion
twitter
title Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion
title_full Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion
title_fullStr Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion
title_full_unstemmed Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion
title_short Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion
title_sort aspect bas1ed sentiment analysis on twitter using logistic regression with fasttext feature expansion
topic aspect-based sentiment analysis
logistic regression
fasttext
feature expansion
twitter
url http://jurnal.iaii.or.id/index.php/RESTI/article/view/4429
work_keys_str_mv AT hanifreanggaalhakiem aspectbas1edsentimentanalysisontwitterusinglogisticregressionwithfasttextfeatureexpansion
AT erwinbudisetiawan aspectbas1edsentimentanalysisontwitterusinglogisticregressionwithfasttextfeatureexpansion