Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis

In the process of developing a business, aspect-based sentiment analysis (ABSA) could help extract customers' opinions on different aspects of the business from online reviews. Researchers have found great prospective in deep learning approaches to solving ABSA tasks. Furthermore, studies have...

Full description

Bibliographic Details
Main Authors: . Natasya, Abba Suganda Girsang
Format: Article
Language:English
Published: Ital Publication 2022-11-01
Series:Emerging Science Journal
Subjects:
Online Access:https://ijournalse.org/index.php/ESJ/article/view/933
_version_ 1797956903871971328
author . Natasya
Abba Suganda Girsang
author_facet . Natasya
Abba Suganda Girsang
author_sort . Natasya
collection DOAJ
description In the process of developing a business, aspect-based sentiment analysis (ABSA) could help extract customers' opinions on different aspects of the business from online reviews. Researchers have found great prospective in deep learning approaches to solving ABSA tasks. Furthermore, studies have also explored the implementation of text augmentation, such as Easy Data Augmentation (EDA), to improve the deep learning models’ performance using only simple operations. However, when implementing EDA to ABSA, there will be high chances that the augmented sentences could lose important aspects or sentiment-related words (target words) critical for training. Corresponding to that, another study has made adjustments to EDA for English aspect-based sentiment data provided with the target words tag. However, the solution still needs additional modifications in the case of non-tagged data. Hence, in this work, we will focus on modifying EDA that integrates POS tagging and word similarity to not only understand the context of the words but also extract the target words directly from non-tagged sentences. Additionally, the modified EDA is combined with the backtranslation method, as the latter has also shown quite a significant contribution to the model’s performance in several research studies. The proposed method is then evaluated on a small Indonesian ABSA dataset using baseline deep learning models. Results show that the augmentation method could increase the model’s performance on a limited dataset problem. In general, the best performance for aspect classification is achieved by implementing the proposed method, which increases the macro-accuracy and F1, respectively, on Long Short-Term Memory (LSTM) and Bidirectional LSTM models compared to the original EDA. The proposed method also obtained the best performance for sentiment classification using a convolutional neural network, increasing the overall accuracy by 2.2% and F1 by 3.2%.   Doi: 10.28991/ESJ-2023-07-01-018 Full Text: PDF
first_indexed 2024-04-10T23:57:05Z
format Article
id doaj.art-bfe9958a14714c8abe101cde1a418f9a
institution Directory Open Access Journal
issn 2610-9182
language English
last_indexed 2024-04-10T23:57:05Z
publishDate 2022-11-01
publisher Ital Publication
record_format Article
series Emerging Science Journal
spelling doaj.art-bfe9958a14714c8abe101cde1a418f9a2023-01-10T12:51:05ZengItal PublicationEmerging Science Journal2610-91822022-11-017125627210.28991/ESJ-2023-07-01-018414Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis. Natasya0Abba Suganda Girsang1Computer Science Department, BINUS Graduate Program – Master of Computer Science, Bina Nusantara University, Jakarta 11480,Computer Science Department, BINUS Graduate Program – Master of Computer Science, Bina Nusantara University, Jakarta 11480,In the process of developing a business, aspect-based sentiment analysis (ABSA) could help extract customers' opinions on different aspects of the business from online reviews. Researchers have found great prospective in deep learning approaches to solving ABSA tasks. Furthermore, studies have also explored the implementation of text augmentation, such as Easy Data Augmentation (EDA), to improve the deep learning models’ performance using only simple operations. However, when implementing EDA to ABSA, there will be high chances that the augmented sentences could lose important aspects or sentiment-related words (target words) critical for training. Corresponding to that, another study has made adjustments to EDA for English aspect-based sentiment data provided with the target words tag. However, the solution still needs additional modifications in the case of non-tagged data. Hence, in this work, we will focus on modifying EDA that integrates POS tagging and word similarity to not only understand the context of the words but also extract the target words directly from non-tagged sentences. Additionally, the modified EDA is combined with the backtranslation method, as the latter has also shown quite a significant contribution to the model’s performance in several research studies. The proposed method is then evaluated on a small Indonesian ABSA dataset using baseline deep learning models. Results show that the augmentation method could increase the model’s performance on a limited dataset problem. In general, the best performance for aspect classification is achieved by implementing the proposed method, which increases the macro-accuracy and F1, respectively, on Long Short-Term Memory (LSTM) and Bidirectional LSTM models compared to the original EDA. The proposed method also obtained the best performance for sentiment classification using a convolutional neural network, increasing the overall accuracy by 2.2% and F1 by 3.2%.   Doi: 10.28991/ESJ-2023-07-01-018 Full Text: PDFhttps://ijournalse.org/index.php/ESJ/article/view/933easy data augmentationbacktranslationlong short-term memorybidirectional lstmconvolutional neural network.
spellingShingle . Natasya
Abba Suganda Girsang
Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis
Emerging Science Journal
easy data augmentation
backtranslation
long short-term memory
bidirectional lstm
convolutional neural network.
title Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis
title_full Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis
title_fullStr Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis
title_full_unstemmed Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis
title_short Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis
title_sort modified eda and backtranslation augmentation in deep learning models for indonesian aspect based sentiment analysis
topic easy data augmentation
backtranslation
long short-term memory
bidirectional lstm
convolutional neural network.
url https://ijournalse.org/index.php/ESJ/article/view/933
work_keys_str_mv AT natasya modifiededaandbacktranslationaugmentationindeeplearningmodelsforindonesianaspectbasedsentimentanalysis
AT abbasugandagirsang modifiededaandbacktranslationaugmentationindeeplearningmodelsforindonesianaspectbasedsentimentanalysis