Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis
In the process of developing a business, aspect-based sentiment analysis (ABSA) can help extract customers' opinions on different aspects of the business from online reviews. Researchers have found great promise in deep learning approaches to ABSA tasks. Furthermore, studies have...
Main Authors: | Natasya, Abba Suganda Girsang |
---|---|
Format: | Article |
Language: | English |
Published: | Ital Publication, 2022-11-01 |
Series: | Emerging Science Journal |
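Subjects: | easy data augmentation; backtranslation; long short-term memory; bidirectional lstm; convolutional neural network |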
Online Access: | https://ijournalse.org/index.php/ESJ/article/view/933 |
_version_ | 1797956903871971328 |
---|---|
author | Natasya; Abba Suganda Girsang |
collection | DOAJ |
description | In the process of developing a business, aspect-based sentiment analysis (ABSA) can help extract customers' opinions on different aspects of the business from online reviews. Researchers have found great promise in deep learning approaches to ABSA tasks. Studies have also explored text augmentation, such as Easy Data Augmentation (EDA), to improve deep learning models’ performance using only simple operations. However, when EDA is applied to ABSA, there is a high chance that the augmented sentences lose important aspect or sentiment-related words (target words) that are critical for training. In response, another study adjusted EDA for English aspect-based sentiment data that comes with target-word tags. That solution still needs additional modification for non-tagged data. Hence, this work modifies EDA by integrating POS tagging and word similarity, not only to understand the context of the words but also to extract the target words directly from non-tagged sentences. Additionally, the modified EDA is combined with backtranslation, which has also contributed significantly to model performance in several studies. The proposed method is then evaluated on a small Indonesian ABSA dataset using baseline deep learning models. Results show that the augmentation method can increase model performance on a limited dataset. In general, the best performance for aspect classification is achieved by the proposed method, which increases the macro-accuracy and F1, respectively, on Long Short-Term Memory (LSTM) and Bidirectional LSTM models compared to the original EDA. The proposed method also obtained the best performance for sentiment classification using a convolutional neural network, increasing the overall accuracy by 2.2% and F1 by 3.2%. DOI: 10.28991/ESJ-2023-07-01-018 |
first_indexed | 2024-04-10T23:57:05Z |
format | Article |
id | doaj.art-bfe9958a14714c8abe101cde1a418f9a |
institution | Directory of Open Access Journals |
issn | 2610-9182 |
language | English |
last_indexed | 2024-04-10T23:57:05Z |
publishDate | 2022-11-01 |
publisher | Ital Publication |
record_format | Article |
series | Emerging Science Journal |
spelling | doaj.art-bfe9958a14714c8abe101cde1a418f9a; 2023-01-10T12:51:05Z; eng; Ital Publication; Emerging Science Journal; 2610-9182; 2022-11-01; 7; 1; 256; 272; 10.28991/ESJ-2023-07-01-018; 414; Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis; Natasya (Computer Science Department, BINUS Graduate Program – Master of Computer Science, Bina Nusantara University, Jakarta 11480); Abba Suganda Girsang (Computer Science Department, BINUS Graduate Program – Master of Computer Science, Bina Nusantara University, Jakarta 11480); https://ijournalse.org/index.php/ESJ/article/view/933; easy data augmentation; backtranslation; long short-term memory; bidirectional lstm; convolutional neural network |
title | Modified EDA and Backtranslation Augmentation in Deep Learning Models for Indonesian Aspect-Based Sentiment Analysis |
topic | easy data augmentation; backtranslation; long short-term memory; bidirectional lstm; convolutional neural network |
url | https://ijournalse.org/index.php/ESJ/article/view/933 |
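
The description above notes that plain EDA can drop or move the aspect and sentiment words (target words) a model needs. A minimal Python sketch of that idea follows. It is an illustration under stated assumptions, not the authors' implementation: the function names, the example sentence, and the hand-written `protected` set are hypothetical. The article derives target words from POS tagging and word similarity, which is not reproduced here, and backtranslation is omitted.

```python
import random


def random_deletion(tokens, protected, p=0.2, rng=None):
    """Drop each non-protected token with probability p; protected tokens always stay."""
    if rng is None:
        rng = random.Random()
    kept = [t for t in tokens if t.lower() in protected or rng.random() >= p]
    # Never return an empty sentence: fall back to the original tokens.
    return kept if kept else list(tokens)


def random_swap(tokens, protected, n_swaps=1, rng=None):
    """Swap pairs of non-protected tokens; protected tokens keep their positions."""
    if rng is None:
        rng = random.Random()
    out = list(tokens)
    # Indices of tokens that are allowed to move.
    free = [i for i, t in enumerate(out) if t.lower() not in protected]
    if len(free) < 2:
        return out
    for _ in range(n_swaps):
        i, j = rng.sample(free, 2)
        out[i], out[j] = out[j], out[i]
    return out


# Hypothetical Indonesian review: "the food is tasty but the service is very slow".
tokens = "makanannya enak tapi pelayanannya sangat lambat".split()
# Assumption: target words are given by hand here, not extracted automatically.
protected = {"makanannya", "enak", "pelayanannya", "lambat"}

rng = random.Random(0)
print(random_deletion(tokens, protected, p=0.5, rng=rng))
print(random_swap(tokens, protected, n_swaps=1, rng=rng))
```

Whatever the random draws, both operations leave the protected words in the sentence, so each augmented copy still carries its aspect and sentiment cues.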