Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders
Interactions between online users are growing more and more in recent years, due to the latest developments of the web. People share online comments, opinions, and reviews about many topics. Aspect extraction is the automatic process of understanding the topic (the aspect) of such comments, which ha...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019-08-01
|
Series: | Data |
Subjects: | |
Online Access: | https://www.mdpi.com/2306-5729/4/3/121 |
_version_ | 1828371329408040960 |
---|---|
author | Matteo Bodini |
author_facet | Matteo Bodini |
author_sort | Matteo Bodini |
collection | DOAJ |
description | Interactions between online users are growing more and more in recent years, due to the latest developments of the web. People share online comments, opinions, and reviews about many topics. Aspect extraction is the automatic process of understanding the topic (the aspect) of such comments, which has obtained huge interest from commercial and academic points of view. For instance, reviews available in webshops (like eBay, Amazon, Aliexpress, etc.) can help the customers in purchasing products and automatic analysis of reviews would be useful, as sometimes it is almost impossible to read all the available ones. In recent years, aspect extraction in the Bangla language has been regarded more and more as a task of growing importance. In the previous literature, a few methods have been introduced to classify Bangla texts according to the aspect they were focused on. This kind of research is limited mainly due to the lack of publicly available datasets for aspect extraction in the Bangla language. We take into account the only two publicly available datasets, recently published, collected for the task of aspect extraction in the Bangla language. Then, we introduce several classification methods based on stacked auto-encoders, as far as we know never exploited in the task of aspect extraction in Bangla, and we achieve better aspect classification performance with respect to the state-of-the-art: the experiments show an average improvement of <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.17</mn> </mrow> </semantics> </math> </inline-formula>, <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.31</mn> </mrow> </semantics> </math> </inline-formula> and <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.30</mn> </mrow> </semantics> </math> </inline-formula> (across the two datasets), respectively in precision, recall and F1-score, reported in the state-of-the-art works that tackled the problem. |
first_indexed | 2024-04-14T06:48:27Z |
format | Article |
id | doaj.art-5d59c8acb47b437897bfddb3283e2702 |
institution | Directory Open Access Journal |
issn | 2306-5729 |
language | English |
last_indexed | 2024-04-14T06:48:27Z |
publishDate | 2019-08-01 |
publisher | MDPI AG |
record_format | Article |
series | Data |
spelling | doaj.art-5d59c8acb47b437897bfddb3283e27022022-12-22T02:07:06ZengMDPI AGData2306-57292019-08-014312110.3390/data4030121data4030121Aspect Extraction from Bangla Reviews Through Stacked Auto-EncodersMatteo Bodini0Dipartimento di Informatica “Giovanni Degli Antoni”, Università degli Studi di Milano, Via Celoria 18, 20133 Milano, ItalyInteractions between online users are growing more and more in recent years, due to the latest developments of the web. People share online comments, opinions, and reviews about many topics. Aspect extraction is the automatic process of understanding the topic (the aspect) of such comments, which has obtained huge interest from commercial and academic points of view. For instance, reviews available in webshops (like eBay, Amazon, Aliexpress, etc.) can help the customers in purchasing products and automatic analysis of reviews would be useful, as sometimes it is almost impossible to read all the available ones. In recent years, aspect extraction in the Bangla language has been regarded more and more as a task of growing importance. In the previous literature, a few methods have been introduced to classify Bangla texts according to the aspect they were focused on. This kind of research is limited mainly due to the lack of publicly available datasets for aspect extraction in the Bangla language. We take into account the only two publicly available datasets, recently published, collected for the task of aspect extraction in the Bangla language. Then, we introduce several classification methods based on stacked auto-encoders, as far as we know never exploited in the task of aspect extraction in Bangla, and we achieve better aspect classification performance with respect to the state-of-the-art: the experiments show an average improvement of <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.17</mn> </mrow> </semantics> </math> </inline-formula>, <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.31</mn> </mrow> </semantics> </math> </inline-formula> and <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.30</mn> </mrow> </semantics> </math> </inline-formula> (across the two datasets), respectively in precision, recall and F1-score, reported in the state-of-the-art works that tackled the problem.https://www.mdpi.com/2306-5729/4/3/121text classificationaspect-based sentiment analysisaspect extractionBangla languageauto-encoder |
spellingShingle | Matteo Bodini Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders Data text classification aspect-based sentiment analysis aspect extraction Bangla language auto-encoder |
title | Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders |
title_full | Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders |
title_fullStr | Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders |
title_full_unstemmed | Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders |
title_short | Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders |
title_sort | aspect extraction from bangla reviews through stacked auto encoders |
topic | text classification aspect-based sentiment analysis aspect extraction Bangla language auto-encoder |
url | https://www.mdpi.com/2306-5729/4/3/121 |
work_keys_str_mv | AT matteobodini aspectextractionfrombanglareviewsthroughstackedautoencoders |