Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders

Interactions between online users are growing more and more in recent years, due to the latest developments of the web. People share online comments, opinions, and reviews about many topics. Aspect extraction is the automatic process of understanding the topic (the aspect) of such comments, which ha...

Full description

Bibliographic Details
Main Author: Matteo Bodini
Format: Article
Language:English
Published: MDPI AG 2019-08-01
Series:Data
Subjects:
Online Access:https://www.mdpi.com/2306-5729/4/3/121
_version_ 1828371329408040960
author Matteo Bodini
author_facet Matteo Bodini
author_sort Matteo Bodini
collection DOAJ
description Interactions between online users are growing more and more in recent years, due to the latest developments of the web. People share online comments, opinions, and reviews about many topics. Aspect extraction is the automatic process of understanding the topic (the aspect) of such comments, which has obtained huge interest from commercial and academic points of view. For instance, reviews available in webshops (like eBay, Amazon, Aliexpress, etc.) can help the customers in purchasing products and automatic analysis of reviews would be useful, as sometimes it is almost impossible to read all the available ones. In recent years, aspect extraction in the Bangla language has been regarded more and more as a task of growing importance. In the previous literature, a few methods have been introduced to classify Bangla texts according to the aspect they were focused on. This kind of research is limited mainly due to the lack of publicly available datasets for aspect extraction in the Bangla language. We take into account the only two publicly available datasets, recently published, collected for the task of aspect extraction in the Bangla language. Then, we introduce several classification methods based on stacked auto-encoders, as far as we know never exploited in the task of aspect extraction in Bangla, and we achieve better aspect classification performance with respect to the state-of-the-art: the experiments show an average improvement of <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.17</mn> </mrow> </semantics> </math> </inline-formula>, <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.31</mn> </mrow> </semantics> </math> </inline-formula> and <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.30</mn> </mrow> </semantics> </math> </inline-formula> (across the two datasets), respectively in precision, recall and F1-score, reported in the state-of-the-art works that tackled the problem.
first_indexed 2024-04-14T06:48:27Z
format Article
id doaj.art-5d59c8acb47b437897bfddb3283e2702
institution Directory Open Access Journal
issn 2306-5729
language English
last_indexed 2024-04-14T06:48:27Z
publishDate 2019-08-01
publisher MDPI AG
record_format Article
series Data
spelling doaj.art-5d59c8acb47b437897bfddb3283e27022022-12-22T02:07:06ZengMDPI AGData2306-57292019-08-014312110.3390/data4030121data4030121Aspect Extraction from Bangla Reviews Through Stacked Auto-EncodersMatteo Bodini0Dipartimento di Informatica “Giovanni Degli Antoni”, Università degli Studi di Milano, Via Celoria 18, 20133 Milano, ItalyInteractions between online users are growing more and more in recent years, due to the latest developments of the web. People share online comments, opinions, and reviews about many topics. Aspect extraction is the automatic process of understanding the topic (the aspect) of such comments, which has obtained huge interest from commercial and academic points of view. For instance, reviews available in webshops (like eBay, Amazon, Aliexpress, etc.) can help the customers in purchasing products and automatic analysis of reviews would be useful, as sometimes it is almost impossible to read all the available ones. In recent years, aspect extraction in the Bangla language has been regarded more and more as a task of growing importance. In the previous literature, a few methods have been introduced to classify Bangla texts according to the aspect they were focused on. This kind of research is limited mainly due to the lack of publicly available datasets for aspect extraction in the Bangla language. We take into account the only two publicly available datasets, recently published, collected for the task of aspect extraction in the Bangla language. Then, we introduce several classification methods based on stacked auto-encoders, as far as we know never exploited in the task of aspect extraction in Bangla, and we achieve better aspect classification performance with respect to the state-of-the-art: the experiments show an average improvement of <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.17</mn> </mrow> </semantics> </math> </inline-formula>, <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.31</mn> </mrow> </semantics> </math> </inline-formula> and <inline-formula> <math display="inline"> <semantics> <mrow> <mn>0.30</mn> </mrow> </semantics> </math> </inline-formula> (across the two datasets), respectively in precision, recall and F1-score, reported in the state-of-the-art works that tackled the problem.https://www.mdpi.com/2306-5729/4/3/121text classificationaspect-based sentiment analysisaspect extractionBangla languageauto-encoder
spellingShingle Matteo Bodini
Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders
Data
text classification
aspect-based sentiment analysis
aspect extraction
Bangla language
auto-encoder
title Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders
title_full Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders
title_fullStr Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders
title_full_unstemmed Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders
title_short Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders
title_sort aspect extraction from bangla reviews through stacked auto encoders
topic text classification
aspect-based sentiment analysis
aspect extraction
Bangla language
auto-encoder
url https://www.mdpi.com/2306-5729/4/3/121
work_keys_str_mv AT matteobodini aspectextractionfrombanglareviewsthroughstackedautoencoders