Lexicon-Based vs. Bert-Based Sentiment Analysis: A Comparative Study in Italian

Recent evolutions in the e-commerce market have led to an increasing importance attributed by consumers to product reviews made by third parties before proceeding to purchase. The industry, in order to improve the offer intercepting the discontent of consumers, has placed increasing attention toward...

Full description

Bibliographic Details
Main Authors: Rosario Catelli, Serena Pelosi, Massimo Esposito
Format: Article
Language:English
Published: MDPI AG 2022-01-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/11/3/374
Description
Summary:Recent evolutions in the e-commerce market have led to an increasing importance attributed by consumers to product reviews made by third parties before proceeding to purchase. The industry, in order to improve the offer intercepting the discontent of consumers, has placed increasing attention towards systems able to identify the sentiment expressed by buyers, whether positive or negative. From a technological point of view, the literature in recent years has seen the development of two types of methodologies: those based on lexicons and those based on machine and deep learning techniques. This study proposes a comparison between these technologies in the Italian market, one of the largest in the world, exploiting an ad hoc dataset: scientific evidence generally shows the superiority of language models such as BERT built on deep neural networks, but it opens several considerations on the effectiveness and improvement of these solutions when compared to those based on lexicons in the presence of datasets of reduced size such as the one under study, a common condition for languages other than English or Chinese.
ISSN:2079-9292