Application of the text wave model to the sentiment analysis problem
Authors researched the wave model of text representation which is one of the implementations of distributive semantics. This model takes into account not only the frequency of words occurrence in the text, but also their mutual location. The purpose of the study: to increase the accuracy of the anal...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)
2022-12-01
|
Series: | Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki |
Subjects: | |
Online Access: | https://ntv.ifmo.ru/file/article/21662.pdf |
_version_ | 1811290381017415680 |
---|---|
author | Anastasia S. Gruzdeva Rodion N. Iurev Igor A. Bessmertny |
author_facet | Anastasia S. Gruzdeva Rodion N. Iurev Igor A. Bessmertny |
author_sort | Anastasia S. Gruzdeva |
collection | DOAJ |
description | Authors researched the wave model of text representation which is one of the implementations of distributive semantics. This model takes into account not only the frequency of words occurrence in the text, but also their mutual location. The purpose of the study: to increase the accuracy of the analysis of the tonality of short texts based on the wave model.
The method of determining the relationship between the text and the term is based on the calculation of the probability amplitude of the text and term proximity using a wave model. The term with the highest probability amplitude is considered to correspond most closely to the meaning of the text. The wave model allowed taking into account the
fact that well-known methods define antonyms as semantically close lexical units. For the experimental study of this technique, a solution to the problem of sentiment analysis was chosen, exactly, finding the correspondence of user reviews about the product to the classes “positive” and “negative”. As a result, the accuracy of the text tonality defining was obtained up to 76.4 %, which exceeds the accuracy of the classical approach as well as the well-known methods of sentiment analysis for the Russian language. In addition, authors detected significant influence on classification accuracy of such model parameters as the choice of a basic distributive semantic model, the choice of a control point for calculating wave numbers, taking into account the influence of antonyms. The presented model has shown high accuracy
in identifying the relationships of the text with concepts that are not explicitly present in it and can be successfully used as a mathematical basis for solving problems of sentiment analysis. In addition, the results obtained indicate the potential use of the wave model in other areas that require the classification of texts by indirect signs, for example, to determine the elements of author psychological portrait. |
first_indexed | 2024-04-13T04:11:52Z |
format | Article |
id | doaj.art-16ed4ff6700e49d3aea83b708e49c5a1 |
institution | Directory Open Access Journal |
issn | 2226-1494 2500-0373 |
language | English |
last_indexed | 2024-04-13T04:11:52Z |
publishDate | 2022-12-01 |
publisher | Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University) |
record_format | Article |
series | Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki |
spelling | doaj.art-16ed4ff6700e49d3aea83b708e49c5a12022-12-22T03:03:04ZengSaint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki2226-14942500-03732022-12-012261159116510.17586/2226-1494-2022-22-6-1159-1165Application of the text wave model to the sentiment analysis problemAnastasia S. Gruzdeva0https://orcid.org/0000-0003-4963-0823Rodion N. Iurev1https://orcid.org/0000-0003-1146-2617Igor A. Bessmertny2https://orcid.org/0000-0001-6711-6399PhD Student, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 57674037100PhD Student, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 57485730300D. Sc., Full Professor, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 36661767800Authors researched the wave model of text representation which is one of the implementations of distributive semantics. This model takes into account not only the frequency of words occurrence in the text, but also their mutual location. The purpose of the study: to increase the accuracy of the analysis of the tonality of short texts based on the wave model. The method of determining the relationship between the text and the term is based on the calculation of the probability amplitude of the text and term proximity using a wave model. The term with the highest probability amplitude is considered to correspond most closely to the meaning of the text. The wave model allowed taking into account the fact that well-known methods define antonyms as semantically close lexical units. For the experimental study of this technique, a solution to the problem of sentiment analysis was chosen, exactly, finding the correspondence of user reviews about the product to the classes “positive” and “negative”. As a result, the accuracy of the text tonality defining was obtained up to 76.4 %, which exceeds the accuracy of the classical approach as well as the well-known methods of sentiment analysis for the Russian language. In addition, authors detected significant influence on classification accuracy of such model parameters as the choice of a basic distributive semantic model, the choice of a control point for calculating wave numbers, taking into account the influence of antonyms. The presented model has shown high accuracy in identifying the relationships of the text with concepts that are not explicitly present in it and can be successfully used as a mathematical basis for solving problems of sentiment analysis. In addition, the results obtained indicate the potential use of the wave model in other areas that require the classification of texts by indirect signs, for example, to determine the elements of author psychological portrait.https://ntv.ifmo.ru/file/article/21662.pdfsentiment analysisclassificationnatural language processingwave modelquantum-like model |
spellingShingle | Anastasia S. Gruzdeva Rodion N. Iurev Igor A. Bessmertny Application of the text wave model to the sentiment analysis problem Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki sentiment analysis classification natural language processing wave model quantum-like model |
title | Application of the text wave model to the sentiment analysis problem |
title_full | Application of the text wave model to the sentiment analysis problem |
title_fullStr | Application of the text wave model to the sentiment analysis problem |
title_full_unstemmed | Application of the text wave model to the sentiment analysis problem |
title_short | Application of the text wave model to the sentiment analysis problem |
title_sort | application of the text wave model to the sentiment analysis problem |
topic | sentiment analysis classification natural language processing wave model quantum-like model |
url | https://ntv.ifmo.ru/file/article/21662.pdf |
work_keys_str_mv | AT anastasiasgruzdeva applicationofthetextwavemodeltothesentimentanalysisproblem AT rodionniurev applicationofthetextwavemodeltothesentimentanalysisproblem AT igorabessmertny applicationofthetextwavemodeltothesentimentanalysisproblem |