Classification of short texts using a wave model

Quantum computing algorithms are actively developed and applied in the field of natural language processing. The authors of the paper proposed a new quantum-like method for classifying short texts. The basis of the method is the representation of the text as an ensemble of elementary particles. The...

Full description

Bibliographic Details
Main Authors: Anastasia S. Gruzdeva, Igor A. Bessmertny
Format: Article
Language:English
Published: Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University) 2022-04-01
Series:Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
Subjects:
Online Access:https://ntv.ifmo.ru/file/article/21132.pdf
_version_ 1818278687137595392
author Anastasia S. Gruzdeva
Igor A. Bessmertny
author_facet Anastasia S. Gruzdeva
Igor A. Bessmertny
author_sort Anastasia S. Gruzdeva
collection DOAJ
description Quantum computing algorithms are actively developed and applied in the field of natural language processing. The authors of the paper proposed a new quantum-like method for classifying short texts. The basis of the method is the representation of the text as an ensemble of elementary particles. The value of the detection probability amplitude of a given ensemble at the selected points in space is chosen as a classification criterion. In this case, the space is understood as a vector space described using the distributive-semantic model of the language. The authors suggested one of the possible ways of interpreting the parameters of the wave function that describes the behavior of an elementary particle, as well as an algorithm for calculating the probability amplitude taking into account these parameters. For the experimental research of the described method, authors performed the classification of Internet communities by topics. For the analysis, the names and the “information” section of communities were used. In total, 100 groups of the social network “VKontakte” belonging to five various topics were taken. The proposed model showed rather high classification accuracy (91 % in general on the data set and from 75 % to 95 % within individual classes). The proposed model is supposed to be used to classify user comments about goods, services and events, as well as to determine some properties of the psychological portraits of users of online communities.
first_indexed 2024-12-12T23:21:23Z
format Article
id doaj.art-485d12af85e1450ba1ce40370f726012
institution Directory Open Access Journal
issn 2226-1494
2500-0373
language English
last_indexed 2024-12-12T23:21:23Z
publishDate 2022-04-01
publisher Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)
record_format Article
series Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
spelling doaj.art-485d12af85e1450ba1ce40370f7260122022-12-22T00:08:16ZengSaint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki2226-14942500-03732022-04-0122228729310.17586/2226-1494-2022-22-2-287-293Classification of short texts using a wave modelAnastasia S. Gruzdeva0https://orcid.org/0000-0003-4963-0823Igor A. Bessmertny1https://orcid.org/0000-0001-6711-6399PhD student, ITMO University, Saint Petersburg, 197101, Russian FederationD.Sc., Full Professor, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 36661767800Quantum computing algorithms are actively developed and applied in the field of natural language processing. The authors of the paper proposed a new quantum-like method for classifying short texts. The basis of the method is the representation of the text as an ensemble of elementary particles. The value of the detection probability amplitude of a given ensemble at the selected points in space is chosen as a classification criterion. In this case, the space is understood as a vector space described using the distributive-semantic model of the language. The authors suggested one of the possible ways of interpreting the parameters of the wave function that describes the behavior of an elementary particle, as well as an algorithm for calculating the probability amplitude taking into account these parameters. For the experimental research of the described method, authors performed the classification of Internet communities by topics. For the analysis, the names and the “information” section of communities were used. In total, 100 groups of the social network “VKontakte” belonging to five various topics were taken. The proposed model showed rather high classification accuracy (91 % in general on the data set and from 75 % to 95 % within individual classes). The proposed model is supposed to be used to classify user comments about goods, services and events, as well as to determine some properties of the psychological portraits of users of online communities.https://ntv.ifmo.ru/file/article/21132.pdfclassificationnatural language processingwave modelinterferencequantum-like modeldefinition of the text subject
spellingShingle Anastasia S. Gruzdeva
Igor A. Bessmertny
Classification of short texts using a wave model
Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
classification
natural language processing
wave model
interference
quantum-like model
definition of the text subject
title Classification of short texts using a wave model
title_full Classification of short texts using a wave model
title_fullStr Classification of short texts using a wave model
title_full_unstemmed Classification of short texts using a wave model
title_short Classification of short texts using a wave model
title_sort classification of short texts using a wave model
topic classification
natural language processing
wave model
interference
quantum-like model
definition of the text subject
url https://ntv.ifmo.ru/file/article/21132.pdf
work_keys_str_mv AT anastasiasgruzdeva classificationofshorttextsusingawavemodel
AT igorabessmertny classificationofshorttextsusingawavemodel