Synset2Node: A new synset embedding based upon graph embeddings
In recent years, embedding methods have significantly increased the accuracy of text and graph processing methods. Embedding methods provide a compact vector representation of the basic elements (words, synsets, nodes, etc.) of the underlying system to encode the semantic in...
Main Author: | Fatemeh Jafarinejad |
---|---|
Format: | Article |
Language: | English |
Published: | Elsevier, 2023-02-01 |
Series: | Intelligent Systems with Applications |
Subjects: | Synset embeddings; Graph embeddings; Node2vec; WordNet; Lexical Semantic Similarity |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2667305322000965 |
_version_ | 1811171027676299264 |
---|---|
author | Fatemeh Jafarinejad |
collection | DOAJ |
description | In recent years, embedding methods have significantly increased the accuracy of text and graph processing methods. Embedding methods provide a compact vector representation of the basic elements (words, synsets, nodes, etc.) of the underlying system that encodes the semantic information between the elements. Because words are polysemous, sense/synset embeddings work better than word embeddings in some NLP tasks. However, synset embeddings have received less attention in the literature. Existing synset embedding methods rely on complex calculations that derive synset embeddings from word embeddings or from a predefined pairwise synset similarity. In this paper, considering the graph structure of WordNet and the high-level knowledge encoded in it, we create a synset embedding directly from the WordNet graph and its synset relations. Node2Vec graph embedding is used to map the nodes of this graph to a vector space. We evaluate the performance of different graph structures (e.g., weighted/unweighted and directed/undirected graphs). Moreover, we propose a strategy for weighting the different synset relation types in the resulting WordNet graph. Experimental results on the task of measuring lexical semantic similarity show that the mean squared errors of the similarities produced by the proposed synset embedding method on the MEM and WordSim353 datasets are 0.065 and 0.035, respectively, which are lower than the mean squared errors of Word2Vec on these datasets (0.073 and 0.045, respectively). Furthermore, we use the Pearson and Spearman correlations to compare the proposed synset embedding method with state-of-the-art methods. The obtained results show the efficiency of the proposed method on various datasets: the Spearman correlation on SimLex999 improves by 0.02, and the Pearson correlation on WordSim353 improves by 0.14. |
first_indexed | 2024-04-10T17:06:40Z |
format | Article |
id | doaj.art-cd4c121cb4cb437b8d08e311dc680c1f |
institution | Directory Open Access Journal |
issn | 2667-3053 |
language | English |
last_indexed | 2024-04-10T17:06:40Z |
publishDate | 2023-02-01 |
publisher | Elsevier |
record_format | Article |
series | Intelligent Systems with Applications |
spelling | doaj.art-cd4c121cb4cb437b8d08e311dc680c1f; 2023-02-06T04:06:22Z; eng; Elsevier; Intelligent Systems with Applications; 2667-3053; 2023-02-01; 17; 200159; Synset2Node: A new synset embedding based upon graph embeddings; Fatemeh Jafarinejad (Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran); http://www.sciencedirect.com/science/article/pii/S2667305322000965; Synset embeddings; Graph embeddings; Node2vec; WordNet; Lexical Semantic Similarity |
title | Synset2Node: A new synset embedding based upon graph embeddings |
topic | Synset embeddings Graph embeddings Node2vec WordNet Lexical Semantic Similarity |
url | http://www.sciencedirect.com/science/article/pii/S2667305322000965 |
work_keys_str_mv | AT fatemehjafarinejad synset2nodeanewsynsetembeddingbasedupongraphembeddings |
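
The description above reports three evaluation measures for lexical semantic similarity: mean squared error, Pearson correlation, and Spearman correlation. The following is a minimal sketch of how such scores could be computed from a set of embeddings, not the article's implementation. It assumes that predicted similarity is the cosine between two embedding vectors and that gold scores are already scaled to [0, 1]; the synset keys (`s1`, `s2`, `s3`), the toy vectors, and the gold values are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mse(pred, gold):
    """Mean squared error between predicted and gold similarities."""
    return sum((p - g) ** 2 for p, g in zip(pred, gold)) / len(gold)


def pearson(x, y):
    """Pearson correlation; undefined (division by zero) if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(ranks(x), ranks(y))


def evaluate(embeddings, pairs):
    """Score embeddings against (key_a, key_b, gold_similarity) triples."""
    pred = [cosine(embeddings[a], embeddings[b]) for a, b, _ in pairs]
    gold = [g for _, _, g in pairs]
    return {
        "mse": mse(pred, gold),
        "pearson": pearson(pred, gold),
        "spearman": spearman(pred, gold),
    }


# Hypothetical toy data: three synset vectors and gold similarities in [0, 1].
toy_embeddings = {"s1": [1.0, 0.0], "s2": [1.0, 1.0], "s3": [0.0, 1.0]}
toy_pairs = [("s1", "s2", 0.8), ("s1", "s3", 0.1), ("s2", "s3", 0.6)]
scores = evaluate(toy_embeddings, toy_pairs)
```

Benchmarks such as WordSim353 and SimLex999 score word pairs, so a real evaluation also needs a rule for choosing which synsets represent each word; that step is outside this sketch.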