Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning Embeddings

We propose a lightweight algorithm of learning word single-meaning embeddings (WSME), by exploring WordNet synsets and Doc2vec document embeddings, to enhance the accuracy of semantic relatedness measurement. In our model, each polyseme is decomposed into a series of monosemous words with diverse Wo...

Full description

Bibliographic Details
Main Authors: Xiaotao Li, Shujuan You, Wai Chen
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9521506/
_version_ 1831774101645033472
author Xiaotao Li
Shujuan You
Wai Chen
author_facet Xiaotao Li
Shujuan You
Wai Chen
author_sort Xiaotao Li
collection DOAJ
description We propose a lightweight algorithm of learning word single-meaning embeddings (WSME), by exploring WordNet synsets and Doc2vec document embeddings, to enhance the accuracy of semantic relatedness measurement. In our model, each polyseme is decomposed into a series of monosemous words with diverse WordNet synset tags which represent different word meanings, and there is a one-to-one correspondence between a word meaning and a vector. Our algorithm proceeds in 3 steps. First, the word sense disambiguation of each polyseme in different contexts is achieved by computing the maximum relatedness between the context of this polyseme and all its candidate meaning definitions in WordNet. Second, each tagged word is lemmatized according to its synset tag to alleviate the word sparsity problem caused by polysemes decomposition. Third, the word single-meaning embeddings are learned from the meaning-tagged corpus, and the semantic relatedness between words can be more accurately measured based on such embeddings. Our experimental results show that our algorithm achieves better performance on the semantic relatedness measurement compared with existing techniques.
first_indexed 2024-12-22T08:48:20Z
format Article
id doaj.art-853671ccd6a841e0a1590a94e9a1ea69
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-22T08:48:20Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-853671ccd6a841e0a1590a94e9a1ea692022-12-21T18:32:02ZengIEEEIEEE Access2169-35362021-01-01911742411743310.1109/ACCESS.2021.31074459521506Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning EmbeddingsXiaotao Li0https://orcid.org/0000-0001-8786-2962Shujuan You1Wai Chen2https://orcid.org/0000-0002-1663-2729China Mobile Research Institute, Beijing, ChinaChina Mobile Research Institute, Beijing, ChinaChina Mobile Research Institute, Beijing, ChinaWe propose a lightweight algorithm of learning word single-meaning embeddings (WSME), by exploring WordNet synsets and Doc2vec document embeddings, to enhance the accuracy of semantic relatedness measurement. In our model, each polyseme is decomposed into a series of monosemous words with diverse WordNet synset tags which represent different word meanings, and there is a one-to-one correspondence between a word meaning and a vector. Our algorithm proceeds in 3 steps. First, the word sense disambiguation of each polyseme in different contexts is achieved by computing the maximum relatedness between the context of this polyseme and all its candidate meaning definitions in WordNet. Second, each tagged word is lemmatized according to its synset tag to alleviate the word sparsity problem caused by polysemes decomposition. Third, the word single-meaning embeddings are learned from the meaning-tagged corpus, and the semantic relatedness between words can be more accurately measured based on such embeddings. Our experimental results show that our algorithm achieves better performance on the semantic relatedness measurement compared with existing techniques.https://ieeexplore.ieee.org/document/9521506/Semantic relatednessword single-meaning embeddingWordNetdocument embeddinglemmatization
spellingShingle Xiaotao Li
Shujuan You
Wai Chen
Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning Embeddings
IEEE Access
Semantic relatedness
word single-meaning embedding
WordNet
document embedding
lemmatization
title Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning Embeddings
title_full Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning Embeddings
title_fullStr Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning Embeddings
title_full_unstemmed Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning Embeddings
title_short Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning Embeddings
title_sort enhancing accuracy of semantic relatedness measurement by word single meaning embeddings
topic Semantic relatedness
word single-meaning embedding
WordNet
document embedding
lemmatization
url https://ieeexplore.ieee.org/document/9521506/
work_keys_str_mv AT xiaotaoli enhancingaccuracyofsemanticrelatednessmeasurementbywordsinglemeaningembeddings
AT shujuanyou enhancingaccuracyofsemanticrelatednessmeasurementbywordsinglemeaningembeddings
AT waichen enhancingaccuracyofsemanticrelatednessmeasurementbywordsinglemeaningembeddings