A metaheuristic with a neural surrogate function for Word Sense Disambiguation

Word Sense Disambiguation (WSD), one of the earliest problems in natural language processing, aims to determine the correct sense of a word in context. The semantic information provided by WSD systems benefits many tasks, such as machine translation, information extraction, and s...

Bibliographic Details
Main Authors: Azim Keshavarzian Nodehi, Nasrollah Moghadam Charkari
Format: Article
Language: English
Published: Elsevier 2022-09-01
Series: Machine Learning with Applications
Subjects: Word Sense Disambiguation, Metaheuristics, Surrogate Functions, Sense Mapping
Online Access: http://www.sciencedirect.com/science/article/pii/S2666827022000627
author Azim Keshavarzian Nodehi
Nasrollah Moghadam Charkari
collection DOAJ
description Word Sense Disambiguation (WSD), one of the earliest problems in natural language processing, aims to determine the correct sense of a word in context. The semantic information provided by WSD systems benefits many tasks, such as machine translation, information extraction, and semantic parsing. This work proposes a new approach to WSD that uses a neural network as a surrogate fitness function in a metaheuristic algorithm. It also proposes a new method for training word and sense embeddings simultaneously: the node2vec algorithm is run on the WordNet graph to generate sequences containing both words and senses, and these sequences are then used together with paragraphs from Wikipedia in the word2vec algorithm to produce embeddings for words and senses at the same time. To address data imbalance in this task, sense probability distributions extracted from the training corpus are used in the search process of the proposed simulated annealing algorithm. Furthermore, we introduce a new approach for clustering and mapping senses in the WordNet graph, which considerably improves the accuracy of the proposed method. In this approach, nodes in the WordNet graph are clustered under the constraint that no two senses of the same word appear in one cluster. Then, repeatedly, all nodes in each cluster are mapped to a randomly selected node from that cluster, so that the representative node can take advantage of the training instances of all the other nodes in the cluster. The proposed method is trained on the SemCor dataset, with the SemEval-2015 dataset used as the validation set. The final evaluation is performed on SensEval-2, SensEval-3, SemEval-2007, SemEval-2013, SemEval-2015, and the concatenation of all five datasets. Performance is also evaluated on the four content word categories: nouns, verbs, adjectives, and adverbs. Experimental results show that the proposed method achieves accuracies between 74.8 and 84.6 percent across these ten evaluation categories, which is close to, and in some cases better than, the state of the art for this task.
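The search procedure summarized in the description can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `simulated_annealing_wsd`, the geometric cooling schedule, the step count, the sense labels, and the hand-written `toy_surrogate` (standing in for the trained neural network) are all assumptions made for illustration. The sketch only shows the general shape of the idea: proposals are drawn from per-word sense probabilities, and a surrogate function scores each candidate assignment.

```python
import math
import random
from typing import Callable, Dict, List, Sequence

Assignment = List[str]


def simulated_annealing_wsd(
    words: Sequence[str],
    candidates: Dict[str, List[str]],
    priors: Dict[str, Dict[str, float]],
    surrogate: Callable[[Sequence[str], Assignment], float],
    steps: int = 500,
    t_start: float = 1.0,
    t_end: float = 0.01,
    seed: int = 0,
) -> Assignment:
    """Search for a high-scoring sense assignment (illustrative sketch only)."""
    if not words:
        return []
    rng = random.Random(seed)

    def sample_sense(word: str) -> str:
        # Draw a sense in proportion to its (assumed) training-corpus
        # probability; the small constant keeps unseen senses reachable.
        senses = candidates[word]
        weights = [priors.get(word, {}).get(s, 0.0) + 1e-6 for s in senses]
        return rng.choices(senses, weights=weights, k=1)[0]

    current = [sample_sense(w) for w in words]
    current_score = surrogate(words, current)
    best, best_score = list(current), current_score

    for step in range(steps):
        # Geometric cooling from t_start down to t_end (an assumed schedule).
        t = t_start * (t_end / t_start) ** (step / max(1, steps - 1))
        i = rng.randrange(len(words))
        proposal = list(current)
        proposal[i] = sample_sense(words[i])
        score = surrogate(words, proposal)
        delta = score - current_score
        # Always accept improvements; accept worse moves with
        # probability exp(delta / t), which shrinks as t falls.
        if delta >= 0 or rng.random() < math.exp(delta / t):
            current, current_score = proposal, score
            if current_score > best_score:
                best, best_score = list(current), current_score
    return best


def toy_surrogate(words: Sequence[str], senses: Assignment) -> float:
    # Hypothetical stand-in for the neural surrogate: it rewards one
    # hand-picked coherent pair of senses and nothing else.
    return 1.0 if "bank%river" in senses and "river%stream" in senses else 0.0


words = ["bank", "river"]
candidates = {"bank": ["bank%finance", "bank%river"], "river": ["river%stream"]}
priors = {"bank": {"bank%finance": 0.7, "bank%river": 0.3}}
result = simulated_annealing_wsd(words, candidates, priors, toy_surrogate)
```

Sampling proposals from the sense priors means frequent senses are tried more often, while the surrogate score can still steer the search toward a rarer sense when the context supports it.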
first_indexed 2024-04-12T23:30:00Z
format Article
id doaj.art-001e6d7867334fe9a2acfba1b631a7b7
institution Directory Open Access Journal
issn 2666-8270
language English
last_indexed 2024-04-12T23:30:00Z
publishDate 2022-09-01
publisher Elsevier
record_format Article
series Machine Learning with Applications
spelling doaj.art-001e6d7867334fe9a2acfba1b631a7b7 | 2022-12-22T03:12:19Z | eng | Elsevier | Machine Learning with Applications | 2666-8270 | 2022-09-01 | 9 | 100369 | A metaheuristic with a neural surrogate function for Word Sense Disambiguation | Azim Keshavarzian Nodehi (0), Nasrollah Moghadam Charkari (1) | (0) Corresponding author.; Tarbiat Modares University, Tehran, Iran; (1) Tarbiat Modares University, Tehran, Iran | http://www.sciencedirect.com/science/article/pii/S2666827022000627 | Word Sense Disambiguation; Metaheuristics; Surrogate Functions; Sense Mapping
title A metaheuristic with a neural surrogate function for Word Sense Disambiguation
topic Word Sense Disambiguation
Metaheuristics
Surrogate Functions
Sense Mapping
url http://www.sciencedirect.com/science/article/pii/S2666827022000627