Hybrid Sense Classification Method for Large-Scale Word Sense Disambiguation

Word sense disambiguation (WSD) is a task of determining a reasonable sense of a word in a particular context. Although recent studies have demonstrated some progress in the advancement of neural language models, the scope of research is still such that the senses of several words can only be determ...

Full description

Bibliographic Details
Main Authors: Yoonseok Heo, Sangwoo Kang, Jungyun Seo
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8976181/
_version_ 1818857165159399424
author Yoonseok Heo
Sangwoo Kang
Jungyun Seo
author_facet Yoonseok Heo
Sangwoo Kang
Jungyun Seo
author_sort Yoonseok Heo
collection DOAJ
description Word sense disambiguation (WSD) is a task of determining a reasonable sense of a word in a particular context. Although recent studies have demonstrated some progress in the advancement of neural language models, the scope of research is still such that the senses of several words can only be determined in a few domains. Therefore, it is necessary to move toward developing a highly scalable process that can address a lot of senses occurring in various domains. This paper introduces a new large WSD dataset that is automatically constructed from the Oxford Dictionary, which is widely used as a standard source for the meaning of words. We propose a new WSD model that individually determines the sense of the word in accordance with its part of speech in the context. In addition, we introduce a hybrid sense prediction method that separately classifies the less frequently used senses for achieving a reasonable performance. We have conducted comparative experiments to demonstrate that the proposed method is more reliable compared with the baseline approaches. Also, we investigated the adaptation of the method to a realistic environment with the use of news articles.
first_indexed 2024-12-19T08:36:03Z
format Article
id doaj.art-dbb398f9072d415cad83de7562f4a0ff
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-19T08:36:03Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-dbb398f9072d415cad83de7562f4a0ff2022-12-21T20:29:03ZengIEEEIEEE Access2169-35362020-01-018272472725610.1109/ACCESS.2020.29704368976181Hybrid Sense Classification Method for Large-Scale Word Sense DisambiguationYoonseok Heo0https://orcid.org/0000-0002-4480-6415Sangwoo Kang1https://orcid.org/0000-0002-0281-1726Jungyun Seo2https://orcid.org/0000-0003-3670-7334Department of Computer Science and Engineering, Sogang University, Seoul, South KoreaDepartment of Software, Gachon University, Gyeonggi, South KoreaDepartment of Computer Science and Engineering, Sogang University, Seoul, South KoreaWord sense disambiguation (WSD) is a task of determining a reasonable sense of a word in a particular context. Although recent studies have demonstrated some progress in the advancement of neural language models, the scope of research is still such that the senses of several words can only be determined in a few domains. Therefore, it is necessary to move toward developing a highly scalable process that can address a lot of senses occurring in various domains. This paper introduces a new large WSD dataset that is automatically constructed from the Oxford Dictionary, which is widely used as a standard source for the meaning of words. We propose a new WSD model that individually determines the sense of the word in accordance with its part of speech in the context. In addition, we introduce a hybrid sense prediction method that separately classifies the less frequently used senses for achieving a reasonable performance. We have conducted comparative experiments to demonstrate that the proposed method is more reliable compared with the baseline approaches. Also, we investigated the adaptation of the method to a realistic environment with the use of news articles.https://ieeexplore.ieee.org/document/8976181/Computational and artificial intelligenceEnglish vocabulary learningnatural language processingneural networksword sense disambiguation
spellingShingle Yoonseok Heo
Sangwoo Kang
Jungyun Seo
Hybrid Sense Classification Method for Large-Scale Word Sense Disambiguation
IEEE Access
Computational and artificial intelligence
English vocabulary learning
natural language processing
neural networks
word sense disambiguation
title Hybrid Sense Classification Method for Large-Scale Word Sense Disambiguation
title_full Hybrid Sense Classification Method for Large-Scale Word Sense Disambiguation
title_fullStr Hybrid Sense Classification Method for Large-Scale Word Sense Disambiguation
title_full_unstemmed Hybrid Sense Classification Method for Large-Scale Word Sense Disambiguation
title_short Hybrid Sense Classification Method for Large-Scale Word Sense Disambiguation
title_sort hybrid sense classification method for large scale word sense disambiguation
topic Computational and artificial intelligence
English vocabulary learning
natural language processing
neural networks
word sense disambiguation
url https://ieeexplore.ieee.org/document/8976181/
work_keys_str_mv AT yoonseokheo hybridsenseclassificationmethodforlargescalewordsensedisambiguation
AT sangwookang hybridsenseclassificationmethodforlargescalewordsensedisambiguation
AT jungyunseo hybridsenseclassificationmethodforlargescalewordsensedisambiguation