Word Sense Disambiguation Focusing on POS Tag Disambiguation in Persian

The present study deals with word-level ambiguity, focusing on homographs. Across languages, homographs can cause ambiguity in text processing. In Persian, homographs are numerous because of the orthographic structure of the language and its complex derivational and inflectional morphol...


Bibliographic Details
Main Authors: Elham Alayiaboozar, Amirsaeid Moloodi, Manouchehr Kouhestani
Format: Article
Language: English
Published: Regional Information Center for Science and Technology (RICeST) 2019-07-01
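Series: International Journal of Information Science and Management
Subjects: homographs; pos tagging; pos disambiguation; noun and adjective homographs context-sensitive rules
Online Access: https://ijism.ricest.ac.ir/index.php/ijism/article/view/1523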
_version_ 1819027475982712832
author Elham Alayiaboozar
Amirsaeid Moloodi
Manouchehr Kouhestani
collection DOAJ
description The present study deals with word-level ambiguity, focusing on homographs. Across languages, homographs can cause ambiguity in text processing. In Persian, homographs are numerous because of the orthographic structure of the language and its complex derivational and inflectional morphology. In this study, a broad list of homographs was first extracted from some Persian corpora. The list indicates that homographs are numerous in Persian corpora and that the most frequent ones arise from the identical orthographic representation of certain inflectional and derivational morphemes. According to the list, the most frequent homographs are nouns and adjectives ending in <ی> /i/. POS tag disambiguation of such homographs would make word sense disambiguation easier and lead to better text processing. A list of noun and adjective homographs ending in <ی> was therefore extracted in order to determine their correct POS tags. The results were examined to derive context-sensitive rules that assign the correct POS tag to a homograph in its syntactic structure. The accuracy of the rules was then checked, and most rules proved highly accurate, indicating that most of them are valid.
first_indexed 2024-12-21T05:43:04Z
format Article
id doaj.art-b47f9ef29d184f6b92d6fe5e9cbdf07a
institution Directory Open Access Journal
issn 2008-8302
2008-8310
language English
last_indexed 2024-12-21T05:43:04Z
publishDate 2019-07-01
publisher Regional Information Center for Science and Technology (RICeST)
record_format Article
series International Journal of Information Science and Management
title Word Sense Disambiguation Focusing on POS Tag Disambiguation in Persian
topic homographs
pos tagging
pos disambiguation
noun and adjective homographs context-sensitive rules
url https://ijism.ricest.ac.ir/index.php/ijism/article/view/1523
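Illustrative note: the abstract describes context-sensitive rules that assign a POS tag to a noun/adjective homograph from its syntactic context, and an accuracy check of those rules. The record does not list the rules themselves, so the following is only a minimal Python sketch of how such a rule set and its per-rule accuracy might be organized. The rule names, the tag labels ("N", "ADJ", "P"), the two example rules, and the sample data are all hypothetical and are not taken from the study.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# A token is (surface form, POS tag); the tag is None for an unresolved homograph.
Token = tuple[str, Optional[str]]


@dataclass
class Rule:
    name: str
    tag: str  # tag assigned when the rule fires
    matches: Callable[[list[Token], int], bool]


def prev_tag(tokens: list[Token], i: int) -> Optional[str]:
    """Tag of the token immediately before position i, if any."""
    return tokens[i - 1][1] if i > 0 else None


# Hypothetical rules for illustration only; NOT the rules reported in the study.
RULES = [
    Rule("after-preposition", "N", lambda t, i: prev_tag(t, i) == "P"),
    Rule("after-noun", "ADJ", lambda t, i: prev_tag(t, i) == "N"),
]


def disambiguate(tokens, i, rules=RULES):
    """Return (tag, rule name) from the first rule that fires, else (None, None)."""
    for rule in rules:
        if rule.matches(tokens, i):
            return rule.tag, rule.name
    return None, None


def rule_accuracy(samples, rules=RULES):
    """samples: list of (tokens, homograph index, gold tag).

    Returns {rule name: share of that rule's firings that match the gold tag}.
    """
    fired, correct = {}, {}
    for tokens, i, gold in samples:
        tag, name = disambiguate(tokens, i, rules)
        if name is None:
            continue  # no rule applied; the sample does not count for any rule
        fired[name] = fired.get(name, 0) + 1
        correct[name] = correct.get(name, 0) + (tag == gold)
    return {name: correct[name] / fired[name] for name in fired}


# Made-up placeholder data: surface forms are dummies, gold tags are invented.
SAMPLES = [
    ([("prep", "P"), ("homograph", None)], 1, "N"),
    ([("noun", "N"), ("homograph", None)], 1, "ADJ"),
    ([("noun", "N"), ("homograph", None)], 1, "N"),
]

if __name__ == "__main__":
    print(rule_accuracy(SAMPLES))
```

Scoring each rule separately, rather than the rule set as a whole, mirrors the abstract's statement that the accuracy of most (not all) rules was high: a weak rule shows up as a low individual score instead of being hidden in an aggregate.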