Survey of Information Searching and Retrieving Challenges in Databases in Connection with Persian Language Writing Features

Purpose: The present research was carried out with the aim of explicating the major writing and semantic problems of Persian language when using data environments and determining the degree of compatibility and attention to these features in Persian databases. Methodology/Approach: The present resea...

Full description

Bibliographic Details
Main Authors: Hoda Homavandi, Yaghub Norouzi, Moluk S. Hoseine Beheshti
Format: Article
Language:fas
Published: Iranian Research Institute for Information and Technology 2018-06-01
Series:Iranian Journal of Information Processing & Management
Subjects:
Online Access:http://jipm.irandoc.ac.ir/browse.php?a_code=A-10-3580-1&slc_lang=en&sid=1
_version_ 1818388711532920832
author Hoda Homavandi
Yaghub Norouzi
Moluk S. Hoseine Beheshti
author_facet Hoda Homavandi
Yaghub Norouzi
Moluk S. Hoseine Beheshti
author_sort Hoda Homavandi
collection DOAJ
description Purpose: The present research was carried out with the aim of explicating the major writing and semantic problems of Persian language when using data environments and determining the degree of compatibility and attention to these features in Persian databases. Methodology/Approach: The present research is of survey analytical type being conducted through direct observation. Having reviewed the related literature, we kept a checklist of search key words. Each of these key words was searched in the databases under study, such as Iranian Research Institute for Information Science and Technology, regional Centre for Information Science and Technology, Noor Magaz, and Scientific Information database Affiliated with Jahad Daneshgahi, and the number of retrieved findings was recorded. Findings: Some of the writing and semantic features of Persian language contribute to problems associated with retrieving information from the selected databases. Some of these features include connected and disconnected forms of writing of derivative, compound, and derivative-compound words, diversity of plural forms, loanwords and their equivalents in writing as well as polysemy, homonymy, etc., in semantics. For instance, retrieving different results for various writing forms of the key words "فناوری و فن آوری" as derivative-compound words or "پتاسیوم و پتاسیم" as various forms of recording words, or retrieving different findings for key words "دریای خزر، دریای مازندران و دریای کاسپین" as well as lack of their appropriate coverage as synonymous words and giving the user information about it in order to improve the exploration process, for it has negative effects on search and retrieval process. Conclusion: Findings indicated that Persian databases do not pay adequate attention to writing and semantic features of Persian language, and disregard many of its features in searching and retrieving information. In connection with the impact of these features on the interaction of users with databases, Persian- speaking users' need for native exploration tools and databases designed in accordance with the features of their own language have become more and more urgent. The present research has examined the ability of Persian databases in covering some of the features of this language, which have a noticeable impact on the process of searching and retrieval, pinpointing the weak points and strengths of these databases. The results of the present research could be utilized to improve the performance of the above-mentioned databases.
first_indexed 2024-12-14T04:30:11Z
format Article
id doaj.art-918efa3d9aa241c5be3f5f789991746a
institution Directory Open Access Journal
issn 2251-8223
2251-8231
language fas
last_indexed 2024-12-14T04:30:11Z
publishDate 2018-06-01
publisher Iranian Research Institute for Information and Technology
record_format Article
series Iranian Journal of Information Processing & Management
spelling doaj.art-918efa3d9aa241c5be3f5f789991746a2022-12-21T23:17:06ZfasIranian Research Institute for Information and TechnologyIranian Journal of Information Processing & Management2251-82232251-82312018-06-0133310871110Survey of Information Searching and Retrieving Challenges in Databases in Connection with Persian Language Writing FeaturesHoda Homavandi0Yaghub Norouzi1Moluk S. Hoseine Beheshti2 . PhD Condidate in Knowledge & Information Science, Tehran University Qom University Iranian Research Institute for Information Science & Technology (IRANDOC), Tehran, Iran. Purpose: The present research was carried out with the aim of explicating the major writing and semantic problems of Persian language when using data environments and determining the degree of compatibility and attention to these features in Persian databases. Methodology/Approach: The present research is of survey analytical type being conducted through direct observation. Having reviewed the related literature, we kept a checklist of search key words. Each of these key words was searched in the databases under study, such as Iranian Research Institute for Information Science and Technology, regional Centre for Information Science and Technology, Noor Magaz, and Scientific Information database Affiliated with Jahad Daneshgahi, and the number of retrieved findings was recorded. Findings: Some of the writing and semantic features of Persian language contribute to problems associated with retrieving information from the selected databases. Some of these features include connected and disconnected forms of writing of derivative, compound, and derivative-compound words, diversity of plural forms, loanwords and their equivalents in writing as well as polysemy, homonymy, etc., in semantics. For instance, retrieving different results for various writing forms of the key words "فناوری و فن آوری" as derivative-compound words or "پتاسیوم و پتاسیم" as various forms of recording words, or retrieving different findings for key words "دریای خزر، دریای مازندران و دریای کاسپین" as well as lack of their appropriate coverage as synonymous words and giving the user information about it in order to improve the exploration process, for it has negative effects on search and retrieval process. Conclusion: Findings indicated that Persian databases do not pay adequate attention to writing and semantic features of Persian language, and disregard many of its features in searching and retrieving information. In connection with the impact of these features on the interaction of users with databases, Persian- speaking users' need for native exploration tools and databases designed in accordance with the features of their own language have become more and more urgent. The present research has examined the ability of Persian databases in covering some of the features of this language, which have a noticeable impact on the process of searching and retrieval, pinpointing the weak points and strengths of these databases. The results of the present research could be utilized to improve the performance of the above-mentioned databases.http://jipm.irandoc.ac.ir/browse.php?a_code=A-10-3580-1&slc_lang=en&sid=1information retrieval Databases Persian language Writing features
spellingShingle Hoda Homavandi
Yaghub Norouzi
Moluk S. Hoseine Beheshti
Survey of Information Searching and Retrieving Challenges in Databases in Connection with Persian Language Writing Features
Iranian Journal of Information Processing & Management
information retrieval
Databases
Persian language
Writing features
title Survey of Information Searching and Retrieving Challenges in Databases in Connection with Persian Language Writing Features
title_full Survey of Information Searching and Retrieving Challenges in Databases in Connection with Persian Language Writing Features
title_fullStr Survey of Information Searching and Retrieving Challenges in Databases in Connection with Persian Language Writing Features
title_full_unstemmed Survey of Information Searching and Retrieving Challenges in Databases in Connection with Persian Language Writing Features
title_short Survey of Information Searching and Retrieving Challenges in Databases in Connection with Persian Language Writing Features
title_sort survey of information searching and retrieving challenges in databases in connection with persian language writing features
topic information retrieval
Databases
Persian language
Writing features
url http://jipm.irandoc.ac.ir/browse.php?a_code=A-10-3580-1&slc_lang=en&sid=1
work_keys_str_mv AT hodahomavandi surveyofinformationsearchingandretrievingchallengesindatabasesinconnectionwithpersianlanguagewritingfeatures
AT yaghubnorouzi surveyofinformationsearchingandretrievingchallengesindatabasesinconnectionwithpersianlanguagewritingfeatures
AT molukshoseinebeheshti surveyofinformationsearchingandretrievingchallengesindatabasesinconnectionwithpersianlanguagewritingfeatures