Effects of Diacritics on Web Search Engines Performance for Retrieval Yoruba Documents

This paper aims to find out the possible effect of the use or nonuse of diacritics in Yoruba search queries on the performance of major search engines, AOL, Bing, Google and Yahoo!, in retrieving documents. 30 Yoruba queries created from the most searched keywords from Nigeria on Google search logs...

Full description

Bibliographic Details
Main Author: Toluwase Victor Asubiaro
Format: Article
Language:English
Published: National Taiwan University 2014-06-01
Series:Journal of Library and Information Studies
Subjects:
Online Access:https://jlis.lis.ntu.edu.tw/files/journal/j38-1.pdf
_version_ 1818445421534511104
author Toluwase Victor Asubiaro
author_facet Toluwase Victor Asubiaro
author_sort Toluwase Victor Asubiaro
collection DOAJ
description This paper aims to find out the possible effect of the use or nonuse of diacritics in Yoruba search queries on the performance of major search engines, AOL, Bing, Google and Yahoo!, in retrieving documents. 30 Yoruba queries created from the most searched keywords from Nigeria on Google search logs were submitted to the search engines. The search queries were posed to the search engines without diacritics and then with diacritics. All of the search engines retrieved more sites in response to the queries without diacritics. Also, they all retrieved more precise results for queries without diacritics. The search engines also answered more queries without diacritics. There was no significant difference in the precision values of any two of the four search engines for diacritized and undiacritized queries. There was a significant difference in the effectiveness of AOL and Yahoo when diacritics were applied and when they were not applied. The findings of the study indicate that the search engines do not find a relationship between the diacritized Yoruba words and the undiacritized versions. Therefore, there is a need for search engines to add normalization steps to pre-process Yoruba queries and indexes. This study concentrates on a problem with search engines that has not been previously investigated.
first_indexed 2024-12-14T19:31:34Z
format Article
id doaj.art-d537caeb84194e2f958cd60475710aca
institution Directory Open Access Journal
issn 1606-7509
1606-7509
language English
last_indexed 2024-12-14T19:31:34Z
publishDate 2014-06-01
publisher National Taiwan University
record_format Article
series Journal of Library and Information Studies
spelling doaj.art-d537caeb84194e2f958cd60475710aca2022-12-21T22:50:04ZengNational Taiwan UniversityJournal of Library and Information Studies1606-75091606-75092014-06-0112111910.6182/jlis.2014.12(1).001Effects of Diacritics on Web Search Engines Performance for Retrieval Yoruba DocumentsToluwase Victor Asubiaro0E. Latunde Odeku Medical Library, College of Medicine, University College Hospital Campus, University of Ibadan, Ibadan, NigeriaThis paper aims to find out the possible effect of the use or nonuse of diacritics in Yoruba search queries on the performance of major search engines, AOL, Bing, Google and Yahoo!, in retrieving documents. 30 Yoruba queries created from the most searched keywords from Nigeria on Google search logs were submitted to the search engines. The search queries were posed to the search engines without diacritics and then with diacritics. All of the search engines retrieved more sites in response to the queries without diacritics. Also, they all retrieved more precise results for queries without diacritics. The search engines also answered more queries without diacritics. There was no significant difference in the precision values of any two of the four search engines for diacritized and undiacritized queries. There was a significant difference in the effectiveness of AOL and Yahoo when diacritics were applied and when they were not applied. The findings of the study indicate that the search engines do not find a relationship between the diacritized Yoruba words and the undiacritized versions. Therefore, there is a need for search engines to add normalization steps to pre-process Yoruba queries and indexes. This study concentrates on a problem with search engines that has not been previously investigated.https://jlis.lis.ntu.edu.tw/files/journal/j38-1.pdfinformation retrievalinformation retrieval evaluationdiacriticssearch enginesyoruba language
spellingShingle Toluwase Victor Asubiaro
Effects of Diacritics on Web Search Engines Performance for Retrieval Yoruba Documents
Journal of Library and Information Studies
information retrieval
information retrieval evaluation
diacritics
search engines
yoruba language
title Effects of Diacritics on Web Search Engines Performance for Retrieval Yoruba Documents
title_full Effects of Diacritics on Web Search Engines Performance for Retrieval Yoruba Documents
title_fullStr Effects of Diacritics on Web Search Engines Performance for Retrieval Yoruba Documents
title_full_unstemmed Effects of Diacritics on Web Search Engines Performance for Retrieval Yoruba Documents
title_short Effects of Diacritics on Web Search Engines Performance for Retrieval Yoruba Documents
title_sort effects of diacritics on web search engines performance for retrieval yoruba documents
topic information retrieval
information retrieval evaluation
diacritics
search engines
yoruba language
url https://jlis.lis.ntu.edu.tw/files/journal/j38-1.pdf
work_keys_str_mv AT toluwasevictorasubiaro effectsofdiacriticsonwebsearchenginesperformanceforretrievalyorubadocuments