Qur'anic words stemming
Arabic words are known to have complex morphological structure. The different structures produce various word patterns or derivatives from a root word. This paper attempts to identify various word patterns that originate from a root word. These word patterns are compared to the words in the 30th par...
Main Authors: | , , , |
---|---|
Format: | Article |
Published: |
Springer
2010
|
Subjects: |
_version_ | 1796945323035197440 |
---|---|
author | Raja Yusof, Raja Jamilah Zainuddin, R. Baba, Mohd Sapiyan Yusoff, Z.M. |
author_facet | Raja Yusof, Raja Jamilah Zainuddin, R. Baba, Mohd Sapiyan Yusoff, Z.M. |
author_sort | Raja Yusof, Raja Jamilah |
collection | UM |
description | Arabic words are known to have complex morphological structure. The different structures produce various word patterns or derivatives from a root word. This paper attempts to identify various word patterns that originate from a root word. These word patterns are compared to the words in the 30th part of the Qur'an. Nine stemming test cases were outlined for words in the 30 th part of the Qur'an. Analysis showed that stemming nouns and particles leads to a lower percentage error compared to stemming the 10 alphabets that can be added as affixes in a root word. A rule-based stemming engine (RSE) was also implemented and the stemming accuracy achieved was 62.5 and the average time taken to stem 1000 word tokens was 11.7ms. The accuracy of the results was comparable to other stemming engines such as the Khoja stemmer, Buckwalter Morphological Analyzer (BAMA), Tri-literal Root Extraction (TRE) algorithm, and Voting algorithm. |
first_indexed | 2024-03-06T05:14:56Z |
format | Article |
id | um.eprints-5673 |
institution | Universiti Malaya |
last_indexed | 2024-03-06T05:14:56Z |
publishDate | 2010 |
publisher | Springer |
record_format | dspace |
spelling | um.eprints-56732021-04-30T00:43:29Z http://eprints.um.edu.my/5673/ Qur'anic words stemming Raja Yusof, Raja Jamilah Zainuddin, R. Baba, Mohd Sapiyan Yusoff, Z.M. T Technology (General) Arabic words are known to have complex morphological structure. The different structures produce various word patterns or derivatives from a root word. This paper attempts to identify various word patterns that originate from a root word. These word patterns are compared to the words in the 30th part of the Qur'an. Nine stemming test cases were outlined for words in the 30 th part of the Qur'an. Analysis showed that stemming nouns and particles leads to a lower percentage error compared to stemming the 10 alphabets that can be added as affixes in a root word. A rule-based stemming engine (RSE) was also implemented and the stemming accuracy achieved was 62.5 and the average time taken to stem 1000 word tokens was 11.7ms. The accuracy of the results was comparable to other stemming engines such as the Khoja stemmer, Buckwalter Morphological Analyzer (BAMA), Tri-literal Root Extraction (TRE) algorithm, and Voting algorithm. Springer 2010 Article PeerReviewed Raja Yusof, Raja Jamilah and Zainuddin, R. and Baba, Mohd Sapiyan and Yusoff, Z.M. (2010) Qur'anic words stemming. Arabian Journal for Science and Engineering, 35 (2C). pp. 37-49. ISSN 2193-567X, https://pdfs.semanticscholar.org/fec6/3fee7f1ab03ed674cdf6ea2cd3560d46ef17.pdf?_ga=2.68002260.498228766.1581304948-1568696367.1580870149 |
spellingShingle | T Technology (General) Raja Yusof, Raja Jamilah Zainuddin, R. Baba, Mohd Sapiyan Yusoff, Z.M. Qur'anic words stemming |
title | Qur'anic words stemming |
title_full | Qur'anic words stemming |
title_fullStr | Qur'anic words stemming |
title_full_unstemmed | Qur'anic words stemming |
title_short | Qur'anic words stemming |
title_sort | qur anic words stemming |
topic | T Technology (General) |
work_keys_str_mv | AT rajayusofrajajamilah quranicwordsstemming AT zainuddinr quranicwordsstemming AT babamohdsapiyan quranicwordsstemming AT yusoffzm quranicwordsstemming |