Rules frequency order stemmer for Malay language.
The importance of stemmers is obvious with the advent of effective information retrieval systems. Unfortunately, Malay stemming problems are difficult to solve due to the complexity of word morphology. The Rules Application Order (RAO) stemmer is examined for enhancing performance to minimize the percen...
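
The full abstract in this record describes Rules Frequency Order (RFO) as rearranging the stemming rules according to how often each rule was used in the previous execution. The idea can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the `Rule` class, the three affix rules, the toy root dictionary `ROOTS`, and the dictionary-lookup acceptance test are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Rule:
    """A hypothetical affix-stripping rule with a usage counter."""
    prefix: str = ""
    suffix: str = ""
    hits: int = 0

    def apply(self, word):
        """Return the word with the affixes stripped, or None if the rule does not match."""
        if not (word.startswith(self.prefix) and word.endswith(self.suffix)):
            return None
        if len(word) <= len(self.prefix) + len(self.suffix):
            return None
        end = len(word) - len(self.suffix)
        return word[len(self.prefix):end]


# Toy root-word dictionary (assumption; a real stemmer would use a full lexicon).
ROOTS = {"makan", "main", "minum", "ajar"}


def stem(word, rules, roots):
    """Try the rules in their current order; the first one yielding a known root wins."""
    if word in roots:
        return word
    for rule in rules:
        candidate = rule.apply(word)
        if candidate in roots:
            rule.hits += 1  # record usage for the next reordering
            return candidate
    return word  # no rule produced a known root; leave the word unchanged


def reorder_by_frequency(rules):
    """Return the rules sorted by usage count, most used first (ties keep their order)."""
    return sorted(rules, key=lambda r: r.hits, reverse=True)


# Illustrative rule order for the first run (not the paper's rule set).
rules = [Rule(prefix="ber"), Rule(prefix="pe"), Rule(suffix="an")]
words = ["makanan", "minuman", "bermain", "ajaran"]

stems = [stem(w, rules, ROOTS) for w in words]
reordered = reorder_by_frequency(rules)
print(stems)
print([(r.prefix, r.suffix, r.hits) for r in reordered])
```

In this sketch the suffix rule fires most often during the first pass, so it moves to the front of the list for the next execution. Whether the published stemmer counts usage or resolves ties in this way is not stated in the record.
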
Main authors: | Abdullah, Muhammad Taufik; Ahmad, Fatimah; Mahmod, Ramlan; Tengku Sembok, Tengku Mohd |
---|---|
Format: | Article |
Language: | English |
Published in: | International Journal of Computer Science and Network Security (IJCSNS), 2009 |
Online access: | http://psasir.upm.edu.my/id/eprint/17738/1/Rules%20frequency%20order%20stemmer%20for%20Malay%20language.pdf |
_version_ | 1825946068913750016 |
---|---|
author | Abdullah, Muhammad Taufik Ahmad, Fatimah Mahmod, Ramlan Tengku Sembok, Tengku Mohd |
collection | UPM |
description | The importance of stemmers is obvious with the advent of effective information retrieval systems. Unfortunately, Malay stemming problems are difficult to solve due to the complexity of word morphology. The Rules Application Order (RAO) stemmer is examined for enhancing performance to minimize the percentage of stemming errors. This paper presents a stemming approach called Rules Frequency Order (RFO). RFO rearranges the stemming rules according to the frequency of their usage in the previous execution. The results show that the approach achieves a higher percentage of correct stems than the RAO stemming approach. |
first_indexed | 2024-03-06T07:41:19Z |
format | Article |
id | upm.eprints-17738 |
institution | Universiti Putra Malaysia |
language | English |
last_indexed | 2024-03-06T07:41:19Z |
publishDate | 2009 |
publisher | International Journal of Computer Science and Network Security (IJCSNS) |
record_format | dspace |
spelling | upm.eprints-177382015-11-16T09:02:03Z http://psasir.upm.edu.my/id/eprint/17738/ Rules frequency order stemmer for Malay language. Abdullah, Muhammad Taufik Ahmad, Fatimah Mahmod, Ramlan Tengku Sembok, Tengku Mohd The importance of stemmer is obvious with the advent of effective information retrieval systems. Unfortunately, Malay stemming problems are difficult to solve due to complexity of words morphology. The Rules Application Order (RAO)stemmer is examined for enhancing performance to minimize the percentage of stemming errors. This paper presents a stemming approach called Rules Frequency Order (RFO). RFO rearranges the stemming rules according to the frequency of their usage from the previous execution. It shows that the approach provides a higher percentage of stemming correctness as compared to RAO stemming approach. International Journal of Computer Science and Network Security (IJCSNS) 2009-02 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/17738/1/Rules%20frequency%20order%20stemmer%20for%20Malay%20language.pdf Abdullah, Muhammad Taufik and Ahmad, Fatimah and Mahmod, Ramlan and Tengku Sembok, Tengku Mohd (2009) Rules frequency order stemmer for Malay language. International Journal of Computer Science and Network Security, 9 (2). pp. 433-438. ISSN 1738-7906 http://ijcsns.org/ English |
title | Rules frequency order stemmer for Malay language. |
url | http://psasir.upm.edu.my/id/eprint/17738/1/Rules%20frequency%20order%20stemmer%20for%20Malay%20language.pdf |