Automatic construction of generic stop words list for hausa text
Stop-words are words having the highest frequencies in a document without any significant information. They are characterized by having common relations within a cluster. They are the noise of the text that are evenly distributed over a document. Removal of stop words improve the performance and acc...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Institute of Advanced Engineering and Science
2022
|
Subjects: | |
Online Access: | http://eprints.utm.my/98676/1/RuhaidahSamsudin2022_AutomaticConstructionofGenericStop.pdf |