Framework for Urdu News Headlines Classification
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: | Stefan cel Mare University of Suceava, 2016-04-01 |
Series: | Journal of Applied Computer Science & Mathematics |
Subjects: | |
Online Access: | http://jacsm.ro/view/?pid=21_2 |
Summary: | Automatic text classification is of great significance in the field of text mining and plays a pivotal role in areas such as spam filtering, news classification, and noise reduction. The literature shows ample research on classifying text documents (e.g., English news classification and Persian text classification), but little work on short Urdu text or Urdu news headline classification. Therefore, after examining various existing news classification methodologies, we propose an SVM-based framework in this paper for the classification of Urdu news headlines. This approach classifies Urdu news, based on headlines, into their respective predefined categories by using the maximum indexes of their feature vectors. The proposed system is compared with existing state-of-the-art techniques. |
ISSN: | 2066-4273 2066-3129 |
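
The Summary describes an SVM-based classifier that assigns Urdu headlines to predefined categories. The record gives no implementation details, so the following is only a generic sketch of that idea, not the authors' framework: it assumes whitespace tokenization, bag-of-words count vectors, and a bias-free linear SVM trained one-vs-rest with sub-gradient descent on the hinge loss. The toy headlines, the two category names, and every function name are hypothetical, and the "maximum indexes" step mentioned in the Summary is not reproduced here.

```python
from collections import Counter

# Toy, hypothetical training data (not taken from the paper).
HEADLINES = [
    "کرکٹ میچ ٹیم",
    "حکومت بجٹ وزیر",
    "ٹیم نے میچ جیتا",
    "وزیر نے بجٹ پیش کیا",
]
LABELS = ["sports", "politics", "sports", "politics"]


def build_vocab(headlines):
    """Map each whitespace-separated token to a feature index."""
    vocab = {}
    for headline in headlines:
        for token in headline.split():
            vocab.setdefault(token, len(vocab))
    return vocab


def vectorize(headline, vocab):
    """Bag-of-words count vector; tokens outside the vocabulary are ignored."""
    vec = [0.0] * len(vocab)
    for token, count in Counter(headline.split()).items():
        if token in vocab:
            vec[vocab[token]] = float(count)
    return vec


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def train_binary_svm(X, y, epochs=100, lr=0.1, lam=0.01):
    """Linear SVM without a bias term: hinge loss plus L2 penalty,
    minimized by plain sub-gradient descent. Labels y are +1.0 or -1.0."""
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            violated = yi * dot(w, xi) < 1.0      # inside the margin?
            w = [(1.0 - lr * lam) * wj for wj in w]  # L2 shrinkage
            if violated:
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
    return w


def train_one_vs_rest(headlines, labels):
    """Train one binary SVM per category (that category vs. all others)."""
    vocab = build_vocab(headlines)
    X = [vectorize(h, vocab) for h in headlines]
    weights = {}
    for category in sorted(set(labels)):
        y = [1.0 if lab == category else -1.0 for lab in labels]
        weights[category] = train_binary_svm(X, y)
    return vocab, weights


def predict(model, headline):
    """Return the category whose SVM gives the highest decision score."""
    vocab, weights = model
    x = vectorize(headline, vocab)
    return max(weights, key=lambda c: dot(weights[c], x))
```

Calling `train_one_vs_rest(HEADLINES, LABELS)` returns a model that `predict` can apply to an unseen headline. A real system would need Urdu-specific tokenization, stop-word handling, and a much larger labelled corpus, and would more likely rely on an established SVM library than on this hand-written trainer.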