Framework for Urdu News Headlines Classification

Automatic text classification has great significance in the field of text mining and plays a pivotal role in areas such as spam filtering, news classification, noise reduction etc. It is evident from the literature that there is ample of research conducted for classifying text documents e.g. Eng...

Full description

Bibliographic Details
Main Authors: Kashif AHMED, Mubashir ALI, Shehzad KHALID, Muhammad KAMRAN
Format: Article
Language:English
Published: Stefan cel Mare University of Suceava 2016-04-01
Series:Journal of Applied Computer Science & Mathematics
Subjects:
Online Access:http://jacsm.ro/view/?pid=21_2
Description
Summary:Automatic text classification has great significance in the field of text mining and plays a pivotal role in areas such as spam filtering, news classification, noise reduction etc. It is evident from the literature that there is ample of research conducted for classifying text documents e.g. English news classification, Persian text classification etc. but there is no copious amount of work related to short Urdu text or Urdu news headlines classification. Therefore, after examining various existing news classification methodologies we propose an SVM based framework in this paper for classification of Urdu news headlines. This approach classifies Urdu news based on headlines in their respective pre-defined categories by utilizing their feature vector’s maximum indexes. This proposed system is compared with existing state-of-the art techniques.
ISSN:2066-4273
2066-3129