Automated document preprocessing for text categorization

This study deals with the document preprocessing step of Text Categorization System (TCS) done by previous researcher. Previously, the document used for TCS was entered manually by key-in the abstract of the document into database.So it will burden if it involves a large volume of articles.In this s...

Full description

Bibliographic Details
Main Authors: Abd Rahman, Suraya, Sainin, Mohd Shamrie
Format: Conference or Workshop Item
Language:English
Published: 2006
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/9590/1/Sur.pdf
Description
Summary:This study deals with the document preprocessing step of Text Categorization System (TCS) done by previous researcher. Previously, the document used for TCS was entered manually by key-in the abstract of the document into database.So it will burden if it involves a large volume of articles.In this study, the extraction based approach was applied in order to automate the document preprocessing. One module was added into the prototype of text categorization system that is used to add document into database.