Applying learning to filter text

Text filtering has been a successful application especially in e-mail filtering. The use of probabilistic approaches such as naïve Bayes algorithm is the effective algorithms currently known for learning to filter or classify text document.Naïve Bayes algorithm is one of the algorithms in Machi...

Full description

Bibliographic Details
Main Author: Sainin, Mohd Shamrie
Format: Conference or Workshop Item
Language:English
Published: 2005
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/12431/1/rie%20seit3_ver5.pdf
_version_ 1825803006889689088
author Sainin, Mohd Shamrie
author_facet Sainin, Mohd Shamrie
author_sort Sainin, Mohd Shamrie
collection UUM
description Text filtering has been a successful application especially in e-mail filtering. The use of probabilistic approaches such as naïve Bayes algorithm is the effective algorithms currently known for learning to filter or classify text document.Naïve Bayes algorithm is one of the algorithms in Machine Learning that manipulates probability estimation or reasoning about the observed data.The growing of bulk e-mail or known as spam e-mail becomes a threat to users’ privacy and network load and in the case of e -mail filtering,naïve Bayes classifier can be trained to automatically detect spam messages.Similar to the e-mail, forum application may be misused by the user to send bad messages and in some extent may offence other readers.Forum filtering may be less important compared to e-mail spam filtering; however there is a possibility of using naïve Bayes to learn the messages and automatically detect bad messages.Most of the forum application found in the web is applying keyword based text filtering which scan the words and change the detected words into certain representation.Instead of defining a set of keywords to filter the forum messages, this paper will explains the experiment in applying a learning to filter text especially in the educational and anonymous forum message, where there is no user registration required to submit messages.
first_indexed 2024-07-04T05:49:53Z
format Conference or Workshop Item
id uum-12431
institution Universiti Utara Malaysia
language English
last_indexed 2024-07-04T05:49:53Z
publishDate 2005
record_format eprints
spelling uum-124312014-10-26T08:22:32Z https://repo.uum.edu.my/id/eprint/12431/ Applying learning to filter text Sainin, Mohd Shamrie QA76 Computer software Text filtering has been a successful application especially in e-mail filtering. The use of probabilistic approaches such as naïve Bayes algorithm is the effective algorithms currently known for learning to filter or classify text document.Naïve Bayes algorithm is one of the algorithms in Machine Learning that manipulates probability estimation or reasoning about the observed data.The growing of bulk e-mail or known as spam e-mail becomes a threat to users’ privacy and network load and in the case of e -mail filtering,naïve Bayes classifier can be trained to automatically detect spam messages.Similar to the e-mail, forum application may be misused by the user to send bad messages and in some extent may offence other readers.Forum filtering may be less important compared to e-mail spam filtering; however there is a possibility of using naïve Bayes to learn the messages and automatically detect bad messages.Most of the forum application found in the web is applying keyword based text filtering which scan the words and change the detected words into certain representation.Instead of defining a set of keywords to filter the forum messages, this paper will explains the experiment in applying a learning to filter text especially in the educational and anonymous forum message, where there is no user registration required to submit messages. 2005 Conference or Workshop Item NonPeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/12431/1/rie%20seit3_ver5.pdf Sainin, Mohd Shamrie (2005) Applying learning to filter text. In: Seminar Kebangsaan Sosio-ekonomi dan IT (SEIT), 20-21 August 2005, Putra Brasmana Hotel, Perlis. (Unpublished)
spellingShingle QA76 Computer software
Sainin, Mohd Shamrie
Applying learning to filter text
title Applying learning to filter text
title_full Applying learning to filter text
title_fullStr Applying learning to filter text
title_full_unstemmed Applying learning to filter text
title_short Applying learning to filter text
title_sort applying learning to filter text
topic QA76 Computer software
url https://repo.uum.edu.my/id/eprint/12431/1/rie%20seit3_ver5.pdf
work_keys_str_mv AT saininmohdshamrie applyinglearningtofiltertext