Improved Reinforcement-Based Profile Learning For Document Filtering

Today the amount of accessible information is overwhelming. A personalized information filtering system must be able to tailor to current interests of the user and to adapt as they change over time. This system has to monitor a stream of incoming documents to learn the user’s information requirement...

Full description

Bibliographic Details
Main Author:	Mohammed Almurtadha, Yahya
Format:	Thesis
Language:	English English
Published:	2007
Subjects:	Information filtering systems. Reinforcement learning.
Online Access:	http://psasir.upm.edu.my/id/eprint/5211/1/FSKTM_2007_13.pdf

_version_	1796966917791023104
author	Mohammed Almurtadha, Yahya
author_facet	Mohammed Almurtadha, Yahya
author_sort	Mohammed Almurtadha, Yahya
collection	UPM
description	Today the amount of accessible information is overwhelming. A personalized information filtering system must be able to tailor to current interests of the user and to adapt as they change over time. This system has to monitor a stream of incoming documents to learn the user’s information requirements, which is the user profile. The research has proposed a content-based personal information system learns the user’s preferences by analyzing the document contents and building a user profile. This system is called RePLS; an agent-based Reinforcement Profile Learning System with adaptive information filtering. The research focuses on an improved terms weighting to measure the importance of the terms represent each profile called “purity term weighting”. The top selected terms are then used to filter the incoming documents to the learned user profiles. The agent approach is used because of its autonomous and adaptive capabilities to perform the filtering. The proposed method was evaluated and compared with three Information Filtering methods, namely Rocchio, Okapi/BSS Basic Search System and Reinf, the incremental profile learning method. Based on the proposed method, a profile learning system is developed using Microsoft VC++ connected to Microsoft Access database through an ODBC. AFC kit is used to implement the proposed agents under RETSINA architecture. The experiments are carried out on the TREC 2002 Filtering Track dataset provided by the National Institute of Standards and Technology (NIST). This research has proven that RePLS is able to filter the stream of incoming documents according to the user interests (profiles) learned by the proposed Purity term weighting method. Based on the experiments results, Purity weighting shows better terms weighting and profile learning than the other methods. The outcome of a considerably good accuracy is mainly due to the right weighting of the profile’s terms during the learning phase. This research opens a wide range of future works to be considered, including the investigation of the dependency between the selected terms for each profile, investigating the quality of the method on different datasets, and finally, the possibility to apply the proposed method in other area like the recommendation systems.
first_indexed	2024-03-06T07:06:15Z
format	Thesis
id	upm.eprints-5211
institution	Universiti Putra Malaysia
language	English English
last_indexed	2024-03-06T07:06:15Z
publishDate	2007
record_format	dspace
spelling	upm.eprints-52112013-05-27T07:21:11Z http://psasir.upm.edu.my/id/eprint/5211/ Improved Reinforcement-Based Profile Learning For Document Filtering Mohammed Almurtadha, Yahya Today the amount of accessible information is overwhelming. A personalized information filtering system must be able to tailor to current interests of the user and to adapt as they change over time. This system has to monitor a stream of incoming documents to learn the user’s information requirements, which is the user profile. The research has proposed a content-based personal information system learns the user’s preferences by analyzing the document contents and building a user profile. This system is called RePLS; an agent-based Reinforcement Profile Learning System with adaptive information filtering. The research focuses on an improved terms weighting to measure the importance of the terms represent each profile called “purity term weighting”. The top selected terms are then used to filter the incoming documents to the learned user profiles. The agent approach is used because of its autonomous and adaptive capabilities to perform the filtering. The proposed method was evaluated and compared with three Information Filtering methods, namely Rocchio, Okapi/BSS Basic Search System and Reinf, the incremental profile learning method. Based on the proposed method, a profile learning system is developed using Microsoft VC++ connected to Microsoft Access database through an ODBC. AFC kit is used to implement the proposed agents under RETSINA architecture. The experiments are carried out on the TREC 2002 Filtering Track dataset provided by the National Institute of Standards and Technology (NIST). This research has proven that RePLS is able to filter the stream of incoming documents according to the user interests (profiles) learned by the proposed Purity term weighting method. Based on the experiments results, Purity weighting shows better terms weighting and profile learning than the other methods. The outcome of a considerably good accuracy is mainly due to the right weighting of the profile’s terms during the learning phase. This research opens a wide range of future works to be considered, including the investigation of the dependency between the selected terms for each profile, investigating the quality of the method on different datasets, and finally, the possibility to apply the proposed method in other area like the recommendation systems. 2007 Thesis NonPeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/5211/1/FSKTM_2007_13.pdf Mohammed Almurtadha, Yahya (2007) Improved Reinforcement-Based Profile Learning For Document Filtering. Masters thesis, Universiti Putra Malaysia. Information filtering systems. Reinforcement learning. English
spellingShingle	Information filtering systems. Reinforcement learning. Mohammed Almurtadha, Yahya Improved Reinforcement-Based Profile Learning For Document Filtering
title	Improved Reinforcement-Based Profile Learning For Document Filtering
title_full	Improved Reinforcement-Based Profile Learning For Document Filtering
title_fullStr	Improved Reinforcement-Based Profile Learning For Document Filtering
title_full_unstemmed	Improved Reinforcement-Based Profile Learning For Document Filtering
title_short	Improved Reinforcement-Based Profile Learning For Document Filtering
title_sort	improved reinforcement based profile learning for document filtering
topic	Information filtering systems. Reinforcement learning.
url	http://psasir.upm.edu.my/id/eprint/5211/1/FSKTM_2007_13.pdf
work_keys_str_mv	AT mohammedalmurtadhayahya improvedreinforcementbasedprofilelearningfordocumentfiltering

Improved Reinforcement-Based Profile Learning For Document Filtering

Similar Items