Automatic classification using concept knowledge of web documents

In order to classify web documents, we suggest a method using concept knowledge of category.In our study, the concept relations between keywords are extracted using hyperlink information and after the extracted keywords are classified into each category, these are used as an index.Then TFIDF for eac...

Full description

Bibliographic Details
Main Authors:	Choi, Sang-Ho, Park, Sa-Joon, Hwang, Su-Cheol, Kim, Ki-Tae
Format:	Conference or Workshop Item
Language:	English
Published:	2004
Subjects:	QA76 Computer software
Online Access:	https://repo.uum.edu.my/id/eprint/13843/1/KM112.pdf

_version_	1825803260192096256
author	Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae
author_facet	Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae
author_sort	Choi, Sang-Ho
collection	UUM
description	In order to classify web documents, we suggest a method using concept knowledge of category.In our study, the concept relations between keywords are extracted using hyperlink information and after the extracted keywords are classified into each category, these are used as an index.Then TFIDF for each category is extended to determine index weight value.The system is constructed for experimenting and estimating,which is consist of web robot, indexer, concept knowledge database for each category and the document classifier.Our system to be applied the extended TFIDF method shows an accuracy of 88% in automatic classifying of web documents.
first_indexed	2024-07-04T05:53:57Z
format	Conference or Workshop Item
id	uum-13843
institution	Universiti Utara Malaysia
language	English
last_indexed	2024-07-04T05:53:57Z
publishDate	2004
record_format	eprints
spelling	uum-138432015-04-13T08:56:36Z https://repo.uum.edu.my/id/eprint/13843/ Automatic classification using concept knowledge of web documents Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae QA76 Computer software In order to classify web documents, we suggest a method using concept knowledge of category.In our study, the concept relations between keywords are extracted using hyperlink information and after the extracted keywords are classified into each category, these are used as an index.Then TFIDF for each category is extended to determine index weight value.The system is constructed for experimenting and estimating,which is consist of web robot, indexer, concept knowledge database for each category and the document classifier.Our system to be applied the extended TFIDF method shows an accuracy of 88% in automatic classifying of web documents. 2004-02-14 Conference or Workshop Item PeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/13843/1/KM112.pdf Choi, Sang-Ho and Park, Sa-Joon and Hwang, Su-Cheol and Kim, Ki-Tae (2004) Automatic classification using concept knowledge of web documents. In: Knowledge Management International Conference and Exhibition 2004 (KMICE 2004), 14-15 February 2004, Evergreen Laurel Hotel, Penang. http://www.kmice.cms.net.my
spellingShingle	QA76 Computer software Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae Automatic classification using concept knowledge of web documents
title	Automatic classification using concept knowledge of web documents
title_full	Automatic classification using concept knowledge of web documents
title_fullStr	Automatic classification using concept knowledge of web documents
title_full_unstemmed	Automatic classification using concept knowledge of web documents
title_short	Automatic classification using concept knowledge of web documents
title_sort	automatic classification using concept knowledge of web documents
topic	QA76 Computer software
url	https://repo.uum.edu.my/id/eprint/13843/1/KM112.pdf
work_keys_str_mv	AT choisangho automaticclassificationusingconceptknowledgeofwebdocuments AT parksajoon automaticclassificationusingconceptknowledgeofwebdocuments AT hwangsucheol automaticclassificationusingconceptknowledgeofwebdocuments AT kimkitae automaticclassificationusingconceptknowledgeofwebdocuments

Automatic classification using concept knowledge of web documents

Similar Items