Pengekstrakan data berasaskan pendekatan ontologi: kes data jujukan hidrologi [Data extraction based on an ontology approach: the case of hydrological sequence data]

Information extraction is a process that extracts information from an existing source system and stores it in a database. Previous researchers focused on information extraction from HTML data using a wrapper approach. The drawback of this approach is its lack of resilience: a wrapper fails to function when the...


Bibliographic Details
Main Author:	Abd. Hamid, Ahmad Ghadaffi
Format:	Thesis
Language:	English
Published:	2005
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://eprints.utm.my/2943/1/AhmadGhadaffiAbdHamidMFC2005.pdf

author	Abd. Hamid, Ahmad Ghadaffi
collection	ePrints
description	Information extraction is a process that extracts information from an existing source system and stores it in a database. Previous researchers focused on information extraction from HTML data using a wrapper approach. The drawback of this approach is its lack of resilience: a wrapper fails to function when the structure of the file of interest changes. Ontology-based information extraction is an alternative solution to this problem. In this research, ontology-based information extraction is applied to hydrological data from Jabatan Pengairan dan Saliran (JPS) as the case study. The ontology-based information extraction system for the hydrology domain, known as ‘EkstrakPro’, is divided into three main processes: an ontology parser process, a keyword and sequence recognition process, and a data mapping process. ‘EkstrakPro’ takes two inputs: the hydrology data and the ontology extraction. An important feature of ‘EkstrakPro’ is its ontology extraction, in which a unit object is introduced to simplify ontology maintenance. The sequential recognition algorithm addresses the time-consuming task of extracting sequential data. Five types of hydrological data are used in the experiment. These data are divided into three categories: (i) original data taken from the gauging machine, (ii) altered data, and (iii) data of different sizes. Based on these categories, the resilience of the information extraction and the time taken were measured using a precise equation and O-notation. The results show that the ‘EkstrakPro’ prototype can correctly extract hydrology data of different structures using only one algorithm. Using the sequential recognition algorithm can also further reduce the time required to extract information. The result of the research proves that information extraction can be solved using an ontology approach.
first_indexed	2024-03-05T18:00:19Z
format	Thesis
id	utm.eprints-2943
institution	Universiti Teknologi Malaysia - ePrints
language	English
last_indexed	2024-03-05T18:00:19Z
publishDate	2005
record_format	dspace
spelling	utm.eprints-2943 2018-06-25T00:42:55Z http://eprints.utm.my/2943/ Pengekstrakan data berasaskan pendekatan ontologi: kes data jujukan hidrologi Abd. Hamid, Ahmad Ghadaffi QA75 Electronic computers. Computer science 2005-12 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/2943/1/AhmadGhadaffiAbdHamidMFC2005.pdf Abd. Hamid, Ahmad Ghadaffi (2005) Pengekstrakan data berasaskan pendekatan ontologi: kes data jujukan hidrologi. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information System.
title	Pengekstrakan data berasaskan pendekatan ontologi: kes data jujukan hidrologi
topic	QA75 Electronic computers. Computer science
url	http://eprints.utm.my/2943/1/AhmadGhadaffiAbdHamidMFC2005.pdf


Similar Items