Replacing missing values using trustworthy data values from web data sources

In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a fram...

Full description

Bibliographic Details
Main Authors: Mohd Jaya, Mohd Izham, Sidi, Fatimah, Mat Yusof, Sharmila, Affendey, Lilly Suriani, Ishak, Iskandar, A. Jabar, Marzanah
Format: Article
Language:English
Published: Institute of Physics Publishing 2017
Online Access:http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf
_version_ 1796977676014059520
author Mohd Jaya, Mohd Izham
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
A. Jabar, Marzanah
author_facet Mohd Jaya, Mohd Izham
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
A. Jabar, Marzanah
author_sort Mohd Jaya, Mohd Izham
collection UPM
description In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framework of missing value replacement using trustworthy data values from web data sources. The proposed framework adopted ontology to map data values from web data sources to the incomplete dataset. As data from web is conflicting with each other, we proposed a trust score measurement based on data accuracy and data reliability. Trust score is then used to select trustworthy data values from web data sources for missing values replacement. We successfully implemented the proposed framework using financial dataset and presented the findings in this paper. From our experiment, we manage to show that replacing missing values with trustworthy data values is important especially in a case of conflicting data to solve missing values problem.
first_indexed 2024-03-06T09:43:34Z
format Article
id upm.eprints-62958
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T09:43:34Z
publishDate 2017
publisher Institute of Physics Publishing
record_format dspace
spelling upm.eprints-629582018-11-28T09:23:49Z http://psasir.upm.edu.my/id/eprint/62958/ Replacing missing values using trustworthy data values from web data sources Mohd Jaya, Mohd Izham Sidi, Fatimah Mat Yusof, Sharmila Affendey, Lilly Suriani Ishak, Iskandar A. Jabar, Marzanah In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framework of missing value replacement using trustworthy data values from web data sources. The proposed framework adopted ontology to map data values from web data sources to the incomplete dataset. As data from web is conflicting with each other, we proposed a trust score measurement based on data accuracy and data reliability. Trust score is then used to select trustworthy data values from web data sources for missing values replacement. We successfully implemented the proposed framework using financial dataset and presented the findings in this paper. From our experiment, we manage to show that replacing missing values with trustworthy data values is important especially in a case of conflicting data to solve missing values problem. Institute of Physics Publishing 2017 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf Mohd Jaya, Mohd Izham and Sidi, Fatimah and Mat Yusof, Sharmila and Affendey, Lilly Suriani and Ishak, Iskandar and A. Jabar, Marzanah (2017) Replacing missing values using trustworthy data values from web data sources. Journal of Physics: Conference Series, 892 (1). pp. 1-11. ISSN 1742-6588; ESSN: 1742-6596 http://iopscience.iop.org/article/10.1088/1742-6596/892/1/012009/pdf 10.1088/1742-6596/892/1/012009
spellingShingle Mohd Jaya, Mohd Izham
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
A. Jabar, Marzanah
Replacing missing values using trustworthy data values from web data sources
title Replacing missing values using trustworthy data values from web data sources
title_full Replacing missing values using trustworthy data values from web data sources
title_fullStr Replacing missing values using trustworthy data values from web data sources
title_full_unstemmed Replacing missing values using trustworthy data values from web data sources
title_short Replacing missing values using trustworthy data values from web data sources
title_sort replacing missing values using trustworthy data values from web data sources
url http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf
work_keys_str_mv AT mohdjayamohdizham replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT sidifatimah replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT matyusofsharmila replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT affendeylillysuriani replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT ishakiskandar replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT ajabarmarzanah replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources