Replacing missing values using trustworthy data values from web data sources

In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framewor...

Full description

Bibliographic Details
Main Authors: Izham Jaya, M., Sidi, Fatimah, Mat Yusof, Sharmila, Affendey, Lilly Suriani, Ishak, Iskandar, Jabar, Marzanah A.
Format: Article
Language:English
Published: IOP Publishing 2017
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/25423/1/JPCS%20892%202017%201%2011.pdf
_version_ 1825805285117132800
author Izham Jaya, M.
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
Jabar, Marzanah A.
author_facet Izham Jaya, M.
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
Jabar, Marzanah A.
author_sort Izham Jaya, M.
collection UUM
description In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framework of missing value replacement using trustworthy data values from web data sources. The proposed framework adopted ontology to map data values from web data sources to the incomplete dataset. As data from web is conflicting with each other, we proposed a trust score measurement based on data accuracy and data reliability. Trust score is then used to select trustworthy data values from web data sources for missing values replacement. We successfully implemented the proposed framework using financial dataset and presented the findings in this paper. From our experiment, we manage to show that replacing missing values with trustworthy data values is important especially in a case of conflicting data to solve missing values problem.
first_indexed 2024-07-04T06:29:46Z
format Article
id uum-25423
institution Universiti Utara Malaysia
language English
last_indexed 2024-07-04T06:29:46Z
publishDate 2017
publisher IOP Publishing
record_format eprints
spelling uum-254232019-01-15T23:56:20Z https://repo.uum.edu.my/id/eprint/25423/ Replacing missing values using trustworthy data values from web data sources Izham Jaya, M. Sidi, Fatimah Mat Yusof, Sharmila Affendey, Lilly Suriani Ishak, Iskandar Jabar, Marzanah A. QA75 Electronic computers. Computer science In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framework of missing value replacement using trustworthy data values from web data sources. The proposed framework adopted ontology to map data values from web data sources to the incomplete dataset. As data from web is conflicting with each other, we proposed a trust score measurement based on data accuracy and data reliability. Trust score is then used to select trustworthy data values from web data sources for missing values replacement. We successfully implemented the proposed framework using financial dataset and presented the findings in this paper. From our experiment, we manage to show that replacing missing values with trustworthy data values is important especially in a case of conflicting data to solve missing values problem. IOP Publishing 2017 Article PeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/25423/1/JPCS%20892%202017%201%2011.pdf Izham Jaya, M. and Sidi, Fatimah and Mat Yusof, Sharmila and Affendey, Lilly Suriani and Ishak, Iskandar and Jabar, Marzanah A. (2017) Replacing missing values using trustworthy data values from web data sources. Journal of Physics: Conference Series, 892. pp. 1-11. ISSN 1742-6588 http://doi.org/10.1088/1742-6596/892/1/012009 doi:10.1088/1742-6596/892/1/012009 doi:10.1088/1742-6596/892/1/012009
spellingShingle QA75 Electronic computers. Computer science
Izham Jaya, M.
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
Jabar, Marzanah A.
Replacing missing values using trustworthy data values from web data sources
title Replacing missing values using trustworthy data values from web data sources
title_full Replacing missing values using trustworthy data values from web data sources
title_fullStr Replacing missing values using trustworthy data values from web data sources
title_full_unstemmed Replacing missing values using trustworthy data values from web data sources
title_short Replacing missing values using trustworthy data values from web data sources
title_sort replacing missing values using trustworthy data values from web data sources
topic QA75 Electronic computers. Computer science
url https://repo.uum.edu.my/id/eprint/25423/1/JPCS%20892%202017%201%2011.pdf
work_keys_str_mv AT izhamjayam replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT sidifatimah replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT matyusofsharmila replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT affendeylillysuriani replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT ishakiskandar replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources
AT jabarmarzanaha replacingmissingvaluesusingtrustworthydatavaluesfromwebdatasources