Replacing missing values using trustworthy data values from web data sources
In practice, collected data are usually incomplete and contain missing values. Existing approaches to managing missing values overlook the importance of trustworthy data values when replacing missing values. Because trusted, complete data is very important in data analysis, we proposed a framewor...
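Main Authors: | Izham Jaya, M.; Sidi, Fatimah; Mat Yusof, Sharmila; Affendey, Lilly Suriani; Ishak, Iskandar; Jabar, Marzanah A. |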
---|---|
Format: | Article |
Language: | English |
Published: | IOP Publishing 2017 |
Subjects: | QA75 Electronic computers. Computer science |
Online Access: | https://repo.uum.edu.my/id/eprint/25423/1/JPCS%20892%202017%201%2011.pdf |
_version_ | 1825805285117132800 |
---|---|
author | Izham Jaya, M.; Sidi, Fatimah; Mat Yusof, Sharmila; Affendey, Lilly Suriani; Ishak, Iskandar; Jabar, Marzanah A. |
author_sort | Izham Jaya, M. |
collection | UUM |
description | In practice, collected data are usually incomplete and contain missing values. Existing approaches to managing missing values overlook the importance of trustworthy data values when replacing missing values. Because trusted, complete data is very important in data analysis, we propose a framework for missing value replacement using trustworthy data values from web data sources. The proposed framework adopts an ontology to map data values from web data sources to the incomplete dataset. As data values from different web sources can conflict with each other, we propose a trust score measurement based on data accuracy and data reliability. The trust score is then used to select trustworthy data values from web data sources for missing value replacement. We successfully implemented the proposed framework using a financial dataset and present the findings in this paper. Our experiment shows that replacing missing values with trustworthy data values is important for solving the missing values problem, especially when the data conflict. |
first_indexed | 2024-07-04T06:29:46Z |
format | Article |
id | uum-25423 |
institution | Universiti Utara Malaysia |
language | English |
last_indexed | 2024-07-04T06:29:46Z |
publishDate | 2017 |
publisher | IOP Publishing |
record_format | eprints |
spelling | uum-25423 2019-01-15T23:56:20Z https://repo.uum.edu.my/id/eprint/25423/ QA75 Electronic computers. Computer science IOP Publishing 2017 Article PeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/25423/1/JPCS%20892%202017%201%2011.pdf Izham Jaya, M. and Sidi, Fatimah and Mat Yusof, Sharmila and Affendey, Lilly Suriani and Ishak, Iskandar and Jabar, Marzanah A. (2017) Replacing missing values using trustworthy data values from web data sources. Journal of Physics: Conference Series, 892. pp. 1-11. ISSN 1742-6588 http://doi.org/10.1088/1742-6596/892/1/012009 doi:10.1088/1742-6596/892/1/012009 |
title | Replacing missing values using trustworthy data values from web data sources |
topic | QA75 Electronic computers. Computer science |
url | https://repo.uum.edu.my/id/eprint/25423/1/JPCS%20892%202017%201%2011.pdf |
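
The abstract describes three steps: mapping web data values onto the incomplete dataset, scoring each web source by accuracy and reliability, and filling missing values from trusted sources. The record does not give the paper's formulas, so the following Python sketch is only an illustration under stated assumptions. The `FIELD_MAP` dictionary is a hypothetical stand-in for the ontology, accuracy is assumed to be the share of a source's values that agree with values already known in the dataset, reliability is assumed to be a given number in [0, 1], and the trust score is assumed to be a weighted average of the two. All names and sample figures are invented for the example.

```python
from dataclasses import dataclass

# Hypothetical stand-in for the ontology: source field name -> dataset attribute.
FIELD_MAP = {
    "rev": "revenue",
    "total_revenue": "revenue",
    "net_inc": "net_income",
    "netIncome": "net_income",
}


@dataclass
class Source:
    name: str
    reliability: float  # assumption: supplied externally, in [0, 1]
    records: dict       # {entity: {source_field: value}}


def normalise(record):
    """Rename source-specific fields to dataset attributes via FIELD_MAP."""
    return {FIELD_MAP[k]: v for k, v in record.items() if k in FIELD_MAP}


def accuracy(source, dataset, tol=0.01):
    """Assumed accuracy: share of source values that agree with known dataset values."""
    checked = matched = 0
    for entity, record in source.records.items():
        for attr, value in normalise(record).items():
            known = dataset.get(entity, {}).get(attr)
            if known is None:
                continue  # nothing to compare against
            checked += 1
            if abs(value - known) <= tol * abs(known):
                matched += 1
    return matched / checked if checked else 0.0


def trust_score(source, dataset, w_acc=0.5):
    """Assumed trust score: weighted average of accuracy and reliability."""
    return w_acc * accuracy(source, dataset) + (1 - w_acc) * source.reliability


def fill_missing(dataset, sources):
    """Return a copy of dataset with None values taken from the most trusted source."""
    ranked = sorted(sources, key=lambda s: trust_score(s, dataset), reverse=True)
    filled = {entity: dict(record) for entity, record in dataset.items()}
    for entity, record in filled.items():
        for attr, value in record.items():
            if value is not None:
                continue
            for source in ranked:
                candidate = normalise(source.records.get(entity, {})).get(attr)
                if candidate is not None:
                    record[attr] = candidate
                    break
    return filled


# Invented sample data: two web sources that disagree about ACME's net income.
dataset = {
    "ACME": {"revenue": 100.0, "net_income": None},
    "BETA": {"revenue": 200.0, "net_income": 20.0},
}
site_a = Source("site_a", 0.9, {
    "ACME": {"rev": 100.0, "net_inc": 12.0},
    "BETA": {"rev": 200.0, "net_inc": 20.0},
})
site_b = Source("site_b", 0.4, {
    "ACME": {"total_revenue": 150.0, "netIncome": 30.0},
    "BETA": {"total_revenue": 200.0, "netIncome": 25.0},
})
filled = fill_missing(dataset, [site_b, site_a])
# site_a scores higher, so ACME's missing net_income is filled with 12.0.
```

In this sketch the conflicting value from `site_b` is ignored because `site_a` agrees with every known value and has the higher assumed reliability. Other designs, such as a per-attribute trust score or a weighted vote across sources, would fit the abstract equally well.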