A review of data quality for big data
We constantly produce large amounts of data every day via social media, public transport, global positioning system (GPS), satellite and healthcare applications, etc. Nowadays, with a combination of clouds, endless networks of servers and powerful algorithms, businesses can analyze over a million pieces of dat...
Main Authors: | Zakaria, Huraizah; Ansaf, Mohammad; Hassan, Noor Hafizah; Maarop, Nurazean; Samy, Ganthan Narayana |
---|---|
Format: | Article |
Published: | OIJI UTM, 2018 |
Subjects: | QA75 Electronic computers. Computer science |
_version_ | 1796863503414329344 |
---|---|
author | Zakaria, Huraizah; Ansaf, Mohammad; Hassan, Noor Hafizah; Maarop, Nurazean; Samy, Ganthan Narayana |
author_facet | Zakaria, Huraizah; Ansaf, Mohammad; Hassan, Noor Hafizah; Maarop, Nurazean; Samy, Ganthan Narayana |
author_sort | Zakaria, Huraizah |
collection | ePrints |
description | We constantly produce large amounts of data every day via social media, public transport, global positioning system (GPS), satellite and healthcare applications, etc. Nowadays, with a combination of clouds, endless networks of servers and powerful algorithms, businesses can analyze over a million pieces of data in a minute. These large volumes of data, beyond the range of an exabyte (EB), are called Big Data. The volume of data being produced exceeds the limits of current storage systems. Considering the large volume of big data and the speed at which it is produced, the challenges may be bigger as well, especially regarding its quality. Therefore, the objective of this paper is to review the data quality of big data. A review of related literature has been conducted for this research. In this paper, we discuss big data, its characteristics, and the issues and challenges that data scientists should consider before analyzing the data. Issues and challenges in technical and non-technical dimensions are summarized based on the literature reviewed. |
first_indexed | 2024-03-05T20:27:44Z |
format | Article |
id | utm.eprints-82027 |
institution | Universiti Teknologi Malaysia - ePrints |
last_indexed | 2024-03-05T20:27:44Z |
publishDate | 2018 |
publisher | OIJI UTM |
record_format | dspace |
spelling | utm.eprints-82027 2019-10-16T10:08:06Z http://eprints.utm.my/82027/ A review of data quality for big data Zakaria, Huraizah; Ansaf, Mohammad; Hassan, Noor Hafizah; Maarop, Nurazean; Samy, Ganthan Narayana QA75 Electronic computers. Computer science We constantly produce large amounts of data every day via social media, public transport, global positioning system (GPS), satellite and healthcare applications, etc. Nowadays, with a combination of clouds, endless networks of servers and powerful algorithms, businesses can analyze over a million pieces of data in a minute. These large volumes of data, beyond the range of an exabyte (EB), are called Big Data. The volume of data being produced exceeds the limits of current storage systems. Considering the large volume of big data and the speed at which it is produced, the challenges may be bigger as well, especially regarding its quality. Therefore, the objective of this paper is to review the data quality of big data. A review of related literature has been conducted for this research. In this paper, we discuss big data, its characteristics, and the issues and challenges that data scientists should consider before analyzing the data. Issues and challenges in technical and non-technical dimensions are summarized based on the literature reviewed. OIJI UTM 2018 Article PeerReviewed Zakaria, Huraizah and Ansaf, Mohammad and Hassan, Noor Hafizah and Maarop, Nurazean and Samy, Ganthan Narayana (2018) A review of data quality for big data. Open International Journal Of Informatics (OIJI), 6 (1). pp. 26-33. ISSN 2289-2370 http://apps.razak.utm.my/ojs/index.php/oiji/article/view/40 |
spellingShingle | QA75 Electronic computers. Computer science Zakaria, Huraizah Ansaf, Mohammad Hassan, Noor Hafizah Maarop, Nurazean Samy, Ganthan Narayana A review of data quality for big data |
title | A review of data quality for big data |
title_full | A review of data quality for big data |
title_fullStr | A review of data quality for big data |
title_full_unstemmed | A review of data quality for big data |
title_short | A review of data quality for big data |
title_sort | review of data quality for big data |
topic | QA75 Electronic computers. Computer science |
work_keys_str_mv | AT zakariahuraizah areviewofdataqualityforbigdata AT ansafmohammad areviewofdataqualityforbigdata AT hassannoorhafizah areviewofdataqualityforbigdata AT maaropnurazean areviewofdataqualityforbigdata AT samyganthannarayana areviewofdataqualityforbigdata AT zakariahuraizah reviewofdataqualityforbigdata AT ansafmohammad reviewofdataqualityforbigdata AT hassannoorhafizah reviewofdataqualityforbigdata AT maaropnurazean reviewofdataqualityforbigdata AT samyganthannarayana reviewofdataqualityforbigdata |