A review of data quality for big data

We constantly produce lots of data everyday via social media, public transport, global positioning system (GPS)), satellite and healthcare applications etc. Now a day with a combination of clouds, endless networks of servers and powerful algorithms businesses can analyze over a million pieces of dat...

Full description

Bibliographic Details
Main Authors: Zakaria, Huraizah, Ansaf, Mohammad, Hassan, Noor Hafizah, Maarop, Nurazean, Samy, Ganthan Narayana
Format: Article
Published: OIJI UTM 2018
Subjects:
_version_ 1796863503414329344
author Zakaria, Huraizah
Ansaf, Mohammad
Hassan, Noor Hafizah
Maarop, Nurazean
Samy, Ganthan Narayana
author_facet Zakaria, Huraizah
Ansaf, Mohammad
Hassan, Noor Hafizah
Maarop, Nurazean
Samy, Ganthan Narayana
author_sort Zakaria, Huraizah
collection ePrints
description We constantly produce lots of data everyday via social media, public transport, global positioning system (GPS)), satellite and healthcare applications etc. Now a day with a combination of clouds, endless networks of servers and powerful algorithms businesses can analyze over a million pieces of data in a minute. These large volumes of data beyond the range of Exabyte (EB) is called Big Data. Data producing volumes exceeds the limit of current storage systems. Considering the large amount of volume and the speed in which, big data is produced it is also possible that the challenges are bigger as well especially in its quality. Therefore, the objective of this paper to review of data quality of big data. A review of related literature has been conducted for this research. In this paper, we will be discussing about big data, its characteristics and the issues and challenges that data scientist should be looking at before analyzing the data. Issues and challenges of technical and non-technical dimensions are summarized based on literature conducted.
first_indexed 2024-03-05T20:27:44Z
format Article
id utm.eprints-82027
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-03-05T20:27:44Z
publishDate 2018
publisher OIJI UTM
record_format dspace
spelling utm.eprints-820272019-10-16T10:08:06Z http://eprints.utm.my/82027/ A review of data quality for big data Zakaria, Huraizah Ansaf, Mohammad Hassan, Noor Hafizah Maarop, Nurazean Samy, Ganthan Narayana QA75 Electronic computers. Computer science We constantly produce lots of data everyday via social media, public transport, global positioning system (GPS)), satellite and healthcare applications etc. Now a day with a combination of clouds, endless networks of servers and powerful algorithms businesses can analyze over a million pieces of data in a minute. These large volumes of data beyond the range of Exabyte (EB) is called Big Data. Data producing volumes exceeds the limit of current storage systems. Considering the large amount of volume and the speed in which, big data is produced it is also possible that the challenges are bigger as well especially in its quality. Therefore, the objective of this paper to review of data quality of big data. A review of related literature has been conducted for this research. In this paper, we will be discussing about big data, its characteristics and the issues and challenges that data scientist should be looking at before analyzing the data. Issues and challenges of technical and non-technical dimensions are summarized based on literature conducted. OIJI UTM 2018 Article PeerReviewed Zakaria, Huraizah and Ansaf, Mohammad and Hassan, Noor Hafizah and Maarop, Nurazean and Samy, Ganthan Narayana (2018) A review of data quality for big data. Open International Journal Of Informatics(OIJI), 6 (1). pp. 26-33. ISSN 2289-2370 http://apps.razak.utm.my/ojs/index.php/oiji/article/view/40
spellingShingle QA75 Electronic computers. Computer science
Zakaria, Huraizah
Ansaf, Mohammad
Hassan, Noor Hafizah
Maarop, Nurazean
Samy, Ganthan Narayana
A review of data quality for big data
title A review of data quality for big data
title_full A review of data quality for big data
title_fullStr A review of data quality for big data
title_full_unstemmed A review of data quality for big data
title_short A review of data quality for big data
title_sort review of data quality for big data
topic QA75 Electronic computers. Computer science
work_keys_str_mv AT zakariahuraizah areviewofdataqualityforbigdata
AT ansafmohammad areviewofdataqualityforbigdata
AT hassannoorhafizah areviewofdataqualityforbigdata
AT maaropnurazean areviewofdataqualityforbigdata
AT samyganthannarayana areviewofdataqualityforbigdata
AT zakariahuraizah reviewofdataqualityforbigdata
AT ansafmohammad reviewofdataqualityforbigdata
AT hassannoorhafizah reviewofdataqualityforbigdata
AT maaropnurazean reviewofdataqualityforbigdata
AT samyganthannarayana reviewofdataqualityforbigdata