Detecting Duplicate Entry in Email Field using Alliance Rules-based Algorithm

Email is undeniably significant in present-day business communication. Every day, a large volume of email is sent from organizations to clients and suppliers, from representatives to their managers, and from one colleague to another. In this way there is vast...

Bibliographic Details
Main Authors: Arif, Hanafi, Sulaiman, Harun, Enggari, Sofika, Rani, Larissa Navia
Format: Article
Language: English
Published: LPPM UPI YPTK 2016
Subjects: QA76 Computer software
Online Access: http://umpir.ump.edu.my/id/eprint/11101/1/Detecting%20Duplicate%20Entry%20in%20Email%20Field%20using%20Alliance%20Rules-based%20Algorithm.pdf
http://umpir.ump.edu.my/id/eprint/11101/7/fskkp-2016-arif-Detecting%20Duplicate%20Entry%20in%20Email%20Field1.pdf
Description: Email is undeniably significant in present-day business communication. Every day, a large volume of email is sent from organizations to clients and suppliers, from representatives to their managers, and from one colleague to another. As a result, data warehouses hold a vast amount of email data. Data cleaning is an activity performed on the data sets of a data warehouse to improve and maintain the quality and consistency of the data. This paper highlights the issues associated with dirty data and the detection of duplicates in the email column. The paper examines the strategy of data cleaning from a different point of view. It provides an algorithm for discovering errors and duplicate entries in the data sets of an existing data warehouse. The paper defines alliance rules, based on the concept of mathematical association rules, to determine the duplicate entries in the email column of data sets.
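The record does not spell out the paper's alliance rules, so the following is only a minimal sketch of the general task the abstract describes: flagging duplicate entries in an email column. It is not the authors' algorithm. The function names, the `email` column name, and the normalization rule (trim whitespace and lowercase the whole address) are assumptions made for illustration.

```python
from collections import defaultdict


def normalize_email(raw):
    """Trim whitespace and lowercase the address (an assumed normalization rule)."""
    return raw.strip().lower()


def find_duplicate_emails(rows, column="email"):
    """Map each normalized email that occurs more than once to its row indexes."""
    seen = defaultdict(list)
    for index, row in enumerate(rows):
        value = row.get(column)
        if value:  # skip rows where the email is missing or empty
            seen[normalize_email(value)].append(index)
    return {email: idxs for email, idxs in seen.items() if len(idxs) > 1}


# Hypothetical sample rows, not data from the paper.
rows = [
    {"name": "A", "email": "ann@example.com"},
    {"name": "B", "email": "Ann@Example.com "},
    {"name": "C", "email": "bob@example.com"},
]
duplicates = find_duplicate_emails(rows)
```

Here rows 0 and 1 are grouped together because they differ only in letter case and trailing whitespace. Lowercasing the part before the `@` is a simplification: mail systems may treat it as case-sensitive, so a stricter variant would lowercase only the domain.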
Institution: Universiti Malaysia Pahang
Record: http://umpir.ump.edu.my/id/eprint/11101/ (peer reviewed)
Citation: Arif, Hanafi and Sulaiman, Harun and Enggari, Sofika and Rani, Larissa Navia (2016) Detecting Duplicate Entry in Email Field using Alliance Rules-based Algorithm. Journal of Computer Science and Information Technology, 1 (1). pp. 71-81. (Published) http://jcsit.upiyptk.ac.id/index.php/jcsit/article/view/39