New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework

The Map-Reduce (MR) framework has become a popular framework for developing new parallel algorithms for Big Data. Efficient algorithms for data mining of big data and distributed databases has become an important problem. In this paper we focus on algorithms producing association rules and frequent...

Full description

Bibliographic Details
Main Authors: Yaron Gonen, Ehud Gudes, Kirill Kandalov
Format: Article
Language:English
Published: MDPI AG 2018-11-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/11/12/194
_version_ 1819160544974733312
author Yaron Gonen
Ehud Gudes
Kirill Kandalov
author_facet Yaron Gonen
Ehud Gudes
Kirill Kandalov
author_sort Yaron Gonen
collection DOAJ
description The Map-Reduce (MR) framework has become a popular framework for developing new parallel algorithms for Big Data. Efficient algorithms for data mining of big data and distributed databases has become an important problem. In this paper we focus on algorithms producing association rules and frequent itemsets. After reviewing the most recent algorithms that perform this task within the MR framework, we present two new algorithms: one algorithm for producing closed frequent itemsets, and the second one for producing frequent itemsets when the database is updated and new data is added to the old database. Both algorithms include novel optimizations which are suitable to the MR framework, as well as to other parallel architectures. A detailed experimental evaluation shows the effectiveness and advantages of the algorithms over existing methods when it comes to large distributed databases.
first_indexed 2024-12-22T16:58:09Z
format Article
id doaj.art-c36fceb640174484bd1c372b63ba8a05
institution Directory Open Access Journal
issn 1999-4893
language English
last_indexed 2024-12-22T16:58:09Z
publishDate 2018-11-01
publisher MDPI AG
record_format Article
series Algorithms
spelling doaj.art-c36fceb640174484bd1c372b63ba8a052022-12-21T18:19:24ZengMDPI AGAlgorithms1999-48932018-11-01111219410.3390/a11120194a11120194New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce FrameworkYaron Gonen0Ehud Gudes1Kirill Kandalov2Department of Computer Science, Ben-Gurion University, Beer-Sheva 8410501, IsraelDepartment of Computer Science, Ben-Gurion University, Beer-Sheva 8410501, IsraelDepartment of Computer Science, Open University, Ra’anana 4353701, IsraelThe Map-Reduce (MR) framework has become a popular framework for developing new parallel algorithms for Big Data. Efficient algorithms for data mining of big data and distributed databases has become an important problem. In this paper we focus on algorithms producing association rules and frequent itemsets. After reviewing the most recent algorithms that perform this task within the MR framework, we present two new algorithms: one algorithm for producing closed frequent itemsets, and the second one for producing frequent itemsets when the database is updated and new data is added to the old database. Both algorithms include novel optimizations which are suitable to the MR framework, as well as to other parallel architectures. A detailed experimental evaluation shows the effectiveness and advantages of the algorithms over existing methods when it comes to large distributed databases.https://www.mdpi.com/1999-4893/11/12/194apriorimap reducebig datafrequent itemsetsclosed itemsetsincremental computation
spellingShingle Yaron Gonen
Ehud Gudes
Kirill Kandalov
New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework
Algorithms
apriori
map reduce
big data
frequent itemsets
closed itemsets
incremental computation
title New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework
title_full New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework
title_fullStr New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework
title_full_unstemmed New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework
title_short New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework
title_sort new and efficient algorithms for producing frequent itemsets with the map reduce framework
topic apriori
map reduce
big data
frequent itemsets
closed itemsets
incremental computation
url https://www.mdpi.com/1999-4893/11/12/194
work_keys_str_mv AT yarongonen newandefficientalgorithmsforproducingfrequentitemsetswiththemapreduceframework
AT ehudgudes newandefficientalgorithmsforproducingfrequentitemsetswiththemapreduceframework
AT kirillkandalov newandefficientalgorithmsforproducingfrequentitemsetswiththemapreduceframework