An Efficient Method for Mining Closed Potential High-Utility Itemsets

High-utility itemset mining (HUIM) has become a key phase of the pattern mining process, which has wide applications, related to both quantities and profits of items. Many algorithms have been proposed to mine high-utility itemsets (HUIs). Since these algorithms often return a large number of discov...

Full description

Bibliographic Details
Main Authors: Bay Vo, Loan T. T. Nguyen, Nguyen Bui, Trinh D. D. Nguyen, Van-Nam Huynh, Tzung-Pei Hong
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8999562/
_version_ 1819276497705238528
author Bay Vo
Loan T. T. Nguyen
Nguyen Bui
Trinh D. D. Nguyen
Van-Nam Huynh
Tzung-Pei Hong
author_facet Bay Vo
Loan T. T. Nguyen
Nguyen Bui
Trinh D. D. Nguyen
Van-Nam Huynh
Tzung-Pei Hong
author_sort Bay Vo
collection DOAJ
description High-utility itemset mining (HUIM) has become a key phase of the pattern mining process, which has wide applications, related to both quantities and profits of items. Many algorithms have been proposed to mine high-utility itemsets (HUIs). Since these algorithms often return a large number of discovered patterns, a more compact and lossless representation has been proposed. The recently proposed closed high utility itemset mining (CHUIM) algorithms were designed to work with certain types of databases (e.g., those without probabilities). In fact, real-world databases might contain items or itemsets associated with probability values. To effectively mine frequent patterns from uncertain databases, several techniques have been developed, but there does not exist any method for mining CHUIs from this type of databases. This work presents a novel and efficient method without generating candidates, named CPHUI-List, to mine closed potential high-utility itemsets (CPHUIs) from uncertain databases. The proposed algorithm is DFS-based and utilizes the downward closure property of high transaction-weighted probabilistic mining to prune non-CPHUIs. It can be seen from the experiment evaluations that the proposed algorithm has better execution time and memory usage than the CHUI-Miner.
first_indexed 2024-12-23T23:41:10Z
format Article
id doaj.art-443ce42ab9d442c78b41e170434225e6
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-23T23:41:10Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-443ce42ab9d442c78b41e170434225e62022-12-21T17:25:39ZengIEEEIEEE Access2169-35362020-01-018318133182210.1109/ACCESS.2020.29741048999562An Efficient Method for Mining Closed Potential High-Utility ItemsetsBay Vo0https://orcid.org/0000-0002-2723-1138Loan T. T. Nguyen1https://orcid.org/0000-0001-6440-6462Nguyen Bui2Trinh D. D. Nguyen3Van-Nam Huynh4Tzung-Pei Hong5https://orcid.org/0000-0001-7305-6492Faculty of Information Technology, Ho Chi Minh City University of Technology (HUTECH), Ho Chi Minh City, VietnamSchool of Computer Science and Engineering, International University, Ho Chi Minh City, VietnamFaculty of Information Technology, Ho Chi Minh City University of Technology (HUTECH), Ho Chi Minh City, VietnamInstitute of Research and Development, Duy Tan University, Da Nang, VietnamSchool of Knowledge Science, Japan Advanced Institute of Science and Technology (JAIST), JapanDepartment of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, TaiwanHigh-utility itemset mining (HUIM) has become a key phase of the pattern mining process, which has wide applications, related to both quantities and profits of items. Many algorithms have been proposed to mine high-utility itemsets (HUIs). Since these algorithms often return a large number of discovered patterns, a more compact and lossless representation has been proposed. The recently proposed closed high utility itemset mining (CHUIM) algorithms were designed to work with certain types of databases (e.g., those without probabilities). In fact, real-world databases might contain items or itemsets associated with probability values. To effectively mine frequent patterns from uncertain databases, several techniques have been developed, but there does not exist any method for mining CHUIs from this type of databases. This work presents a novel and efficient method without generating candidates, named CPHUI-List, to mine closed potential high-utility itemsets (CPHUIs) from uncertain databases. The proposed algorithm is DFS-based and utilizes the downward closure property of high transaction-weighted probabilistic mining to prune non-CPHUIs. It can be seen from the experiment evaluations that the proposed algorithm has better execution time and memory usage than the CHUI-Miner.https://ieeexplore.ieee.org/document/8999562/Uncertain databasedata miningknowledge discoveryhigh-utility itemsetclosed high-utility itemset
spellingShingle Bay Vo
Loan T. T. Nguyen
Nguyen Bui
Trinh D. D. Nguyen
Van-Nam Huynh
Tzung-Pei Hong
An Efficient Method for Mining Closed Potential High-Utility Itemsets
IEEE Access
Uncertain database
data mining
knowledge discovery
high-utility itemset
closed high-utility itemset
title An Efficient Method for Mining Closed Potential High-Utility Itemsets
title_full An Efficient Method for Mining Closed Potential High-Utility Itemsets
title_fullStr An Efficient Method for Mining Closed Potential High-Utility Itemsets
title_full_unstemmed An Efficient Method for Mining Closed Potential High-Utility Itemsets
title_short An Efficient Method for Mining Closed Potential High-Utility Itemsets
title_sort efficient method for mining closed potential high utility itemsets
topic Uncertain database
data mining
knowledge discovery
high-utility itemset
closed high-utility itemset
url https://ieeexplore.ieee.org/document/8999562/
work_keys_str_mv AT bayvo anefficientmethodforminingclosedpotentialhighutilityitemsets
AT loanttnguyen anefficientmethodforminingclosedpotentialhighutilityitemsets
AT nguyenbui anefficientmethodforminingclosedpotentialhighutilityitemsets
AT trinhddnguyen anefficientmethodforminingclosedpotentialhighutilityitemsets
AT vannamhuynh anefficientmethodforminingclosedpotentialhighutilityitemsets
AT tzungpeihong anefficientmethodforminingclosedpotentialhighutilityitemsets
AT bayvo efficientmethodforminingclosedpotentialhighutilityitemsets
AT loanttnguyen efficientmethodforminingclosedpotentialhighutilityitemsets
AT nguyenbui efficientmethodforminingclosedpotentialhighutilityitemsets
AT trinhddnguyen efficientmethodforminingclosedpotentialhighutilityitemsets
AT vannamhuynh efficientmethodforminingclosedpotentialhighutilityitemsets
AT tzungpeihong efficientmethodforminingclosedpotentialhighutilityitemsets