Multi-level dataset decomposition for parallel frequent itemset mining on a cluster of personal computers

Chun-Hong Huang, Yungho Leu

doi:10.1007/s10586-017-1609-6

Abstract

Frequent itemset mining is time-consuming for large datasets, and many parallel frequent itemset mining algorithms have been proposed to speed up the mining process. This paper presents a parallel frequent itemset mining algorithm for a cluster of personal computers. To facilitate parallel mining, we use a prefix-path-based method to decompose a transactional dataset into its frequent 1-itemset sub-datasets. We call the parallel algorithm based on this decomposition, as run on our PC cluster platform, the single-level parallel frequent itemset mining (SLPFIM) algorithm. To mitigate the bottleneck caused by time-consuming 1-itemset sub-datasets, we propose a multi-level parallel frequent itemset mining (MLPFIM) algorithm that further decomposes those sub-datasets into their corresponding sub-sub-datasets. The finer granularity of the sub-sub-datasets improves load balancing in parallel frequent itemset mining. The experimental results showed that SLPFIM achieved a maximum speedup of 11.9x over the non-parallel execution of the FP-Growth algorithm, while MLPFIM achieved a maximum speedup of 23.1x over the same baseline. The results also showed that MLPFIM achieved a maximum speedup of 2.14x over SLPFIM.
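The decomposition idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that the sub-dataset of a frequent item is the set of prefix paths that precede the item in frequency-ordered transactions, as in the conditional pattern base of FP-Growth. The function name `decompose`, the tie-breaking rule, and the toy transactions are hypothetical.

```python
from collections import Counter, defaultdict


def decompose(transactions, min_support):
    """Split transactions into one prefix-path sub-dataset per frequent item.

    Assumption: items in each transaction are ordered by descending support,
    and the sub-dataset of an item holds the items that precede it.
    """
    # Support of each item = number of transactions that contain it.
    counts = Counter(item for t in transactions for item in set(t))
    frequent = [i for i, c in counts.items() if c >= min_support]
    # Rank by descending support; ties are broken by item name (assumption).
    rank = {i: r for r, i in enumerate(sorted(frequent, key=lambda i: (-counts[i], i)))}

    subs = defaultdict(list)
    for t in transactions:
        ordered = sorted((i for i in set(t) if i in rank), key=rank.__getitem__)
        for pos, item in enumerate(ordered):
            # Empty prefixes are kept so len(subs[item]) equals the item's support.
            subs[item].append(ordered[:pos])
    return dict(subs)


# Toy data (hypothetical): five transactions over items a, b, c.
transactions = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a"]]

# Level 1: one sub-dataset per frequent 1-itemset; each could go to a worker.
level1 = decompose(transactions, min_support=2)

# Level 2: a sub-dataset judged too expensive is decomposed again, giving
# sub-sub-datasets for the 2-item prefixes {c, a} and {c, b}.
level2_c = decompose(level1["c"], min_support=2)
```

Because each sub-dataset can be mined independently of the others, the sub-datasets can be treated as units of work for cluster nodes. Applying the same function to a heavy sub-dataset produces smaller units, which is the intuition behind the multi-level variant. How the paper decides which sub-datasets are time-consuming is not stated in the abstract and is not modeled here.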

Journal: Cluster Computing
Publication Date: Jan 3, 2018
Citations: 3

Similar Papers

Accelerating Parallel Frequent Itemset Mining on Graphics Processors with Sorting
Yuan-Shao Huang ... Li-Wei Zhou
01 Jan 2013

D2P-Apriori: A deep parallel frequent itemset mining algorithm with dynamic queue
Yuxin Wang ... Tongkun Xu
01 Mar 2018

FiDoop-DP: Data Partitioning in Frequent Itemset Mining on Hadoop Clusters
Yaling Xun ... Xujun Zhao
IEEE Transactions on Parallel and Distributed Systems | VOL. 28
01 Jan 2017

PFIMD: a parallel MapReduce-based algorithm for frequent itemset mining
Mao Yimin ... Deborah Simon Mwakapesa
Multimedia Systems | VOL. 27
13 Mar 2021
