A Novel Parallel Algorithm for Frequent Itemsets Mining in Large Transactional Databases

Huan Phan, Bac Le

doi:10.1007/978-3-319-95786-9_21

Abstract

With the explosion of data, mining large transactional databases has become increasingly important. Among the many data mining techniques, association rule mining is one of the most important and well-researched. Frequent itemset mining is a fundamental but time-consuming step in association rule mining. Most algorithms in the literature search for frequent itemsets over the items that meet a minimum support threshold (minsup), and their results are not reused for subsequent mining. To reduce execution time, some parallel algorithms for mining frequent itemsets have been proposed; however, these merely parallelize the Apriori and FP-Growth algorithms. To address this problem, several parallel NPA-FI algorithms are proposed as a new approach that quickly detects frequent itemsets in large transactional databases using an array of co-occurrences and occurrences of a kernel item in at least one transaction. The parallel NPA-FI algorithms can readily be used on distributed computing platforms such as Hadoop and Spark. Finally, the experimental results show that the proposed algorithms perform better than other existing algorithms.
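To make the terms in the abstract concrete, the following is a minimal Python sketch, not the authors' NPA-FI implementation. It shows support counting against a minsup threshold by brute force. It also shows one possible reading of an "array of co-occurrences and occurrences of a kernel item": for each item, the items found in every transaction that contains it, and the items found in at least one such transaction. The toy database, the function names, and the "always"/"sometimes" fields are all assumptions made for illustration.

```python
from itertools import combinations

# Toy transactional database (hypothetical data, for illustration only).
transactions = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "c"},
    {"a", "b", "c", "d"},
]


def support(itemset, db):
    """Count the transactions that contain every item of `itemset`."""
    return sum(1 for t in db if itemset <= t)


def frequent_itemsets(db, minsup):
    """Brute-force baseline: keep itemsets whose support count >= minsup."""
    items = sorted(set().union(*db))
    result = {}
    for k in range(1, len(items) + 1):
        found = False
        for combo in combinations(items, k):
            count = support(set(combo), db)
            if count >= minsup:
                result[frozenset(combo)] = count
                found = True
        if not found:
            # Support is anti-monotone: if no k-itemset is frequent,
            # no larger itemset can be frequent either.
            break
    return result


def cooccurrence_array(db):
    """Assumed reading of the abstract's array, keyed by kernel item.

    "always":    items present in ALL transactions containing the kernel item.
    "sometimes": items present in AT LEAST ONE such transaction.
    """
    items = sorted(set().union(*db))
    array = {}
    for kernel in items:
        containing = [t for t in db if kernel in t]
        array[kernel] = {
            "support": len(containing),
            "always": set.intersection(*containing) - {kernel},
            "sometimes": set.union(*containing) - {kernel},
        }
    return array


if __name__ == "__main__":
    for itemset, count in frequent_itemsets(transactions, minsup=3).items():
        print(sorted(itemset), count)
    for kernel, row in cooccurrence_array(transactions).items():
        print(kernel, row)
```

The brute-force enumeration is exponential in the number of items and serves only to pin down the definition of a frequent itemset. Under the assumption above, the per-item rows are independent of each other, so in principle they could be computed by separate workers.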

Similar Papers

A Novel Algorithm for Frequent Itemsets Mining in Transactional Databases
Huan Phan ... Bac Le
01 Jan 2018

A Novel Approach for Association Rule Mining using Pattern Generation
Deepa S Deshpande
International Journal of Information Technology and Computer Science | VOL. 6
08 Oct 2014

Mining Frequent Itemsets in Large Data Warehouses: A Novel Approach Proposed for Sparse Data Sets
S M Fakhrahmad ... M H Sadreddini
16 Dec 2007

ASCF: Optimization of the Apriori Algorithm Using Spark-Based Cuckoo Filter Structure
Bana Ahmad Alrahwan ... Yu-An Tan
International Journal of Intelligent Systems | VOL. 2024
22 Jan 2024
