Frequent Itemset Mining in Big Data With Effective Single Scan Algorithms

Youcef Djenouri, Jerry Chun-Wei Lin, Djamel Djenouri, Asma Belhadi

doi:10.1109/access.2018.2880275

Abstract

This paper considers frequent itemset mining (FIM) in transactional databases. It introduces a new exact single scan approach for frequent itemset mining (SSFIM), a heuristic alternative (EA-SSFIM), and a parallel implementation on Hadoop clusters (MR-SSFIM). EA-SSFIM and MR-SSFIM target sparse and big databases, respectively. The proposed approach, in all its variants, requires only one scan to extract the candidate itemsets, and it has the advantage of generating a fixed number of candidate itemsets independently of the value of the minimum support. This accelerates the scan process compared with existing approaches when dealing with sparse and big databases. Numerical results show that SSFIM outperforms state-of-the-art FIM approaches on medium and large databases. Moreover, EA-SSFIM provides performance similar to that of SSFIM while considerably reducing the runtime for large databases. The results also reveal the superiority of MR-SSFIM over existing HPC-based solutions for FIM on sparse and big databases.
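The single scan idea can be illustrated with a minimal sketch. It is not taken from the paper: it assumes that candidates are produced by enumerating every non-empty subset of each transaction and counting them in a hash table, so the set of candidates depends only on the data and not on the minimum support. The function name `single_scan_frequent_itemsets` and the toy transactions are hypothetical.

```python
from collections import Counter
from itertools import combinations


def single_scan_frequent_itemsets(transactions, min_support):
    """Return itemsets whose relative support is at least min_support.

    Assumption (not confirmed by the abstract): candidates are all
    non-empty subsets of each transaction, counted in one pass.
    """
    counts = Counter()
    # Single pass over the database: each transaction is read exactly once.
    for transaction in transactions:
        items = sorted(set(transaction))  # canonical order, no duplicates
        for size in range(1, len(items) + 1):
            for itemset in combinations(items, size):
                counts[itemset] += 1
    # The minimum support is only used here, after the scan, so the
    # candidates generated above do not depend on its value.
    threshold = min_support * len(transactions)
    return {itemset: c for itemset, c in counts.items() if c >= threshold}


# Hypothetical toy database of four transactions.
transactions = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b"]]
frequent = single_scan_frequent_itemsets(transactions, 0.5)
```

Note that enumerating all subsets costs time exponential in the length of each transaction, so a sketch like this is only practical when transactions are short, which is consistent with the abstract's emphasis on sparse databases.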


Journal: IEEE Access
Publication Date: Jan 1, 2018
Citations: 63
License type: cc-by-nc-nd


