An Efficient Approach of Extracting Frequent Itemsets from Large Data Using HDFS Framework

Prajakta G Kulkarni,S R Khonde

doi:10.15866/irecap.v7i6.13354

Abstract

Frequent itemsets extraction is very important in various data mining applications. It attempts to extract interesting patterns from given databases like association rules, correlations and clusters. It is difficult to calculate the frequent itemsets having a good speed from the available database. There are various algorithms to find out frequent itemsets like Apriori, FP growth algorithms, etc. Unfortunately, these algorithms fail in extracting interesting items, when it comes across excessive data. In the distributing environment, there is not only a need to automatically parallelize, but also to balance workloads well, which is also not possible with these algorithms. To defeat these disadvantages, there is a need to implement an algorithm supporting the missing elements, like automatic parallelization and workload balancing. This paper proposes a new algorithm for the extraction of frequent itemsets using Hadoop and MapReduce paradigms. The proposed algorithm is based on Modified Apriori algorithm, named as Frequent Itemset Mining using Modified Apriori (FIMMA). In this method, mappers will work independently and concurrently using the hashing technique for large databases; databases will distribute the number of mappers and the result will be given to the reducers. The reducers will give the final result showing the most frequent itemsets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Efficient Approach of Extracting Frequent Itemsets from Large Data Using HDFS Framework

Abstract

Talk to us

Similar Papers

More From: International Journal on Communications Antenna and Propagation (IRECAP)

Lead the way for us

Similar Papers

Indexed Enhancement on GenMax Algorithm for Fast and Less Memory Utilized Pruning of MFI and CFI
C Chandrasekar ... C Sathya
International Journal of Computer Applications | VOL. 41
C Chandrasekar, et. al.C Chandrasekar ... C Sathya
31 Mar 2012
International Journal of Computer Applications | VOL. 41

A Novel Approach of Frequent Itemset Mining Using HDFS Framework
Prajakta G Kulkarni ... S R Khonde
-
Prajakta G Kulkarni, et. al.Prajakta G Kulkarni ... S R Khonde
01 Jan 2018
01 Jan 2018

Mining Temporal Sequence Patterns Using Association Rule Mining Algorithms for Prediction of Human Activity from Surveillance Videos
D Manju ... V Radha
-
D Manju, et. al.D Manju ... V Radha
24 Jul 2020
24 Jul 2020

Mining Frequent Item and Item Sets Using Fuzzy Slices
Ms Poonam A Manjare ... Mrs R.R Shelke
international journal of engineering trends and technology | VOL. -
Ms Poonam A Manjare, et. al.Ms Poonam A Manjare ... Mrs R.R Shelke
25 Mar 2014
international journal of engineering trends and technology | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Efficient Approach of Extracting Frequent Itemsets from Large Data Using HDFS Framework

Abstract

Talk to us

Similar Papers

More From: International Journal on Communications Antenna and Propagation (IRECAP)