An Improved FP-Growth Algorithm Based on SOM Partition

Kuikui Jia,Haibin Liu

doi:10.1007/978-981-10-6385-5_15

Abstract

FP-growth algorithm is an algorithm for mining association rules without generating candidate sets. It has high practical value in many fields. However, it is a memory resident algorithm, and can only handle small data sets. It seems powerless when dealing with massive data sets. This paper improves the FP-growth algorithm. The core idea of the improved algorithm is to partition massive data set into small data sets, which would be dealt with separately. Firstly, systematic sampling methods are used to extract representative samples from large data sets, and these samples are used to make SOM (Self-organizing Map) cluster analysis. Then, the large data set is partitioned into several subsets according to the cluster results. Lastly, FP-growth algorithm is executed in each subset, and association rules are mined. The experimental result shows that the improved algorithm reduces the memory consumption, and shortens the time of data mining. The processing capacity and efficiency of massive data is enhanced by the improved algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Improved FP-Growth Algorithm Based on SOM Partition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Research of Data Mining Algorithm Based on the Intrusion Prevention System

Applied Mechanics and Materials | VOL. 644-650

01 Sep 2014
Applied Mechanics and Materials | VOL. 644-650

Dynamic Parallel Mining Algorithm of Association Rules Based on Interval Concept Lattice
Yafeng Yang ... Baoxiang Liu
Mathematics | VOL. 7
Yafeng Yang, et. al.Yafeng Yang ... Baoxiang Liu
19 Jul 2019
Mathematics | VOL. 7

Neutrosophic Fuzzy Association Rule Generation-Based Big Data Mining Analysis Algorithm
Qunfeng Wei ... Bin Qi
International Transactions on Electrical Energy Systems | VOL. 2022
Qunfeng Wei, et. al.Qunfeng Wei ... Bin Qi
25 Sep 2022
International Transactions on Electrical Energy Systems | VOL. 2022

A Parallel FP-Growth Mining Algorithm with Load Balancing Constraints for Traffic Crash Data
Yang Yang ... Zhenzhou Yuan
INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL | VOL. 17
Yang Yang, et. al.Yang Yang ... Zhenzhou Yuan
20 Jul 2022
INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Improved FP-Growth Algorithm Based on SOM Partition

Abstract

Talk to us

Similar Papers