FP-Growth Algorithm Research Articles

The typical hypothesis testing issue in statistical analysis is determining whether a pattern is significantly associated with a specific class label. This usually leads to highly challenging multiple-hypothesis testing problems in big data mining scenarios, as millions or billions of hypothesis tests in large-scale exploratory data analysis can result in a large number of false positive results. The permutation testing-based FWER control method (PFWER) is theoretically effective in dealing with multiple hypothesis testing issues. In reality, however, this theoretical approach confronts a serious computational efficiency problem. It takes an extremely long time to compute an appropriate FWER false positive control threshold using PFWER, which is almost impossible to achieve in a reasonable amount of time using human effort on medium- or large-scale data. Although some methods for improving the efficiency of the FWER false positive control threshold calculation have been proposed, most of them are stand-alone, and there is still a lot of space for efficiency improvement. To address this problem, this paper proposes a distributed PFWER false-positive threshold calculation method for large-scale data. The computational effectiveness increases significantly when compared to the current approaches. The FP-growth algorithm is used first for pattern mining, and the mining process reduces the computation of invalid patterns by using pruning operations and index optimization for merging patterns with index transactions. The distributed computing technique is introduced on this basis, and the constructed FP tree is decomposed into a set of subtrees, each corresponding to a subtask. All subtrees (subtasks) are distributed to different computing nodes. Each node independently calculates the local significance threshold according to the designated subtasks. Finally, all local results are aggregated to compute the FWER false positive control threshold, which is completely consistent with the theoretical result. A series of experimental findings on 11 real-world datasets demonstrate that the distributed algorithm proposed in this paper can significantly improve the computation efficiency of PFWER while ensuring its theoretical accuracy.

Read full abstract

Existing transaction data is only recorded and stored as a sales transaction memorandum, so it has not been utilized optimally. The data is only stored and used as transaction history. The availability of a lot of data and having a pattern of sales transactions that are similar to MSME Cafe Over Limit will be utilized by using data mining science. This research uses the association rules method. Implementation of fp-growth to get item combinations. The purpose of this research is to make it easier for MSMEs to determine menu recommendations for customers. The fp-growth algorithm is used to process as many as 2038 transaction data with a minimum support value of 10%, while for a minimum confidence value of 50%. So that there are 3 rules, namely "if you order Mariam chocolate cheese milk then the customer will order Kopsus Overlimit", from this rule it will form a support value of 10.79%, using a confidence value of 54.19% and a lift ratio of 0.93. Furthermore "if you order Kopsus Overlimit then you will order tofu at grandma's house", from the rule it will produce a support value of 34.69%, with a specified confidence value of 59.76%, so the lift ratio value is 1.15. The last rule "if you order tofu at grandma's house, the customer orders Kopsus Overlimit", from the rule that occurs, the support value is 34.69%, with a confidence value of 66.7% and a lift ratio of 1.15. The results of the study found the two best rules, namely "if the customer orders over-limit Kopsus, he will order tofu at grandma's house" and "if he orders tofu at grandma's house, the customer orders over-limit Kopsus". Based on the results of the rules formed, it can be concluded that only two rules can be categorized as valid and can be used as a reference in food and beverage menu recommendations at MSME Cafe Over Limit. So the results of this study can be useful to be applied to MSMEs, especially in terms of menu recommendations.

Read full abstract

FP-Growth Algorithm Research Articles

Related Topics

Articles published on FP-Growth Algorithm

A statistical method for predicting quantitative variables in association rule mining

Implementasi Data Mining Menggunakan Algoritma Fp-Growth Untuk Menganalisa Transaksi Penjualan Ekspor Online

Application of Market Basket Analysis on Beauty Clinic to Increasing Customer’s Buying Decision

Penerapan Algoritma FP-Growth untuk Menentukan Strategi Promosi Berdasarkan Waktu dan Pembelian Produk

Analisis Pola Asosiasi Data Transaksi Penjualan Minuman Menggunakan Algoritma FP-Growth dan Eclat

Market Basket Analysis dengan Perbandingan Metode Apriori dan FP-Growth Pada Data Transaksi XYZ

Efficient Algorithms for Patterns Identification in Medical Data

Enhancing Sales Determination for Coffee Shop Packages through Associated Data Mining: Leveraging the FP-Growth Algorithm

Perancangan Tata Letak Toko Ritel Berdasarkan Pola Belanja Konsumen Dengan Market Basket Analysis (Studi Kasus: Indomaret Sukatani)

MBA: Market Basket Analysis Using Frequent Pattern Mining Techniques

Implementasi Data Mining Pada Penjualan Pakaian dengan Algoritma FP-Growth

Efficient False Positive Control Algorithms in Big Data Mining

Utilization of Data Mining on MSMEs using FP-Growth Algorithm for Menu Recommendations

Machine Learning Prediction Models applied to Weather Forecasting: A survey

Use of Data Mining for The Analysis of Consumer Purchase Patterns with The Fpgrowth Algorithm on Motor Spare Part Sales Transactions Data

Implementasi Data Mining FP-Growth Untuk Analisis Pola Pembelian Pada Transaksi Penjualan

Association rules and prediction of transportation mode choice: Application to national travel survey data

Frequent Itemset Mining Algorithm Based on Linear Table

Android malware detection based on sensitive patterns

Market Basket Analysis Using FP-Growth Algorithm to Design Marketing Strategy by Determining Consumer Purchasing Patterns

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

FP-Growth Algorithm Research Articles

Related Topics

Articles published on FP-Growth Algorithm

A statistical method for predicting quantitative variables in association rule mining

Implementasi Data Mining Menggunakan Algoritma Fp-Growth Untuk Menganalisa Transaksi Penjualan Ekspor Online

Application of Market Basket Analysis on Beauty Clinic to Increasing Customer’s Buying Decision

Penerapan Algoritma FP-Growth untuk Menentukan Strategi Promosi Berdasarkan Waktu dan Pembelian Produk

Analisis Pola Asosiasi Data Transaksi Penjualan Minuman Menggunakan Algoritma FP-Growth dan Eclat

Market Basket Analysis dengan Perbandingan Metode Apriori dan FP-Growth Pada Data Transaksi XYZ

Efficient Algorithms for Patterns Identification in Medical Data

Enhancing Sales Determination for Coffee Shop Packages through Associated Data Mining: Leveraging the FP-Growth Algorithm

Perancangan Tata Letak Toko Ritel Berdasarkan Pola Belanja Konsumen Dengan Market Basket Analysis (Studi Kasus: Indomaret Sukatani)

MBA: Market Basket Analysis Using Frequent Pattern Mining Techniques

Implementasi Data Mining Pada Penjualan Pakaian dengan Algoritma FP-Growth

Efficient False Positive Control Algorithms in Big Data Mining

Utilization of Data Mining on MSMEs using FP-Growth Algorithm for Menu Recommendations

Machine Learning Prediction Models applied to Weather Forecasting: A survey

Use of Data Mining for The Analysis of Consumer Purchase Patterns with The Fpgrowth Algorithm on Motor Spare Part Sales Transactions Data

Implementasi Data Mining FP-Growth Untuk Analisis Pola Pembelian Pada Transaksi Penjualan

Association rules and prediction of transportation mode choice: Application to national travel survey data

Frequent Itemset Mining Algorithm Based on Linear Table

Android malware detection based on sensitive patterns

Market Basket Analysis Using FP-Growth Algorithm to Design Marketing Strategy by Determining Consumer Purchasing Patterns