Average-Case Performance of the Apriori Algorithm

Paul W Purdom, Dennis P Groth, Dirk Van Gucht

doi:10.1137/s0097539703422881

Abstract

The failure rate of the Apriori Algorithm is studied analytically for the case of random shoppers. The time needed by the Apriori Algorithm is determined by the number of item sets that are output (successes: item sets that occur in at least k baskets) and the number of item sets that are counted but not output (failures: item sets where all subsets of the item set occur in at least k baskets but the full set occurs in fewer than k baskets). The number of successes is a property of the data; no algorithm that is required to output each success can avoid doing work associated with the successes. The number of failures is a property of both the algorithm and the data. We find that under a wide range of conditions the performance of the Apriori Algorithm is almost as bad as is permitted under sophisticated worst-case analyses. In particular, there is usually a bad level with two properties: (1) it is the level where nearly all of the work is done, and (2) nearly all item sets counted are failures. Let l be the...
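
The distinction between successes and failures can be made concrete with a small sketch. The code below is an illustrative assumption, not the authors' implementation or analysis: the function name `apriori_levels`, the toy baskets, and the brute-force candidate generation are all hypothetical choices made for brevity. It runs a level-wise Apriori pass and records, for each level, which counted item sets were successes (in at least k baskets) and which were failures (every proper subset frequent, but the full set in fewer than k baskets).

```python
from itertools import combinations


def apriori_levels(baskets, k):
    """Level-wise Apriori sketch; returns {level: (successes, failures)}."""
    baskets = [frozenset(b) for b in baskets]
    items = sorted({item for b in baskets for item in b})
    # Level 1 candidates: every single item (no non-empty proper subsets to check).
    candidates = [frozenset([item]) for item in items]
    level = 1
    stats = {}
    while candidates:
        successes, failures = [], []
        for cand in candidates:
            # Count the baskets that contain the whole candidate item set.
            count = sum(1 for b in baskets if cand <= b)
            (successes if count >= k else failures).append(cand)
        stats[level] = (successes, failures)
        # Next level: sets one item larger whose every subset of the
        # current size was a success. Brute force, for clarity only.
        frequent = set(successes)
        freq_items = sorted({i for s in successes for i in s})
        level += 1
        candidates = [
            frozenset(c)
            for c in combinations(freq_items, level)
            if all(frozenset(s) in frequent for s in combinations(c, level - 1))
        ]
    return stats


# Toy data (hypothetical): six baskets over items a, b, c, d.
toy = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}, {"d"}]
summary = {lvl: (len(s), len(f)) for lvl, (s, f) in apriori_levels(toy, 3).items()}
print(summary)  # {1: (3, 1), 2: (3, 0), 3: (0, 1)}
```

In this toy run with k = 3, the single candidate at level 3 is counted but not output, so that level consists entirely of failures. This is only a small picture of what a level dominated by failures looks like, not a reproduction of the paper's random-shopper model.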

Journal: SIAM Journal on Computing
Publication Date: Jan 1, 2004
Citations: 36

Similar Papers

A false negative approach to mining frequent itemsets from high speed transactional data streams
Jeffrey Xu Yu ... Aoying Zhou
Information Sciences | VOL. 176
29 Nov 2005

A Refined K-Means Technique to Find the Frequent Item Sets
A Sarvani ... Nagaraju Devarakonda
23 Dec 2017

A computer program for Likert format questionnaire analysis
Richard F Antonak
Behavior Research Methods & Instrumentation | VOL. 9
01 Jan 1976

Research and improvement of Apriori algorithm
Jiaoling Du ... Xiangli Zhang
01 May 2016
