Communication-efficient distributed mining of association rules

Assaf Schuster,Ran Wolff

doi:10.1145/376284.375728

Abstract

Mining for associations between items in large transactional databases is a central problem in the field of knowledge discovery. When the database is partitioned among several share-nothing machines, the problem can be addressed using distributed data mining algorithms. One such algorithm, called CD, was proposed by Agrawal and Shafer in [1] and was later enhanced by the FDM algorithm of Cheung, Han et al. [5]. The main problem with these algorithms is that they do not scale well with the number of partitions. They are thus impractical for use in modern distributed environments such as peer-to-peer systems, in which hundreds or thousands of computers may interact. In this paper we present a set of new algorithms that solve the Distributed Association Rule Mining problem using far less communication. In addition to being very efficient, the new algorithms are also extremely robust. Unlike existing algorithms, they continue to be efficient even when the data is skewed or the partition sizes are imbalanced. We present both experimental and theoretical results concerning the behavior of these algorithms and explain how they can be implemented in different settings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Communication-efficient distributed mining of association rules

Abstract

Talk to us

Similar Papers

More From: ACM SIGMOD Record

Lead the way for us

Journal: ACM SIGMOD Record	Publication Date: May 1, 2001
Citations: 6

Similar Papers

Communication-Efficient Distributed Mining of Association Rules
Assaf Schuster ... Ran Wolff
Data Mining and Knowledge Discovery | VOL. 8
Assaf Schuster, et. al.Assaf Schuster ... Ran Wolff
01 Mar 2004
Data Mining and Knowledge Discovery | VOL. 8

Communication-efficient distributed mining of association rules
Assaf Schuster ... Ran Wolff
-
Assaf Schuster, et. al.Assaf Schuster ... Ran Wolff
01 May 2001
01 May 2001

A New Dynamic Distributed Algorithm for Frequent Itemsets Mining
Azam Adelpoor ... Mohammad Saniee Abadeh
International Journal of Computer Applications | VOL. 67
Azam Adelpoor, et. al.Azam Adelpoor ... Mohammad Saniee Abadeh
18 Apr 2013
International Journal of Computer Applications | VOL. 67

An efficient approach for mining positive and negative association rules from large transactional databases
Peddi Kishor ... Sammulal Porika
-
Peddi Kishor, et. al.Peddi Kishor ... Sammulal Porika
01 Aug 2016
01 Aug 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Communication-efficient distributed mining of association rules

Abstract

Talk to us

Similar Papers

More From: ACM SIGMOD Record