Probabilistic Static Load-Balancing of Parallel Mining of Frequent Sequences

Robert Kessl

doi:10.1109/tkde.2016.2515622

Abstract

Frequent sequence mining is a well-known and well-studied problem in data mining. Its output is used in many other areas, such as bioinformatics, chemistry, and market basket analysis. Unfortunately, frequent sequence mining is computationally expensive. In this paper, we present a novel parallel algorithm for mining frequent sequences based on static load-balancing. The static load-balancing is done by measuring the computational time using a probabilistic algorithm. For instances of reasonable size, the algorithms achieve speedups of up to $\approx 3/4\cdot P$, where $P$ is the number of processors. In the experimental evaluation, we show that our method performs significantly better than the current state-of-the-art methods. The presented approach is very general: it can be used for static load-balancing of other pattern mining algorithms, such as itemset, tree, and graph mining algorithms.
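The abstract does not spell out the load-balancing procedure, so the following is only a minimal sketch of the general idea of static load-balancing from sampled work estimates, not the paper's algorithm. It assumes that a uniform sample of work units is available, each labelled with the partition it belongs to, and it uses a generic greedy longest-processing-time heuristic for the assignment. All names (`estimate_shares`, `assign_static`, `speedup_bound`) are hypothetical.

```python
import heapq
from collections import Counter


def estimate_shares(sampled_labels):
    """Estimate each partition's share of the total work from a uniform sample.

    Assumption: each sampled work unit carries the label of its partition.
    """
    counts = Counter(sampled_labels)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


def assign_static(shares, num_processors):
    """Greedy heuristic: give the largest remaining partition to the
    currently least-loaded processor. The assignment is fixed before mining."""
    heap = [(0.0, p) for p in range(num_processors)]
    heapq.heapify(heap)
    assignment = {p: [] for p in range(num_processors)}
    # Largest share first; ties are broken by label for determinism.
    for label, share in sorted(shares.items(), key=lambda kv: (-kv[1], kv[0])):
        load, p = heapq.heappop(heap)
        assignment[p].append(label)
        heapq.heappush(heap, (load + share, p))
    loads = {p: sum(shares[l] for l in labels) for p, labels in assignment.items()}
    return assignment, loads


def speedup_bound(loads):
    """Ideal speedup if run time is proportional to the heaviest load."""
    return sum(loads.values()) / max(loads.values())


if __name__ == "__main__":
    sample = ["a"] * 4 + ["b"] * 3 + ["c"] * 2 + ["d"] * 1
    assignment, loads = assign_static(estimate_shares(sample), num_processors=2)
    print(assignment, loads, speedup_bound(loads))
```

In this toy setting the estimated shares alone bound the achievable speedup: the closer the heaviest processor's load is to $1/P$ of the total, the closer the speedup can get to $P$.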

Journal: IEEE Transactions on Knowledge and Data Engineering
Publication Date: May 1, 2016
Citations: 24

Similar Papers

Efficient mining of intra-periodic frequent sequences
Edith Belise Kenmogne ... Clémentin Tayou Djamegni
Array | VOL. 16
01 Dec 2022

Probabilistic algorithm for mining frequent sequences
Julija Pragarauskaitė ... Gintautas Dzemyda
Lietuvos matematikos rinkinys | VOL. 51
21 Dec 2010

Differentially Private Frequent Sequence Mining
Shengzhi Xu ... Xiang Cheng
IEEE Transactions on Knowledge and Data Engineering | VOL. 28
01 Nov 2016

Mining frequent biological sequences based on bitmap without candidate sequence generation
Qian Wang ... Jiadong Ren
Computers in Biology and Medicine | VOL. 69
30 Dec 2015
