A novel mapreduce algorithm for distributed mining of sequential patterns using co-occurrence information

Sumalatha Saleti, R. B. V. Subramanyam

doi:10.1007/s10489-018-1259-2

Abstract

The Sequential Pattern Mining (SPM) problem has been widely studied and extended in several directions. With the tremendous growth in dataset sizes, traditional algorithms no longer scale. To address this scalability issue, a few researchers have recently developed distributed algorithms based on MapReduce. However, the existing MapReduce algorithms require multiple rounds of MapReduce, which increases communication and scheduling overhead. They also do not address the handling of long sequences, and they generate a huge number of candidate sequences that do not appear in the input database, which enlarges the search space and increases the number of candidates that must be checked during support counting. Our algorithm is a two-phase MapReduce algorithm that generates only the promising candidate sequences by applying pruning strategies. This reduces the search space and makes support computation more effective. We make use of item co-occurrence information, and the proposed Sequence Index List (SIL) data structure helps compute support quickly. The experimental results show that the proposed algorithm outperforms the existing MapReduce algorithms for the SPM problem.
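
To make the idea of pruning with item co-occurrence information concrete, here is a minimal single-machine sketch in Python. It is not the authors' algorithm: the abstract does not describe the pruning rules, the MapReduce phases, or the layout of the SIL, so none of those are reproduced here. The sketch assumes each sequence is a plain list of single items, and the function names (`cooccurrence_support`, `is_promising`, `support`) are hypothetical. It relies only on the general fact that a candidate cannot be frequent unless every ordered pair of its items is frequent.

```python
from collections import defaultdict


def cooccurrence_support(db):
    """For each ordered pair (a, b), count the sequences in which a occurs before b."""
    counts = defaultdict(int)
    for seq in db:
        seen_items = set()
        seen_pairs = set()
        for item in seq:
            # Every item seen earlier in this sequence precedes the current item.
            for earlier in seen_items:
                seen_pairs.add((earlier, item))
            seen_items.add(item)
        # Count each pair at most once per sequence.
        for pair in seen_pairs:
            counts[pair] += 1
    return counts


def is_promising(candidate, cooc, min_support):
    """Assumed pruning rule: reject a candidate if any ordered item pair is infrequent."""
    for i in range(len(candidate)):
        for j in range(i + 1, len(candidate)):
            if cooc.get((candidate[i], candidate[j]), 0) < min_support:
                return False
    return True


def support(candidate, db):
    """Count the sequences that contain the candidate as a subsequence."""
    def contains(seq):
        it = iter(seq)
        # 'item in it' advances the iterator, so the order of items is respected.
        return all(item in it for item in candidate)
    return sum(1 for seq in db if contains(seq))


# Toy database, invented for illustration only.
db = [list("abcd"), list("acbd"), list("abd"), list("bcd")]
cooc = cooccurrence_support(db)

print(is_promising(list("acb"), cooc, 2))  # pruned before any support counting
print(is_promising(list("abd"), cooc, 2))  # survives pruning
print(support(list("abd"), db))            # full support count for the survivor
```

In this toy database the pair (c, b) occurs in only one sequence, so the candidate a, c, b is discarded without scanning the database, while a, b, d passes the check and is then counted in full.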

Journal: Applied Intelligence
Publication Date: Aug 20, 2018
Citations: 11

Similar Papers

A New Approach for Problem of Sequential Pattern Mining
Thanh-Trung Nguyen ... Phi-Khu Nguyen
01 Jan 2012

Sequential Pattern Mining: A Survey on Approaches
R Boghey ... S Singh
01 Apr 2013

Contiguous item sequential pattern mining using UpDown Tree
Jinlin Chen
Intelligent Data Analysis | VOL. 12
18 Feb 2008

Targeted mining of contiguous sequential patterns
Kaixia Hu ... Philippe Fournier-Viger
Information Sciences | VOL. 653
20 Oct 2023
