Distributed mining of high utility time interval sequential patterns using mapreduce approach

Saleti Sumalatha, R.B.V Subramanyam

doi:10.1016/j.eswa.2019.112967

Abstract

High utility sequential pattern (HUSP) mining algorithms aim to find all high utility sequences in a sequence database. Because data volumes have grown rapidly, a few distributed algorithms based on the MapReduce framework have recently been designed for mining HUSPs. However, existing HUSP algorithms such as USpan, HUS-Span and BigHUSP report only the order of items; they do not capture the time intervals between successive items. In real-world scenarios, time interval patterns provide more valuable information than conventional high utility sequential patterns. We therefore propose a distributed high utility time interval sequential pattern mining (DHUTISP) algorithm that uses the MapReduce approach and is suitable for big data. DHUTISP introduces a novel time interval utility linked list (TIUL) data structure to calculate the utility of the resulting patterns efficiently. In addition, two utility upper bounds, the remaining utility upper bound (RUUB) and the co-occurrence utility upper bound (CUUB), are proposed to prune unpromising candidates. We conducted various experiments to demonstrate the efficiency of the proposed algorithm against both distributed and non-distributed approaches. The experimental results show that DHUTISP is more efficient than the state-of-the-art algorithms BigHUSP, AHUS-P, PUSOM and UTMining_A.
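
To make the notion of a time interval sequential pattern with utility concrete, the following is a minimal single-machine sketch. It is not the DHUTISP algorithm, and it does not implement TIUL, RUUB, CUUB or any MapReduce stage. It assumes a common HUSP convention: the utility of a pattern in one sequence is the maximum total utility over its occurrences, and the utility in the database is the sum over sequences. The event layout, the function names and the toy data are all hypothetical.

```python
from typing import List, Tuple

# Assumed event layout (hypothetical): (timestamp, item, utility),
# with each sequence sorted by timestamp.
Event = Tuple[int, str, float]


def pattern_utility(seq: List[Event], items: List[str],
                    gaps: List[Tuple[int, int]]) -> float:
    """Maximum utility over all occurrences of the pattern in one sequence.

    items: the ordered items of the pattern.
    gaps:  gaps[k] = (lo, hi) is the allowed time difference between
           items[k] and items[k + 1], both bounds inclusive.
    Returns 0.0 when the pattern does not occur.
    """
    best = 0.0

    def extend(pos: int, k: int, prev_time: int, total: float) -> None:
        nonlocal best
        if k == len(items):          # every pattern item has been matched
            best = max(best, total)
            return
        for i in range(pos, len(seq)):
            t, item, u = seq[i]
            if item != items[k]:
                continue
            if k > 0:                # check the interval to the previous item
                lo, hi = gaps[k - 1]
                if not (lo <= t - prev_time <= hi):
                    continue
            extend(i + 1, k + 1, t, total + u)

    extend(0, 0, 0, 0.0)
    return best


def database_utility(db: List[List[Event]], items: List[str],
                     gaps: List[Tuple[int, int]]) -> float:
    """Sum of per-sequence pattern utilities (assumed convention)."""
    return sum(pattern_utility(seq, items, gaps) for seq in db)


# Toy database (made up for illustration only).
db = [
    [(0, "a", 5.0), (2, "b", 3.0), (7, "b", 10.0)],
    [(1, "a", 2.0), (2, "c", 4.0), (4, "b", 6.0)],
    [(0, "b", 1.0), (3, "a", 4.0)],
]

short_gap = database_utility(db, ["a", "b"], [(0, 3)])   # b within 0..3 of a
long_gap = database_utility(db, ["a", "b"], [(4, 10)])   # b within 4..10 of a
print(short_gap, long_gap)
```

The same item order "a then b" yields different utilities depending on the interval constraint, which is the extra information an order-only HUSP would not distinguish. A pattern would count as high utility when its database utility reaches a user-chosen threshold.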

Journal: Expert Systems with Applications
Publication Date: Sep 19, 2019
Citations: 25
