Efficient algorithms for mining frequent high utility sequences with constraints

Tin Truong,Hai Duong,Bac Le,Philippe Fournier-Viger,Unil Yun,Hamido Fujita

doi:10.1016/j.ins.2021.01.060

Abstract

An important data mining task is to discover all high utility sequences in a quantitative sequence database. Although useful, the number of discovered sequences is often very large. To find patterns that are more tailored to a user’s needs, this paper studies the problem of mining frequent high utility sequences satisfying item constraints. This article proposes a novel algorithm named C-FHUSM to quickly obtain these sequences from two concise representations discovered from a quantitative sequence database, namely frequent generator high utility sequences and frequent closed high utility sequences. The first set is extracted using a novel algorithm named FGenHUSM, while an existing algorithm is applied to extract the second set. C-FHUSM integrates novel pruning techniques to ignore sequences that do not satisfy item constraints early by checking only a small number of representative sequences at the beginning of the mining process. Experimental results show that C-FHUSM can be more than ten times faster and has better scalability than a modified version of the state-of-the-art EHUSM algorithm for mining sequences with item constraints. Moreover, it is found that using C-FHUSM is beneficial when a user frequently changes constraints as results can be updated without rescanning the database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient algorithms for mining frequent high utility sequences with constraints

Abstract

Talk to us

Similar Papers

More From: Information Sciences

Lead the way for us

Journal: Information Sciences	Publication Date: Feb 2, 2021
Citations: 21

Similar Papers

FMaxCloHUSM: An efficient algorithm for mining frequent closed and maximal high utility sequences
Tin Truong ... Philippe Fournier-Viger
Engineering Applications of Artificial Intelligence | VOL. 85
Tin Truong, et. al.Tin Truong ... Philippe Fournier-Viger
11 Jun 2019
Engineering Applications of Artificial Intelligence | VOL. 85

EHAUSM: An efficient algorithm for high average utility sequence mining
Tin Truong ... Philippe Fournier-Viger
Information Sciences | VOL. 515
Tin Truong, et. al.Tin Truong ... Philippe Fournier-Viger
19 Dec 2019
Information Sciences | VOL. 515

A Survey of High Utility Sequential Pattern Mining
Tin Truong-Chi ... Philippe Fournier-Viger
-
Tin Truong-Chi, et. al.Tin Truong-Chi ... Philippe Fournier-Viger
01 Jan 2019
01 Jan 2019

Fast generation of sequential patterns with item constraints from concise representations
Hai Duong ... Tin Truong
Knowledge and Information Systems | VOL. 62
Hai Duong, et. al.Hai Duong ... Tin Truong
08 Nov 2019
Knowledge and Information Systems | VOL. 62

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient algorithms for mining frequent high utility sequences with constraints

Abstract

Talk to us

Similar Papers

More From: Information Sciences