Discovering Significant Sequential Patterns in Data Stream by an Efficient Two-Phase Procedure

Huijun Tang,Le Wang,Yangguang Liu,Jiangbo Qian

doi:10.1155/2022/5379086

Huijun Tang, Le Wang + Show 2 more

Open Access

https://doi.org/10.1155/2022/5379086

Copy DOI

Journal: Mathematical Problems in Engineering	Publication Date: Dec 13, 2022
Citations: 1	License type: CC BY 4.0

Affiliation: Ningbo University

Abstract

One essential topic of mining sequential patterns in the data stream is to optimize the time-space computations. However, more importantly, it should pay more attention to the significance of mining results as a large portion of them just response to the user-defined constraints purely by accident and they may have no statistical significance. In this paper, we propose FSSPDS, an efficient two-phase algorithm to discover the significant sequential patterns (SSPs) in the data stream with typical sliding windows, which has never been considered in existing problems. First, for generating SSPs candidates with high-quality, FSSPDS takes testable support and pattern length constraints into account and insignificant patterns were removed timely by a pattern-growth method. In the second phase, appropriate permutation testing is used to test the significance of the SSPs candidates. Exact permutation p values are obtained in a novel combination way based on unconditional Barnard’s test statistic which better reflects the process of data generations and collections. Experimental evaluations show that FSSPDS allows the discovery of SSPs in the data stream and rivals the state-of-the-art approaches efficiently under the control of family-wise error rate (FWER), especially for time efficiency, which was approximately an order of magnitude higher.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discovering Significant Sequential Patterns in Data Stream by an Efficient Two-Phase Procedure

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems in Engineering

Lead the way for us

Similar Papers

A Variable Sliding Window Algorithm Based on Concept Drift for Frequent Pattern Mining Over Data Streams*
Yue Yin ... Peng Li
-
Yue Yin, et. al.Yue Yin ... Peng Li
01 Jan 2023
01 Jan 2023

Mining sequential patterns from data streams: a centroid approach
Alice Marascu ... Florent Masseglia
Journal of Intelligent Information Systems | VOL. 27
Alice Marascu, et. al.Alice Marascu ... Florent Masseglia
01 Nov 2006
Journal of Intelligent Information Systems | VOL. 27

SPAMS: A Novel Incremental Approach for Sequential Pattern Mining in Data Streams
Lionel Vinceslas ... Pascal Poncelet
-
Lionel Vinceslas, et. al.Lionel Vinceslas ... Pascal Poncelet
24 Nov 2009
24 Nov 2009

Mining Closed Regular Patterns in Data Streams
Sreedevi M ... Reddy L.S.S
International Journal of Computer Science and Information Technology | VOL. 5
Sreedevi M, et. al.Sreedevi M ... Reddy L.S.S
28 Feb 2013
International Journal of Computer Science and Information Technology | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discovering Significant Sequential Patterns in Data Stream by an Efficient Two-Phase Procedure

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems in Engineering