Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases

Zhou Zhao Zhou Zhao,Da Yan Da Yan,Wilfred Ng

doi:10.1109/tkde.2013.124

Abstract

Data uncertainty is inherent in many real-world applications such as environmental surveillance and mobile tracking. Mining sequential patterns from inaccurate data, such as those data arising from sensor readings and GPS trajectories, is important for discovering hidden knowledge in such applications. In this paper, we propose to measure pattern frequentness based on the possible world semantics. We establish two uncertain sequence data models abstracted from many real-life applications involving uncertain sequence data, and formulate the problem of mining probabilistically frequent sequential patterns (or p-FSPs) from data that conform to our models. However, the number of possible worlds is extremely large, which makes the mining prohibitively expensive. Inspired by the famous PrefixSpan algorithm, we develop two new algorithms, collectively called U-PrefixSpan, for p-FSP mining. U-PrefixSpan effectively avoids the problem of “possible worlds explosion”, and when combined with our four pruning and validating methods, achieves even better performance. We also propose a fast validating method to further speed up our U-PrefixSpan algorithm. The efficiency and effectiveness of U-PrefixSpan are verified through extensive experiments on both real and synthetic datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: May 1, 2014
Citations: 46

Similar Papers

Mining probabilistically frequent sequential patterns in uncertain databases
Zhou Zhao ... Da Yan
-
Zhou Zhao, et. al.Zhou Zhao ... Da Yan
27 Mar 2012
27 Mar 2012

Model-based probabilistic frequent itemset mining
Thomas Bernecker ... Matthias Renz
Knowledge and Information Systems | VOL. 37
Thomas Bernecker, et. al.Thomas Bernecker ... Matthias Renz
21 Oct 2012
Knowledge and Information Systems | VOL. 37

Mining uncertain data with probabilistic guarantees
Liwen Sun ... Reynold Cheng
-
Liwen Sun, et. al.Liwen Sun ... Reynold Cheng
25 Jul 2010
25 Jul 2010

Accelerating probabilistic frequent itemset mining
Liang Wang ... Sau Dan Lee
-
Liang Wang, et. al.Liang Wang ... Sau Dan Lee
26 Oct 2010
26 Oct 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering