Efficient Mining of Outlying Sequence Patterns for Analyzing Outlierness of Sequence Data

Tingting Wang,Lei Duan,Zhifeng Bao,Guozhu Dong

doi:10.1145/3399671

Abstract

Recently, a lot of research work has been proposed in different domains to detect outliers and analyze the outlierness of outliers for relational data. However, while sequence data is ubiquitous in real life, analyzing the outlierness for sequence data has not received enough attention. In this article, we study the problem of mining outlying sequence patterns in sequence data addressing the question: given a query sequence s in a sequence dataset D , the objective is to discover sequence patterns that will indicate the most unusualness (i.e., outlierness) of s compared against other sequences. Technically, we use the rank defined by the average probabilistic strength ( aps ) of a sequence pattern in a sequence to measure the outlierness of the sequence. Then a minimal sequence pattern where the query sequence is ranked the highest is defined as an outlying sequence pattern. To address the above problem, we present OSPMiner, a heuristic method that computes aps by incorporating several pruning techniques. Our empirical study using both real and synthetic data demonstrates that OSPMiner is effective and efficient.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Mining of Outlying Sequence Patterns for Analyzing Outlierness of Sequence Data

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Knowledge Discovery from Data

Lead the way for us

Journal: ACM Transactions on Knowledge Discovery from Data	Publication Date: Aug 5, 2020
Citations: 20

Similar Papers

The evaluation of occupational accident with sequential pattern mining
Nazli Gulum Mutlu ... Turkay Dereli
Safety Science | VOL. 166
Nazli Gulum Mutlu, et. al.Nazli Gulum Mutlu ... Turkay Dereli
07 Jun 2023
Safety Science | VOL. 166

Mining Compressed Repetitive Gapped Sequential Patterns Efficiently
Yongxin Tong ... Dan Yu
-
Yongxin Tong, et. al.Yongxin Tong ... Dan Yu
01 Jan 2009
01 Jan 2009

Discovering Sequential Source Code Patterns in Software Engineering
Kökten Bi̇rant ... Dilara Kirnapci
Düzce Üniversitesi Bilim ve Teknoloji Dergisi | VOL. 10
Kökten Bi̇rant, et. al.Kökten Bi̇rant ... Dilara Kirnapci
31 Jan 2022
Düzce Üniversitesi Bilim ve Teknoloji Dergisi | VOL. 10

On mining multi-time-interval sequential patterns
Ya-Han Hu ... Yen-Liang Chen
Data & Knowledge Engineering | VOL. 68
Ya-Han Hu, et. al.Ya-Han Hu ... Yen-Liang Chen
23 May 2009
Data & Knowledge Engineering | VOL. 68

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Mining of Outlying Sequence Patterns for Analyzing Outlierness of Sequence Data

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Knowledge Discovery from Data