Incremental sequence-based frequent query pattern mining from XML queries

Guoliang Li, Jianyong Wang, Jianhua Feng, Lizhu Zhou

doi:10.1007/s10618-009-0126-5

Abstract

Existing algorithms for mining frequent XML query patterns (XQPs) employ a candidate generate-and-test strategy, which involves expensive candidate enumeration and costly tree-containment checking. Further, most existing methods periodically recompute the frequencies of candidate query patterns from scratch by scanning the entire transaction database, which consists of XQPs derived from user query logs. Maintaining the discovered frequent patterns in real XML databases is not straightforward, because frequent updates may both invalidate some existing frequent query patterns and generate new ones. Existing methods are therefore rather inefficient when the transaction database evolves. To address these problems, this paper proposes an efficient algorithm, ESPRIT, to mine frequent XQPs without costly tree-containment checking. ESPRIT transforms XML queries into sequences using a one-to-one mapping technique and mines the frequent sequences to generate frequent XQPs. We propose two efficient incremental algorithms, ESPRIT-i and ESPRIT-i+, to incrementally mine frequent XQPs. We also devise several novel optimization techniques for query rewriting, cache lookup, and cache replacement to improve the answerability and the hit rate of caching. We have implemented our algorithms and conducted a set of experimental studies on various datasets. The experimental results demonstrate that our algorithms achieve high efficiency and scalability and significantly outperform state-of-the-art methods.
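The abstract does not spell out the mapping ESPRIT uses, so the following is only a minimal sketch of the general idea under stated assumptions, not the paper's algorithm. It assumes a query pattern is an ordered labeled tree written as `(label, [children])`, encodes it as a preorder sequence of `(depth, label)` pairs (one common one-to-one encoding of ordered trees), and keeps exact-match pattern counts up to date as queries are added or removed instead of rescanning the whole log. The names `encode`, `decode`, and `PatternCounts` are hypothetical, and counting only whole-query matches is a simplification of frequent subpattern mining.

```python
from collections import Counter

# Hypothetical representation: a query pattern tree is (label, [children]).

def encode(tree, depth=0):
    """Return the preorder sequence of (depth, label) pairs for an ordered tree."""
    label, children = tree
    seq = [(depth, label)]
    for child in children:
        seq.extend(encode(child, depth + 1))
    return tuple(seq)

def decode(seq):
    """Rebuild the tree from its (depth, label) sequence (inverse of encode)."""
    root = (seq[0][1], [])
    stack = [root]  # stack[d] holds the most recent node seen at depth d
    for depth, label in seq[1:]:
        node = (label, [])
        del stack[depth:]                 # drop nodes deeper than the parent
        stack[depth - 1][1].append(node)  # attach to the parent at depth - 1
        stack.append(node)
    return root

class PatternCounts:
    """Incrementally maintained counts of encoded query patterns (assumed design)."""

    def __init__(self):
        self.counts = Counter()
        self.total = 0

    def add(self, tree):
        self.counts[encode(tree)] += 1
        self.total += 1

    def remove(self, tree):
        key = encode(tree)
        if self.counts[key] == 0:
            raise KeyError("pattern not present in the log")
        self.counts[key] -= 1
        self.total -= 1

    def frequent(self, min_support):
        """Encoded patterns whose count is at least min_support * total."""
        threshold = min_support * self.total
        return {s for s, c in self.counts.items() if c > 0 and c >= threshold}

q1 = ("book", [("title", []), ("author", [("name", [])])])
q2 = ("book", [("price", [])])

log = PatternCounts()
for q in (q1, q1, q2):
    log.add(q)
before = log.frequent(0.5)   # only q1 reaches half of three queries
log.remove(q1)
after = log.frequent(0.5)    # with two queries left, q2 becomes frequent too
```

In this toy log, removing one query changes which patterns meet the support threshold without any rescan, which is the kind of update the abstract says can invalidate existing frequent patterns or create new ones.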


Journal: Data Mining and Knowledge Discovery
Publication Date: Feb 12, 2009
Citations: 38

Similar Papers

Incremental Mining of Frequent Query Patterns from XML Queries for Caching
Guoliang Li ... Yong Zhang
01 Dec 2006

BUXMiner: An Efficient Bottom-Up Approach to Mining XML Query Patterns
Yijun Bei ... Jinxiang Dong
16 Jun 2007

Discovery of Frequent Query Patterns in XML Pattern Graph with DTD Cardinality Constraints
Yunfeng Liu ... Tengjiao Wang
01 Jan 2008

A Parallel Encoding Method of XML User Query Patterns
Tsui-Ping Chang
01 Jul 2016
