Indexing useful structural patterns for XML query processing

Wang Lian, D.W. Cheung, S.M. Yiu, N. Mamoulis

doi:10.1109/tkde.2005.110

Open Access

Abstract

Queries on semistructured data are hard to process due to the complex nature of the data and call for specialized techniques. Existing path-based indexes and query processing algorithms are not efficient for searching complex structures beyond simple paths, even when the queries are highly selective. We introduce the definition of minimal infrequent structures (MIS), which are structures that 1) exist in the data, 2) are not frequent with respect to a support threshold, and 3) have only frequent proper substructures. By indexing the occurrences of MIS, we can efficiently locate the highly selective substructures of a query, improving search performance significantly. We propose an efficient data mining algorithm that finds the minimal infrequent structures. Their occurrences in the XML data are then indexed by a lightweight data structure and used as a fast filter step in query evaluation. We validate the efficiency and applicability of our methods through experimentation on both synthetic and real data.
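
The three conditions that define an MIS, and the use of MIS occurrences as a filter, can be illustrated with a small Python sketch. It rests on a strong simplification that is not in the paper: each XML document is reduced to a set of element labels, and a "structure" is a set of labels, so "substructure" means "subset". The paper works on structural patterns, and this sketch is neither the authors' mining algorithm nor their index. The function names `mine_mis` and `filter_candidates` and the sample documents are hypothetical.

```python
from itertools import combinations


def support(structure, docs):
    """Ids of the documents that contain every label in `structure`."""
    return {doc_id for doc_id, labels in docs.items() if structure <= labels}


def mine_mis(docs, min_support):
    """Level-wise search (Apriori style, an assumption of this sketch).

    Returns {structure: ids of documents containing it} for every structure
    that occurs in the data, is infrequent, and has only frequent proper
    subsets.
    """
    all_labels = sorted(set().union(*docs.values()))
    mis_index = {}
    candidates = [frozenset([label]) for label in all_labels]
    while candidates:
        size = len(candidates[0])
        frequent = set()
        for cand in candidates:
            occurrences = support(cand, docs)
            if len(occurrences) >= min_support:
                frequent.add(cand)
            elif occurrences:
                # Occurs, is infrequent, and every smaller subset was
                # frequent (guaranteed by how candidates are generated).
                mis_index[cand] = occurrences
        # Next level: only sets whose subsets one size smaller are all frequent.
        joined = {a | b for a in frequent for b in frequent
                  if len(a | b) == size + 1}
        candidates = [c for c in joined
                      if all(frozenset(s) in frequent
                             for s in combinations(c, size))]
    return mis_index


def filter_candidates(query, mis_index, docs):
    """Filter step: documents that may match `query`.

    A document containing the query must contain every MIS inside the query,
    so the occurrence lists of those MIS can be intersected. Without such an
    MIS the filter cannot prune and returns all document ids.
    """
    hits = [occ for mis, occ in mis_index.items() if mis <= query]
    if not hits:
        return set(docs)
    return set.intersection(*hits)


# Hypothetical sample data: document id -> element labels it contains.
docs = {
    1: {"book", "title", "author"},
    2: {"book", "title", "price"},
    3: {"book", "title", "author", "price"},
    4: {"book", "author", "editor"},
}
mis_index = mine_mis(docs, min_support=2)
selective = filter_candidates({"book", "author", "price"}, mis_index, docs)
unselective = filter_candidates({"book", "title"}, mis_index, docs)
```

With a support threshold of 2, the sketch finds two MIS in the sample data: `{editor}` and `{author, price}`. A query containing `author` and `price` is narrowed to document 3 before any full matching is attempted, while a query made only of frequent labels is not pruned at all. The surviving documents would still have to be verified against the full query.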

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Jul 1, 2005
Citations: 36	License type: cc-by-nc-nd
