XML Tree Research Articles

In recent years, the inverted lists evaluation model, together with holistic stack-based algorithms, has been established as the most prominent technique for evaluating XML queries on large persistent XML data. In this framework, we use materialized views to optimize XML queries. We consider a novel approach which, instead of materializing the answer of a view, materializes exactly the inverted sublists that are necessary for computing the answer of the view. This allows view materializations to be stored as compressed bitmaps, a solution that minimizes the materialization space and lets optimization operations be performed as CPU-efficient bitwise operations. To realize the potential of bitmap materialized views in optimizing query performance, we define and address the following problem (the view configuration problem): given an XML tree and its schema, find a template of tree-pattern views (a view configuration) such that (a) the views of this configuration can answer all the queries that can be issued against the schema, (b) their materialization fits in the space provided, and (c) evaluating the queries using these views minimizes the overall query evaluation cost. We consider an instance of this problem for tree-pattern queries. Our intention is to find view configurations whose materializations are small enough to be stored in main memory. We find two candidate solution configurations and identify cases where views can be excluded from materialization in a configuration without affecting query performance. To compare our approach with one that can also support the optimization of every query on the schema, we implemented an improvement of a state-of-the-art approach based on structural indexes. Our experimental results show that our approach is stable, greatly improves on evaluating queries without materialized views, outperforms the structural index approach on all test cases, and is very close to the optimal. These results characterize our approach as the best candidate for supporting the optimization of queries in the framework of the inverted lists model.
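The idea of storing a view as a bitmap over an inverted list, rather than as a copy of the answer, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the node identifiers, the `author_list` data, the helper names, and the use of plain Python integers as uncompressed bitmaps. The abstract does not describe the actual encoding or compression scheme.

```python
def sublist_bitmap(inverted_list, selected):
    """Return an int whose bit i is set when inverted_list[i] is in selected."""
    selected = set(selected)
    bitmap = 0
    for i, node in enumerate(inverted_list):
        if node in selected:
            bitmap |= 1 << i
    return bitmap


def bitmap_to_sublist(inverted_list, bitmap):
    """Recover the inverted sublist that a bitmap stands for."""
    return [node for i, node in enumerate(inverted_list) if (bitmap >> i) & 1]


# Hypothetical inverted list for one element label, in document order.
author_list = [3, 8, 14, 21, 30]

# Two hypothetical views, each materialized as the sublist of nodes it needs.
view_v1 = sublist_bitmap(author_list, [3, 14, 30])
view_v2 = sublist_bitmap(author_list, [8, 14, 30])

# Nodes that appear in both views are found with a single bitwise AND.
both = view_v1 & view_v2
result = bitmap_to_sublist(author_list, both)
print(result)
```

Each view costs one bit per inverted-list entry before compression, and combining views reduces to integer operations instead of list merges.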

XML query languages typically allow the specification of structural patterns using XPath. Usually, these structural patterns take the form of trees (tree-pattern queries, TPQs). Finding the occurrences of such patterns in an XML tree is a key operation in XML query evaluation. The many algorithms previously proposed for this operation focus mainly on the evaluation of tree-pattern queries. Recently, requirements for flexible querying of XML data have motivated the consideration of query classes that are more expressive and flexible than TPQs, for which efficient non-main-memory evaluation algorithms are not known. In this paper, we consider a class of queries, called Partial Tree-Pattern Queries (PTPQs), which generalize and strictly contain TPQs. PTPQs represent a broad fragment of XPath which is very useful in practice. In order to process PTPQs, we introduce a set of sound and complete inference rules to characterize structural relationship derivation. We provide necessary and sufficient conditions for detecting query unsatisfiability and node redundancy. We also show that PTPQs can be represented as directed acyclic graphs augmented with "same-path" constraints. In order to leverage existing efficient evaluation algorithms for less expressive classes of queries, we design two approaches that evaluate a PTPQ by decomposing it into a set of simpler queries. Algorithm IndexTPQGen exploits a structural summary of the XML data and evaluates a PTPQ by generating an equivalent set of TPQs and unioning their answers. Algorithm PartialPathJoin decomposes the PTPQ into partial-path queries and merge-joins their solutions. We also develop PartialTreeStack, an original polynomial-time holistic algorithm for PTPQs. To the best of our knowledge, this is the first algorithm to support the evaluation of such a broad structural fragment of XPath in the inverted lists evaluation model. We provide a theoretical analysis of our algorithm and identify cases where it is asymptotically optimal. An extensive experimental evaluation shows that it is more efficient, robust, and stable than the other two, and that it outperforms a state-of-the-art XQuery engine on PTPQs.
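For readers unfamiliar with stack-based evaluation over inverted lists, the following is a minimal sketch of a binary ancestor-descendant join. It is not PartialTreeStack, IndexTPQGen, or PartialPathJoin. It only shows the basic building block that such algorithms refine. It assumes, hypothetically, that each node carries a region encoding `(start, end)` and that each inverted list is sorted by `start`. The function name and sample data are invented for illustration.

```python
def stack_join(ancestors, descendants):
    """Return all (a, d) pairs where a is an ancestor of d.

    Both inputs are lists of (start, end) regions sorted by start.
    Region a contains region d when a.start < d.start and d.end < a.end.
    """
    result = []
    stack = []  # chain of nested ancestor candidates, outermost first
    i = 0
    for d in descendants:
        # Push every ancestor candidate that starts before d.
        while i < len(ancestors) and ancestors[i][0] < d[0]:
            a = ancestors[i]
            # Drop candidates that end before a starts.
            while stack and stack[-1][1] < a[0]:
                stack.pop()
            stack.append(a)
            i += 1
        # Drop candidates that end before d starts.
        while stack and stack[-1][1] < d[0]:
            stack.pop()
        # Every region left on the stack contains d.
        for a in stack:
            result.append((a, d))
    return result


# Hypothetical inverted lists for two element labels.
a_list = [(1, 10), (2, 5), (11, 14)]
d_list = [(3, 4), (6, 7), (12, 13)]
pairs = stack_join(a_list, d_list)
print(pairs)
```

Each list is scanned once, and the stack never holds more entries than the depth of the tree, which is why this style of join suits large persistent data.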

Articles published on XML Tree

A Partial-tree-based Approach for XPath Query on Large XML Trees

Efficient Duplicate Detection and Elimination in Hierarchical Multimedia Data

Containment for Conditional Tree Patterns

Efficiently Deciding μ-Calculus with Converse over Finite Trees

XPath for DL Ontologies

XMap: A Novel Approach to Store and Retrieve XML Document in Relational Databases

The Three-dimensional Coding Based on the Cone for XML Under Weaving Multi-documents

An Accurate Identification of Extended XML Tree Pattern for XQuery Language

Massive XML Data Mining in Cloud Computing Environment

Efficiently Subtree Matching between XML and Probabilistic XML Documents

An Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees

Configuring bitmap materialized views for optimizing XML queries

Dividing Huge XML Trees Using the m-bridge Technique over One-to-one Corresponding Binary Trees

XPath fragments on XML in columns

A Streaming XML Hardware Parser Using a Tree with Failure Transitions

Algebraic incremental maintenance of XML views

XML tree structure compression using RePair

Research on Basic Operations for Query Probabilistic XML Document Based on Path Set

Processing and Evaluating Partial Tree Pattern Queries on XML Data

Partial Evaluation for Distributed XPath Query Processing and Beyond

