Distributed query evaluation on semistructured data

Dan Suciu

doi:10.1145/507234.507235

Abstract

Semistructured data is modeled as a rooted, labeled graph. The simplest kinds of queries on such data are those which traverse paths described by regular path expressions. More complex queries combine several regular path expressions, with complex data restructuring, and with sub-queries. This article addresses the problem of efficient query evaluation on distributed, semistructured databases. In our setting, the nodes of the database are distributed over a fixed number of sites, and the edges are classified into local (with both ends in the same site) and cross edges (with ends in two distinct sites). Efficient evaluation in this context means that the number of communication steps is fixed (independent on the data or the query), and that the total amount of data sent depends only on the number of cross links and of the size of the query's result. We give such algorithms in three different settings. First, for the simple case of queries consisting of a single regular expression; second, for all queries in a calculus for graphs based on structural recursion which in addition to regular path expressions can perform nontrivial restructuring of the graph; and third, for a class of queries we call select-where queries that combine pattern matching and regular path expressions with data restructuring and subqueries. This article also includes a discussion on how these methods can be used to derive efficient view maintenance algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Distributed query evaluation on semistructured data

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Database Systems

Lead the way for us

Journal: ACM Transactions on Database Systems	Publication Date: Mar 1, 2002
Citations: 102

Similar Papers

An effective query pruning technique for multiple regular path expressions
Chang-Won Park ... Chin-Wan Chung
The Journal of Systems & Software | VOL. 64
Chang-Won Park, et. al.Chang-Won Park ... Chin-Wan Chung
01 Dec 2002
The Journal of Systems & Software | VOL. 64

Query containment for conjunctive queries with regular expressions
Daniela Florescu ... Dan Suciu
-
Daniela Florescu, et. al.Daniela Florescu ... Dan Suciu
01 May 1998
01 May 1998

On the Efficient Processing Regular Path Expressions of an Enormous Volume of XML Data
Michal Krátký ... Václav Snášel
-
Michal Krátký, et. al.Michal Krátký ... Václav Snášel
03 Sep 2007
03 Sep 2007

Optimizing regular path expressions using graph schemas
M Fernandez ... D Suciu
-
M Fernandez, et. al.M Fernandez ... D Suciu
23 Feb 1998
23 Feb 1998

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Distributed query evaluation on semistructured data

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Database Systems