Recursive Queries Research Articles

Materialized views are excessively stored query execution results in the database. They can be used to partially or completely answer queries which will be further appeared instead of re-executing query from the scratch. There is a large number of published works that address the maintenance, especially incremental update, of materialized views and query rewriting for using those ones. Some of them support materialized views based on recursive query in datalog language. Although most of datalog queries can be transferred into SQL queries and vise versa but it is not the case for recursive queries. Recursive queries in the data log try to find all possible transitive closures. Recursive queries in SQL (Common Table Expression – CTE) return direct links but not transitive closures. In this paper, we propose efficient methods for incremental update of materialized views based on CTE; and then propose an algorithm for generating source codes in C language for any input SQL recursive queries. The synthesized source codes implement our proposed incremental update algorithms according to inserted/deleted/updated record set in the base tables. This paper focuses mainly on the recursive queries whose execution results are directed tree-structured data. The two cases of tree node are considered. In the first case, a child node has only one parent node and in the second case, a child node can have many parent nodes. Those two cases represent the two types of relationships between entities in real world, that are one–to–many and many–to–many, respectively. For the one–to–many relationships, the relationship data is accompanied with the records describing the child using some fields. Those fields are set as null in deleting a concrete relationship. For the many–to–many relationships, it is stored in a separate table and the concrete relationships are removed by deleting describing records from that table. Considering of enforcing referential integrity may help to reduce the searching space and therefore, help to improve the performance. However, the set of tree nodes or tree edges can be manipulated. All those combinations lead to different algorithms. The experimental results are provided and discussed to confirm the effectiveness of our proposed methods

Read full abstract

Querying graphs and conducting graph analytics become important in data processing since many real applications are dealing with massive graphs, such as online social networks, Semantic Web, knowledge graphs, etc. Over the years, many distributed graph processing systems have been developed to support graph analytics using various programming models, and many graph querying languages have been proposed. A natural question that arises is how to integrate graph data and traditional non-graph data in a distributed system for users to conduct analytics. There are two issues. One issue is related to expressiveness on how to specify graph analytics as well as data analytics by a querying language. The other issue is related to efficiency on how to process analytics in a distributed system. For the first issue, SQL is a best candidate, since SQL is a well-accepted language for data processing. We concentrate on SQL for graph analytics. Our early work shows that graph analytics can be supported by SQL in a way from “semiring + while” to “relational algebra + while” via the enhanced recursive SQL queries. In this article, we focus on the second issue on how to process such enhanced recursive SQL queries based on the GAS ( Gather - Apply - Scatter ) model under which efficient graph processing systems can be developed. To demonstrate the efficiency, we implemented a system by tightly coupling Spark SQL and GraphX on Spark which is one of the most popular in-memory data-flow processing platforms. First, we enhance Spark SQL by adding the capability of supporting the enhanced recursive SQL queries for graph analytics. In this regard, graph analytics can be processed using a distributed SQL engine alone. Second, we further propose new transformation rules to optimize/translate the operations for recursive SQL queries to the operations by GraphX . In this regard, graph analytics by SQL can be processed in a similar way as done by a distributed graph processing system using the APIs provided by the system. We conduct extensive performance studies to test graph analytics using large real graphs. We show that our approach can achieve similar or even higher efficiency, in comparison to the built-in graph algorithms in the existing graph processing systems.

Read full abstract

Recursive Queries Research Articles

Related Topics

Articles published on Recursive Queries

The Complexity of Why-Provenance for Datalog Queries

INTERFACES OF VIRTUAL DATA STORAGE IN THE CONDITIONS OF A FAST-CHANGING INFORMATION ENVIRONMENT

Convergence of datalog over (Pre-) Semirings

Optimizing Nested Recursive Queries

Convergence of Datalog over (Pre-) Semirings

On Monotonic Determinacy and Rewritability for Recursive Queries and Views

Optimizing differentially-maintained recursive queries on dynamic graphs

PRESERVATION OF HIERARCHY STRUCTURES IN RELATIVE DATABASES

Query Rewriting for Horn-SHIQ Plus Rules

Temel Parametreleri ve Algoritmalarıyla Bir Gıda İzlenebilirliği Veritabanı Modeli

Object traceability graph: Applying temporal graph traversals for efficient object traceability

Recursion in SPARQL

Blockzone: A Decentralized and Trustworthy Data Plane for DNS

Distribution Policies for Datalog

Technical Perspective for

A solution for synchronous incremental maintenance of materialized views based on SQL recursive query

A Case for Stale Synchronous Distributed Model for Declarative Recursive Computation

Scaling-up in-memory datalog processing

SQL-G: Efficient Graph Analytics by SQL

On Safety of Unary and Non-unary IFP-operators

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Recursive Queries Research Articles

Related Topics

Articles published on Recursive Queries

The Complexity of Why-Provenance for Datalog Queries

INTERFACES OF VIRTUAL DATA STORAGE IN THE CONDITIONS OF A FAST-CHANGING INFORMATION ENVIRONMENT

Convergence of datalog over (Pre-) Semirings

Optimizing Nested Recursive Queries

Convergence of Datalog over (Pre-) Semirings

On Monotonic Determinacy and Rewritability for Recursive Queries and Views

Optimizing differentially-maintained recursive queries on dynamic graphs

PRESERVATION OF HIERARCHY STRUCTURES IN RELATIVE DATABASES

Query Rewriting for Horn-SHIQ Plus Rules

Temel Parametreleri ve Algoritmalarıyla Bir Gıda İzlenebilirliği Veritabanı Modeli

Object traceability graph: Applying temporal graph traversals for efficient object traceability

Recursion in SPARQL

Blockzone: A Decentralized and Trustworthy Data Plane for DNS

Distribution Policies for Datalog

Technical Perspective for

A solution for synchronous incremental maintenance of materialized views based on SQL recursive query

A Case for Stale Synchronous Distributed Model for Declarative Recursive Computation

Scaling-up in-memory datalog processing

SQL-G: Efficient Graph Analytics by SQL

On Safety of Unary and Non-unary IFP-operators