Join Queries Research Articles

Most cloud service providers offer limited data privacy guarantees, discouraging clients from using them for managing their sensitive data. Cloud providers may use servers with Trusted Execution Environments (TEEs) to protect outsourced data while supporting remote querying. However, TEEs may leak access patterns and allow communication volume attacks, enabling an honest-but-curious cloud provider to learn sensitive information. Oblivious algorithms can be used to completely hide data access patterns, but their high overhead could render them impractical. To alleviate this overhead, the notion of Differential Obliviousness (DO) has recently been proposed. DO applies differential privacy (DP) to access patterns while hiding the communication volume of intermediate and final results; it does so by trading some level of privacy for efficiency. We present Doquet: Differentially Oblivious Range and Join Queries with Private Data Structures, a framework for DO outsourced database systems. Doquet is the first approach that supports private data structures, indices, selection, foreign key join, many-to-many join, and their composition select-join in a realistic TEE setting, even when the accesses to the private memory can be eavesdropped on by the adversary. We prove that the algorithms in Doquet satisfy differential obliviousness. Furthermore, we implemented Doquet and tested it on a machine with second-generation Intel SGX (a TEE); the results show that Doquet offers up to an order of magnitude speedup in comparison with other fully oblivious and differentially oblivious approaches.


Join query evaluation with ordering is a fundamental data processing task in relational database management systems. SQL and custom graph query languages such as Cypher offer this functionality by allowing users to specify the order via the ORDER BY clause. In many scenarios, users also want to see the first k results quickly (expressed by the LIMIT clause), but the value of k is not predetermined because user queries arrive in an online fashion. Recent work has made considerable progress in identifying optimal algorithms for ranked enumeration of join queries that do not contain any projections. In this paper, we initiate the study of the problem of enumerating results in ranked order for queries with projections. Our main result shows that for any acyclic query, it is possible to obtain a near-linear (in the size of the database) delay algorithm after only a linear time preprocessing step for two important ranking functions: sum and lexicographic ordering. For a practical subset of acyclic queries known as star queries, we show an even stronger result that gives the user a smooth tradeoff between preprocessing time and answering time: more preprocessing yields faster answering time guarantees. Our results are also extensible to queries containing cycles and unions. We also perform a comprehensive experimental evaluation to demonstrate that our algorithms, which are simple to implement, improve running time by up to three orders of magnitude over state-of-the-art algorithms implemented within open-source RDBMS and specialized graph databases.
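To make the problem statement above concrete, here is a minimal Python sketch of ranked enumeration for one small case: the join of R(a, b) and S(b, c), projected onto (a, c) and ordered by the sum a + c. It is an illustration only, not the algorithm from the paper, and it carries none of the paper's delay guarantees. The function name `ranked_join`, the relation layout, and the heap-based frontier search are all assumptions made for this example.

```python
import heapq
from collections import defaultdict
from itertools import islice


def ranked_join(R, S):
    """Lazily yield distinct (a, c) from R(a, b) JOIN S(b, c), ordered by a + c.

    Illustrative sketch only; assumes a and c are numbers.
    """
    r_by_key, s_by_key = defaultdict(set), defaultdict(set)
    for a, b in R:
        r_by_key[b].add(a)
    for b, c in S:
        s_by_key[b].add(c)

    # One sorted list of a-values and of c-values per join key found in both inputs.
    groups = [(sorted(r_by_key[b]), sorted(s_by_key[b]))
              for b in r_by_key if b in s_by_key]

    # Heap entries are (a + c, a, c, group, i, j); seed with each group's smallest pair.
    heap = [(As[0] + Cs[0], As[0], Cs[0], g, 0, 0)
            for g, (As, Cs) in enumerate(groups)]
    heapq.heapify(heap)
    pushed = {(g, 0, 0) for g in range(len(groups))}

    last = None
    while heap:
        _, a, c, g, i, j = heapq.heappop(heap)
        # Entries pop in nondecreasing (a + c, a, c) order, so the same projected
        # pair reached through different join keys pops consecutively.
        if (a, c) != last:
            last = (a, c)
            yield a, c
        As, Cs = groups[g]
        # Expand the frontier inside this group: next a-value or next c-value.
        for ni, nj in ((i + 1, j), (i, j + 1)):
            if ni < len(As) and nj < len(Cs) and (g, ni, nj) not in pushed:
                pushed.add((g, ni, nj))
                heapq.heappush(heap, (As[ni] + Cs[nj], As[ni], Cs[nj], g, ni, nj))


R = [(1, "x"), (5, "x"), (1, "y"), (2, "y")]
S = [("x", 10), ("x", 20), ("y", 10), ("y", 3)]
top3 = list(islice(ranked_join(R, S), 3))  # the caller picks k after the fact
```

Because the generator is lazy, the caller can stop after any number of results, which mirrors a LIMIT k whose value is not known in advance. In this sketch a long run of duplicate projected pairs can delay the next distinct answer, which is the difficulty that projections introduce.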


Articles published on Join Queries

Advanced Join Query Optimization Using Firefly and Reinforcement Learning Techniques on TPC-H Dataset

Continual Observation of Joins under Differential Privacy

Classic distance join queries using compact data structures

Thorough Data Pruning for Join Query in Database System

ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-Oriented Sample Size Allocation and Data Generation

Doquet: Differentially Oblivious Range and Join Queries with Private Data Structures

Technical Perspective: Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems

Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems

In-Network Processing of Skyline Join Queries in Wireless Sensor Networks Using Synopses of Skyline Attribute Value Ranges

JQPro: Join Query Processing in a Distributed System for Big RDF Data Using the Hash-Merge Join Technique

Efficient distributed algorithms for distance join queries in spark-based spatial analytics systems

Survey on Exact kNN Queries over High-Dimensional Data Space

Comparative Evaluation of Techniques for n-way Stream Joins in Wireless Sensor Networks

Ranked enumeration of join queries with projections

Join queries optimization in the distributed databases using a hybrid multi-objective algorithm

Join query optimisation in the distributed databases using a hybrid harmony search and artificial bee colony algorithm

Evaluating Top-N Join Queries with Real-time Entity Resolution

Data Search Using Hash Join Query and Nested Join Query

Improving Distance-Join Query processing with Voronoi-Diagram based partitioning in SpatialHadoop
