Towards optimization of RDF analytical queries on MapReduce

Padmashree Ravindra

doi:10.1109/icdew.2014.6818351

Abstract

The broadened use of Semantic Web technologies across domains has led to a shift in focus from simple pattern matching queries on RDF data to analytical queries with complex grouping and aggregations. An RDF analytical query involves graph pattern matching, which translates to several join operations due to the fine-grained nature of RDF data model. Complex analytical queries involve multiple grouping-aggregations on different graph patterns, making such tasks join-intensive. Scale-out processing of RDF analytical queries on existing relational-style MapReduce platforms such as Apache Hive and Pig, results in lengthy execution workflows with multiple cycles of I/O and network transfer. Additionally, certain graph patterns result in avoidable redundancy in intermediate results, which negatively impacts processing costs. The PhD thesis summarized in this paper proposes a two-pronged approach to minimize the costs while processing RDF queries on MapReduce: an algebraic approach based on a Nested TripleGroup Data Model and Algebra that reinterprets graph pattern queries in a way that reduces the required number of map-reduce cycles, and special strategies to minimize the redundancy in intermediate data while processing certain graph patterns. The proposed techniques are integrated into Apache Pig. Empirical evaluation of this work for processing graph pattern queries show 45-60% performance gains over systems such as Pig and Hive.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Towards optimization of RDF analytical queries on MapReduce

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Type-based Semantic Optimization for Scalable RDF Graph Pattern Matching
Hyeongsik Kim ... Kemafor Anyanwu
-
Hyeongsik Kim, et. al.Hyeongsik Kim ... Kemafor Anyanwu
03 Apr 2017
03 Apr 2017

To nest or not to nest, when and how much
Padmashree Ravindra ... Kemafor Anyanwu
-
Padmashree Ravindra, et. al.Padmashree Ravindra ... Kemafor Anyanwu
20 May 2012
20 May 2012

Adding regular expressions to graph reachability and pattern queries
Wenfei Fan ... Yinghui Wu
Frontiers of Computer Science | VOL. 6
Wenfei Fan, et. al.Wenfei Fan ... Yinghui Wu
01 Jun 2012
Frontiers of Computer Science | VOL. 6

Adding regular expressions to graph reachability and pattern queries
Wenfei Fan ... Nan Tang
-
Wenfei Fan, et. al.Wenfei Fan ... Nan Tang
01 Apr 2011
01 Apr 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards optimization of RDF analytical queries on MapReduce

Abstract

Talk to us

Similar Papers