Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce

Huayu Wu

doi:10.1007/978-3-319-10085-2_16

Abstract

Processing XML queries over big XML data using MapReduce has been studied in recent years. However, the existing works focus on partitioning XML documents and distributing XML fragments into different compute nodes. This attempt may introduce high overhead in XML fragment transferring from one node to another during MapReduce execution. Motivated by the structural join based XML query processing approach, which uses only related inverted lists to process queries in order to reduce I/O cost, we propose a novel technique to use MapReduce to distribute labels in inverted lists in a computing cluster, so that structural joins can be parallelly performed to process queries. We also propose an optimization technique to reduce the computing space in our framework, to improve the performance of query processing. Last, we conduct experiment to validate our algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Structural Join and Staircase Join Algorithms of Sibling Relationship
Chang-Xuan Wan ... Xi-Ping Liu
Journal of Computer Science and Technology | VOL. 22
Chang-Xuan Wan, et. al.Chang-Xuan Wan ... Xi-Ping Liu
01 Mar 2007
Journal of Computer Science and Technology | VOL. 22

Fast Structural Join with a Location Function
Nan Tang ... Kam-Fai Wong
-
Nan Tang, et. al.Nan Tang ... Kam-Fai Wong
01 Jan 2006
01 Jan 2006

Structural Join Algorithm for Sequential Regular Path Expressions
Kevin Lu
Journal of Computing and Information Technology | VOL. 12
Kevin LuKevin Lu
01 Jan 2004
Journal of Computing and Information Technology | VOL. 12

Estimating XML Structural Join Size Quickly and Economically
Cheng Luo ... Zhewei Jiang
-
Cheng Luo, et. al. Cheng Luo ... Zhewei Jiang
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce

Abstract

Talk to us

Similar Papers