Parallel Prime Number Labeling of Large XML Data Using MapReduce

Jinhyun Ahn,Hong-Gee Kim,Dong-Hyuk Im,Taewhi Lee

doi:10.1109/icitcs.2016.7740360

Abstract

Massive XML (Extensible Markup Language) data are available on the web. XML data labeling schemes have been suggested for structural query processing of massive XML data. Notable schemes include interval- based, prefix-based, and prime number-based labeling schemes. Of these, the prime number labeling scheme has the advantage of query processing by simple arithmetic operations. However, a parallel algorithm for this scheme does not exist. The requirement that all parents' labels have to be multiplied to obtain the label of a node makes it difficult to label XML data in a parallel fashion. To address the issue, in this paper, we propose a cluster-based technique wherein all parent nodes for a node are aggregated to compute its label by two-step MapReduce jobs. Our experiments on real-world XML datasets showed the advantages over a single machine-based system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parallel Prime Number Labeling of Large XML Data Using MapReduce

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A dynamic and parallel approach for repetitive prime labeling of XML with MapReduce
Jinhyun Ahn ... Hong-Gee Kim
The Journal of Supercomputing | VOL. 73
Jinhyun Ahn, et. al.Jinhyun Ahn ... Hong-Gee Kim
05 Jul 2016
The Journal of Supercomputing | VOL. 73

Efficient Storage and Parallel Query of Massive XML Data in Hadoop
Wei Yan
-
Wei YanWei Yan
01 Jan 2019
01 Jan 2019

A MapReduce-Based Approach for Prefix-Based Labeling of Large XML Data
Jinhyun Ahn ... Hong-Gee Kim
-
Jinhyun Ahn, et. al.Jinhyun Ahn ... Hong-Gee Kim
01 Jan 2015
01 Jan 2015

Efficient Query Processing for Large XML Data in Distributed Environments
Hiroto Kurita ... Jun Miyazaki
-
Hiroto Kurita, et. al.Hiroto Kurita ... Jun Miyazaki
01 May 2007
01 May 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parallel Prime Number Labeling of Large XML Data Using MapReduce

Abstract

Talk to us

Similar Papers