Data placement and query processing based on RPE parallelisms

Yaxin Yu Yaxin Yu,Guoren Wang Guoren Wang,Nan Tang Nan Tang,Gang Wu Gang Wu,Ge Yu Ge Yu,Junan Hu Junan Hu

doi:10.1109/cmpsac.2003.1245335

Abstract

The basic idea behind parallel database systems is to perform operations in parallel to reduce the response time and improve the system throughput. Data placement is a key factor on the performance of parallel database systems. This paper proposes two data partition strategies to decluster XML documents with very large size, path schema based path instance balancing (PSPIB) strategy, in which all path instances with the same path schema in a data tree are declustered evenly over all sites, and node schema based node round-robin (NSNRR) strategy, in which all node objects with the same node schema in a data tree are declustered over all sites in a round-robin way. Accordingly, two query processing algorithms are proposed based on the two partition methods, parallel path merge (PPM) algorithm and parallel pipelining path join (PPPJ) algorithm. The performance analysis and evaluation on the two data placement strategies and corresponding query processing algorithms are given in this paper.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data placement and query processing based on RPE parallelisms

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Graph-Based Parallel Query Processingand Optimization Strategies for Object-Oriented Databases
Stanley Y.W Su ... Naoki Akaboshi
-
Stanley Y.W Su, et. al.Stanley Y.W Su ... Naoki Akaboshi
01 Jan 1998
01 Jan 1998

Parallel Database Techniques

Scalable Computing Practice and Experience | VOL. 4

03 Jan 2001
Scalable Computing Practice and Experience | VOL. 4

K-Nearest Neighbor Query Processing Algorithms for a Query Region in Road Networks
Hyeong-Il Kim ... Jae-Woo Chang
Journal of Computer Science and Technology | VOL. 28
Hyeong-Il Kim, et. al.Hyeong-Il Kim ... Jae-Woo Chang
01 Jul 2013
Journal of Computer Science and Technology | VOL. 28

ODCP: Optimizing Data Caching and Placement in Distributed File System Using Erasure Coding
Shuhan Wu ... Yunchun Li
-
Shuhan Wu, et. al.Shuhan Wu ... Yunchun Li
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data placement and query processing based on RPE parallelisms

Abstract

Talk to us

Similar Papers