Optimizing Communication for Multi-Join Query Processing in Cloud Data Warehouses

Swathi Kurunji,Cindy X Chen,Xinwen Fu,Benyuan Liu,Tingjian Ge

doi:10.4018/ijghpc.2013100108

Abstract

In this paper, the authors present storage structures, PK-map and Tuple-index-map, to improve the performance of query execution and inter-node communication in Cloud Data Warehouses. Cloud Data Warehouses require Read-Optimized databases because large amount of historical data are integrated on a regular basis to facilitate analytical applications for report generation, future analysis, and decision-making. This frequent data integration can grow the data size rapidly and hence there is a need to allocate resource dynamically on demand. As resource is scaled-out in the cloud environment, the number of nodes involved in the execution of a query increases. This in turn increases the number of inter-node communications. In queries, join operation between two different tables are most common. To perform the join operation of a query in the cloud environment, data need to be transferred among different nodes. This becomes critical when there is a huge amount of data (in Terabytes or Petabytes) stored across a large number of nodes. With the increase in number of nodes and amount of data, the size of the communication messages also increases, resulting in even increased bandwidth usage and performance degradation. In this paper, the authors show through extensive experiments using PlanetLab Cloud that their proposed storage structures PK-map and Tuple-index-map, and query execution algorithms improve the performance of join queries, decrease inter-node communication and workload in Cloud Data Warehouses.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimizing Communication for Multi-Join Query Processing in Cloud Data Warehouses

Abstract

Talk to us

Similar Papers

More From: International Journal of Grid and High Performance Computing

Lead the way for us

Journal: International Journal of Grid and High Performance Computing	Publication Date: Oct 1, 2013
Citations: 5

Similar Papers

Communication cost optimization for cloud Data Warehouse queries
Swathi Kurunji ... Tingjian Ge
-
Swathi Kurunji, et. al.Swathi Kurunji ... Tingjian Ge
01 Dec 2012
01 Dec 2012

Traditional Vs Cloud Data warehouse – Comparative Analysis & Survey paper
Robbia Gulnar
Journal of Computing and Artificial Intelligence | VOL. 1
Robbia Gulnar Robbia Gulnar
23 May 2023
Journal of Computing and Artificial Intelligence | VOL. 1

Modern approaches to data storage: comparison of relational and cloud data warehouses using etl and elt methods
N.I Boyko ... A.V Chernenko
Reporter of the Priazovskyi State Technical University. Section: Technical sciences | VOL. -
N.I Boyko, et. al.N.I Boyko ... A.V Chernenko
27 Jun 2024
Reporter of the Priazovskyi State Technical University. Section: Technical sciences | VOL. -

A Comparative Analysis of Traditional and Cloud Data Warehouse
Khawaja Ubaid Ur Rehman ... Umair Ahmad
VAWKUM Transactions on Computer Sciences | VOL. 15
Khawaja Ubaid Ur Rehman, et. al.Khawaja Ubaid Ur Rehman ... Umair Ahmad
30 Mar 2018
VAWKUM Transactions on Computer Sciences | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimizing Communication for Multi-Join Query Processing in Cloud Data Warehouses

Abstract

Talk to us

Similar Papers

More From: International Journal of Grid and High Performance Computing