Abstract

The Hadoop Distributed File System (HDFS) is the underlying storage engine of many Big Data processing frameworks, such as Hadoop MapReduce, HBase, Hive, and Spark. Even though HDFS is well known for its scalability and reliability, its requirement for a large amount of local storage space makes HDFS deployment challenging on HPC clusters. Moreover, HPC clusters usually have large installations of parallel file systems such as Lustre. In this study, we propose a novel design to integrate HDFS with Lustre through a high-performance key-value store. We design a burst buffer system using RDMA-based Memcached and present three schemes to integrate HDFS with Lustre through this buffer layer, considering different aspects of I/O, data locality, and fault tolerance. Our proposed schemes improve performance for Big Data applications on HPC clusters while reducing the local storage requirement. Performance evaluations show that our design can improve the write performance of TestDFSIO by up to 2.6x over HDFS and 1.5x over Lustre. The gain in read throughput is up to 8x. Sort execution time is reduced by up to 28% over Lustre and 19% over HDFS. Our design can also significantly benefit I/O-intensive workloads compared to both HDFS and Lustre.
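
To illustrate the general idea of the buffer layer described above, the following is a minimal conceptual sketch, not the authors' implementation: a block is first staged in a Memcached-based burst buffer (the fast path for the writer) and later drained to a Lustre mount point for persistence. The class name `BurstBufferWriter`, the mount path, and the block-key scheme are hypothetical; the paper's design uses RDMA-based Memcached, which preserves the standard Memcached API, so a plain Memcached client (spymemcached) is used here purely for illustration.

```java
// Hypothetical sketch of a burst-buffer write path: stage in Memcached, flush to Lustre.
import net.spy.memcached.MemcachedClient;

import java.io.IOException;
import java.net.InetSocketAddress;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class BurstBufferWriter {
    // Hypothetical Lustre mount point on the HPC cluster.
    private static final Path LUSTRE_MOUNT = Paths.get("/mnt/lustre/hdfs-data");

    private final MemcachedClient buffer;

    public BurstBufferWriter(String memcachedHost, int port) throws IOException {
        this.buffer = new MemcachedClient(new InetSocketAddress(memcachedHost, port));
    }

    /** Stage a block in the key-value burst buffer (fast path for the writer). */
    public void stageBlock(String blockId, byte[] data) {
        buffer.set(blockId, 0, data);   // 0 = no expiration while the block is staged
    }

    /** Drain a staged block to Lustre for persistence, then free the buffer entry. */
    public void flushToLustre(String blockId) throws IOException {
        byte[] data = (byte[]) buffer.get(blockId);
        if (data != null) {
            Files.write(LUSTRE_MOUNT.resolve(blockId), data);
            buffer.delete(blockId);
        }
    }

    public void close() {
        buffer.shutdown();
    }
}
```

In such a scheme, the key-value buffer absorbs bursty writes, while Lustre (rather than node-local disks) provides the persistent backing store, which is the source of the reduced local storage requirement claimed in the abstract.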
