A Data-Aware Remote Procedure Call Method for Big Data Systems

Jin Wang,Jingyu Zhang,Xiaofeng Yu,Yaqiong Yang,Amr Tolba,Osama Alfarraj

doi:10.32604/csse.2020.35.523

Abstract

In recent years, big data has been one of the hottest development directions in the information field. With the development of artificial intelligence technology, mobile smart terminals and high-bandwidth wireless Internet, various types of data are increasing exponentially. Huge amounts of data contain a lot of potential value, therefore how to effectively store and process data efficiently becomes very important. Hadoop Distributed File System (HDFS) has emerged as a typical representative of dataintensive distributed big data file systems, and it has features such as high fault tolerance, high throughput, and can be deployed on low-cost hardwares. HDFS nodes communicate with each other to make the big data systems work properly, using the Remote Procedure Call (RPC) mechanism. However, the RPC in HDFS is still not good enough to work better in terms of network throughput and abnormal response. This paper presents an optimization method to improve the performance of HDFS. The proposed method dynamically adjusts the RPC configurations between NameNode and DataNodes by sensing the data characters that stored in DataNodes. This method can effectively reduce the NameNode processing pressure, and improve the network throughput generated by the information transmission between NameNode and DataNodes. It can also reduce the abnormal response time of the whole system. Finally, the extensive experiments show the effectiveness and efficiency of our proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computer Systems Science and Engineering	Publication Date: Jan 1, 2020
Citations: 8	License type: cc-by

R Discovery Prime

R Discovery Prime

A Data-Aware Remote Procedure Call Method for Big Data Systems

Abstract

Talk to us

Similar Papers

More From: Computer Systems Science and Engineering

Lead the way for us

Similar Papers

Remote Procedure Call Optimization of Big Data Systems Based on Data Awareness
Jin Wang ... Yaqiong Yang
-
Jin Wang, et. al.Jin Wang ... Yaqiong Yang
01 Dec 2020
01 Dec 2020

An Adaptive RPC Mechanism for Performance and Node Fault Tolerance Optimization in HDFS
Jingyu Zhang ... Lailong Luo
-
Jingyu Zhang, et. al.Jingyu Zhang ... Lailong Luo
01 Dec 2022
01 Dec 2022

Locality Sensitive Hashing based incremental clustering for creating affinity groups in Hadoop — HDFS - An infrastructure extension
A Kala Karun ... K Chitharanjan
-
A Kala Karun, et. al.A Kala Karun ... K Chitharanjan
01 Mar 2013
01 Mar 2013

ERP: An enhanced read policy for HDFS to improve read performance for files under construction
Junjie He ... Fei Hu
-
Junjie He, et. al. Junjie He ... Fei Hu
01 Dec 2015
01 Dec 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Data-Aware Remote Procedure Call Method for Big Data Systems

Abstract

Talk to us

Similar Papers

More From: Computer Systems Science and Engineering