Enhancing throughput of the Hadoop Distributed File System for interaction-intensive tasks

Xiayu Hua,Hao Wu,Zheng Li,Shangping Ren

doi:10.1016/j.jpdc.2014.03.010

Abstract

The Hadoop Distributed File System (HDFS) is designed to run on commodity hardware and can be used as a stand-alone general purpose distributed file system (Hdfs user guide, 2008). It provides the ability to access bulk data with high I/O throughput. As a result, this system is suitable for applications that have large I/O data sets. However, the performance of HDFS decreases dramatically when handling the operations of interaction-intensive files, i.e., files that have relatively small size but are frequently accessed. The paper analyzes the cause of throughput degradation issue when accessing interaction-intensive files and presents an enhanced HDFS architecture along with an associated storage allocation algorithm that overcomes the performance degradation problem. Experiments have shown that with the proposed architecture together with the associated storage allocation algorithm, the HDFS throughput for interaction-intensive files increases 300% on average with only a negligible performance decrease for large data set tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing throughput of the Hadoop Distributed File System for interaction-intensive tasks

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing

Lead the way for us

Journal: Journal of Parallel and Distributed Computing	Publication Date: Apr 3, 2014
Citations: 24

Similar Papers

Blockchain Enabled Hadoop Distributed File System Framework for Secure and Reliable Traceability
Manish Kumar Gupta ... Rajendra Kumar Dwivedi
ADCAIJ: Advances in Distributed Computing and Artificial Intelligence Journal | VOL. 12
Manish Kumar Gupta, et. al.Manish Kumar Gupta ... Rajendra Kumar Dwivedi
29 Dec 2023
ADCAIJ: Advances in Distributed Computing and Artificial Intelligence Journal | VOL. 12

A Novel Approach for Improving Security and Storage Efficiency on HDFS
Yannan Ma ... Sidan Du
Procedia Computer Science | VOL. 52
Yannan Ma, et. al.Yannan Ma ... Sidan Du
01 Jan 2015
Procedia Computer Science | VOL. 52

Locality Sensitive Hashing based incremental clustering for creating affinity groups in Hadoop — HDFS - An infrastructure extension
A Kala Karun ... K Chitharanjan
-
A Kala Karun, et. al.A Kala Karun ... K Chitharanjan
01 Mar 2013
01 Mar 2013

Hadoop Ecosystem and Its Analysis on Tweets
Can Uzunkaya ... Yusuf Kavurucu
Procedia - Social and Behavioral Sciences | VOL. 195
Can Uzunkaya, et. al.Can Uzunkaya ... Yusuf Kavurucu
01 Jul 2015
Procedia - Social and Behavioral Sciences | VOL. 195

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing throughput of the Hadoop Distributed File System for interaction-intensive tasks

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing