EDRFS: An Effective Distributed Replication File System for Small-File and Data-Intensive Application

Bin Cai,Changsheng Xie,Guangxi Zhu

doi:10.1109/comswa.2007.382422

Abstract

With the system scale keeping grown, the key challenge is to mask the failures that arise among the system components and to improve the performance of data-intensive applications. This paper designs and implements a cluster-based distributed replication file system EDRFS to meet these critical demands. EDRFS works with a single metadata server and multiple storage nodes, deploys whole-file replication scheme at the file level, and tracks what storage node a file is replicated on. We use a linear hash algorithm to evenly distribute data and load across multiple storage nodes so as to achieve balancing workload and incremental scalability of throughput and storage capacity as the system scale grows. In addition, we employ metadata caches and file data caches in clients to enhance system performance. Furthermore, we deploy a concurrency lock scheme to avoid namespace operation bottleneck and a replicas consistency method to keep a consistent mutation order among replicas of a file. We provide the initial experimental evaluations of our prototypical system on a small-file and data-intensive workload.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

EDRFS: An Effective Distributed Replication File System for Small-File and Data-Intensive Application

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Proofs of Physical Reliability for Cloud Storage Systems
Li Li ... Loukas Lazos
IEEE Transactions on Parallel and Distributed Systems | VOL. 31
Li Li, et. al.Li Li ... Loukas Lazos
01 May 2020
IEEE Transactions on Parallel and Distributed Systems | VOL. 31

MAD2: A scalable high-throughput exact deduplication approach for network backup services
Jiansheng Wei ... Dan Feng
-
Jiansheng Wei, et. al.Jiansheng Wei ... Dan Feng
01 May 2010
01 May 2010

NCFS: On the Practicality and Extensibility of a Network-Coding-Based Distributed File System
Yuchong Hu ... Patrick P C Lee
-
Yuchong Hu, et. al.Yuchong Hu ... Patrick P C Lee
01 Jul 2011
01 Jul 2011

XORInc: Optimizing Data Repair and Update for Erasure-Coded Systems with XOR-Based In-Network Computation
Fang Wang ... Yanwen Xie
-
Fang Wang, et. al.Fang Wang ... Yanwen Xie
01 May 2019
01 May 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

EDRFS: An Effective Distributed Replication File System for Small-File and Data-Intensive Application

Abstract

Talk to us

Similar Papers