Abstract

In past few years Hadoop Distributed File System (HDFS) has been used by many organizations with gigantic data sets and streams of operations on it. HDFS provides distinct features like, high fault tolerance, scalability, etc. The Name Node machine is a single point of failure (SPOF) for a HDFS cluster. If the Name Node machine fails, the system needs to be re-started manually, making the system less available. This paper proposes a highly available architecture and its working principle for the HDFS Name Node against its SPOF utilizing well-known 2-Phase Commit (2PC) Protocol and Election algorithm.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call