Abstract

In recent years, significant research has been conducted to boost the performance and increase the reliability of high performance computing (HPC) clusters. As the number of compute nodes in modern HPC clusters continues to grow, it is critical to design clusters with low power consumption and low failure rate. In particular, it is widely known that the internal disk drives of compute nodes (in the case of diskfull clusters) are a major source of failures. In addition, these diskfull HPC clusters tend to require more power and cooling requirements compared to diskless clusters. In this paper, we propose and implement a large-scale Infiniband-based diskless HPC cluster. The paper presents the cluster configuration and evaluates its performance using various High Performance LINPACK (HPL) benchmarks. The performance is measured in terms of the overall efficiency, speed in Giga-Floating Point Operations per Second (GFLOPS), and HPL execution time. We also measure temperature and power consumption. We compare the performance measurements of our diskless cluster to its diskfull counterpart. For our measurement and comparison, we consider three cluster sizes of 32, 64, and 126 compute nodes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call