Architecture and performance of Perlmutter's 35 PB ClusterStor E1000 all‐flash file system

Glenn K Lockwood,Kirill Lozinskiy,David Paul,Nicholas J Wright,Lisa Gerhardt,Alberto Chiusole

doi:10.1002/cpe.8143

Abstract

SummaryNERSC's newest system, Perlmutter, features a 35 PB all‐flash Lustre file system built on HPE Cray ClusterStor E1000. We present its architecture, early performance figures, and performance considerations unique to this architecture. We demonstrate the performance of E1000 OSSes through low‐level Lustre tests that achieve over 90% of the theoretical bandwidth of the SSDs at the OST and LNet levels. We also show end‐to‐end performance for both traditional dimensions of I/O performance (peak bulk‐synchronous bandwidth) and nonoptimal workloads endemic to production computing (small, incoherent I/Os at random offsets) and compare them to NERSC's previous system, Cori, to illustrate that Perlmutter achieves the performance of a burst buffer and the resilience of a scratch file system. Finally, we discuss performance considerations unique to all‐flash Lustre and present ways in which users and HPC facilities can adjust their I/O patterns and operations to make optimal use of such architectures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Architecture and performance of Perlmutter's 35 PB ClusterStor E1000 all‐flash file system

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience

Lead the way for us

Journal: Concurrency and Computation: Practice and Experience	Publication Date: Jul 23, 2024
License type: other-oa

Similar Papers

Challenges and Opportunities of User-Level File Systems for HPC (Dagstuhl Seminar 17202)
...
-
, et. al. ...
01 Jan 2017
Challenges and Opportunities of User-Level File Systems for HPC (Dagstuhl Seminar 17202)
...

Luster a scalable architecture file system: A research implementation on active storage array framework with Luster file system
Rushikesh Salunkhe ... Naveenkumar Jayakumar
-
Rushikesh Salunkhe, et. al.Rushikesh Salunkhe ... Naveenkumar Jayakumar
01 Mar 2016
01 Mar 2016

Reverse engineering of ReFS
Rune Nordvik ... Stefan Axelsson
Digital Investigation | VOL. 30
Rune Nordvik, et. al.Rune Nordvik ... Stefan Axelsson
23 Jul 2019
Digital Investigation | VOL. 30

In search of a scalable file system state-of-the-art file systems review and map view of new Scalable File system
Rushikesh Salunkhe ... Devendra Thakore
-
Rushikesh Salunkhe, et. al.Rushikesh Salunkhe ... Devendra Thakore
01 Mar 2016
01 Mar 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Architecture and performance of Perlmutter's 35 PB ClusterStor E1000 all‐flash file system

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience