The 2 PetaFLOP, 3 Petabyte, 9 TB/s, 90 kW Cabinet: A System Architecture for Exascale and Big Data

Bruce Jacob

doi:10.1109/lca.2015.2451652

Abstract

We present a system architecture that uses high-efficiency processors as opposed to high-performance processors, NAND flash as byte-addressable main memory, and high-speed DRAM as a cache front-end for the flash. The main memory system is interconnected and presents a unified global address space to the client microprocessors. A single cabinet contains 2,550 nodes, networked in a highly redundant modified Moore graph that yields a bisection bandwidth of 9.1 TB/s and a worst-case latency of four hops from any node to any other. At a per-cabinet level, the system supports a minimum of 2.6 petabytes of main memory, dissipates 90 kW, and achieves 2.2 PetaFLOPS. The system architecture provides several features desirable in today’s large-scale systems, including a global shared physical address space (and optional support for a global shared virtual space as well), the ability to partition the physical space unequally among clients as in a unified cache architecture (e.g., so as to support multiple VMs in a datacenter), pairwise system-wide sequential consistency on user-specified address sets, built-in checkpointing via journaled non-volatile main memory, memory cost-per-bit approaching that of NAND flash, and memory performance approaching that of pure DRAM.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The 2 PetaFLOP, 3 Petabyte, 9 TB/s, 90 kW Cabinet: A System Architecture for Exascale and Big Data

Abstract

Talk to us

Similar Papers

More From: IEEE Computer Architecture Letters

Lead the way for us

Journal: IEEE Computer Architecture Letters	Publication Date: Jul 1, 2016
Citations: 8

Similar Papers

Towards a Complexity Model for Design and Analysis of PGAS-Based Algorithms
Mohamed Bakhouya ... Tarek El-Ghazawi
-
Mohamed Bakhouya, et. al.Mohamed Bakhouya ... Tarek El-Ghazawi
01 Jan 2007
01 Jan 2007

Using the PGAS Programming Paradigm for Biological Sequence Alignment on a Chip Multi-Threading Architecture
...
Zenodo (CERN European Organization for Nuclear Research) | VOL. -
, et. al. ...
29 Feb 2008
Zenodo (CERN European Organization for Nuclear Research) | VOL. -

Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model
-
-
--
12 Oct 2010
12 Oct 2010

TH-DPMS
Jiwu Shu ... Junru Li
ACM Transactions on Storage | VOL. 16
Jiwu Shu, et. al.Jiwu Shu ... Junru Li
01 Oct 2020
ACM Transactions on Storage | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The 2 PetaFLOP, 3 Petabyte, 9 TB/s, 90 kW Cabinet: A System Architecture for Exascale and Big Data

Abstract

Talk to us

Similar Papers

More From: IEEE Computer Architecture Letters