DASH: a Recipe for a Flash-based Data Intensive Supercomputer

Jiahua He, Jeffrey Bennett, Allan Snavely, Arun Jagatheesan, Sandeep Gupta

doi:10.1109/sc.2010.16

Abstract

Data intensive computing can be defined as computation involving large datasets and complicated I/O patterns. It is challenging because there is a five-orders-of-magnitude latency gap between main memory DRAM and spinning hard disks; as a result, an inordinate amount of time in data intensive computing is spent accessing data on disk. To address this problem we designed and built a prototype data intensive supercomputer named DASH that exploits flash-based Solid State Drive (SSD) technology and virtually aggregated DRAM to fill the latency gap. DASH uses commodity parts including Intel® X25-E flash drives and distributed shared memory (DSM) software from ScaleMP®. The system is highly competitive with several commercial offerings by several metrics, including achieved IOPS (input/output operations per second), IOPS per dollar of system acquisition cost, IOPS per watt during operation, and IOPS per gigabyte (GB) of available storage. We present an overview of the design of DASH, an analysis of its cost efficiency, and a detailed recipe for how we designed and tuned it for high data performance. Finally, we show that, running data-intensive scientific applications from graph theory, biology, and astronomy, we achieved as much as two orders of magnitude speedup compared to the same applications run on traditional architectures.
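
The four comparison metrics named in the abstract are simple ratios of achieved IOPS to cost, power, and capacity. The following is a minimal sketch of how they could be computed for one system. The `StorageSystem` type, the `efficiency_metrics` function, and all numeric values are hypothetical placeholders chosen for illustration; they are not measurements from the paper.

```python
from dataclasses import dataclass


@dataclass
class StorageSystem:
    """Hypothetical record of the quantities the abstract's metrics rely on."""
    name: str
    iops: float          # achieved I/O operations per second
    cost_dollars: float  # system acquisition cost
    power_watts: float   # power draw during operation
    capacity_gb: float   # available storage in gigabytes


def efficiency_metrics(system: StorageSystem) -> dict:
    """Return raw IOPS plus IOPS normalized by cost, power, and capacity."""
    return {
        "iops": system.iops,
        "iops_per_dollar": system.iops / system.cost_dollars,
        "iops_per_watt": system.iops / system.power_watts,
        "iops_per_gb": system.iops / system.capacity_gb,
    }


# Placeholder numbers for illustration only; NOT results from the paper.
example = StorageSystem(
    name="example-system",
    iops=100_000.0,
    cost_dollars=50_000.0,
    power_watts=500.0,
    capacity_gb=1_000.0,
)
metrics = efficiency_metrics(example)
for key, value in metrics.items():
    print(f"{key}: {value}")
```

Normalizing by cost, power, and capacity separately makes it possible to compare systems of very different sizes, which is presumably why the abstract reports all four figures rather than raw IOPS alone.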

