MOSIQS: Persistent Memory Object Storage With Metadata Indexing and Querying for Scientific Computing

Awais Khan,Hyogi Sim,Youngjae Kim,Sudharshan S. Vazhkudai

doi:10.1109/access.2021.3087502

Awais Khan, Hyogi Sim + Show 2 more

Open Access

https://doi.org/10.1109/access.2021.3087502

Copy DOI

Abstract

Scientific applications often require high-bandwidth shared storage to perform joint simulations and collaborative data analytics. Shared memory pools provide a chance to satisfy such needs. Recently, a high-speed network such as Gen-Z utilizing persistent memory (PM) offers an opportunity to create a shared memory pool connected to compute nodes. However, there are several challenges to use scientific applications on the shared memory pool directly such as scalability, failure-atomicity, and lack of scientific metadata-based search and query. In this paper, we propose MOSIQS, a persistent memory object storage framework with metadata indexing and querying for scientific computing. We design MOSIQS based on the key idea that memory objects on PM pool can live beyond the application lifetime and can become the sharing currency for applications and scientists. MOSIQS provides an aggregate memory pool atop an array of persistent memory devices to store and access memory objects to accelerate scientific computing. MOSIQS uses a lightweight persistent memory key-value store to manage the metadata of memory objects, which enables memory object sharing. To facilitate metadata search and query over millions of memory objects resident on memory pool, we introduce Group Split and Merge (GSM), a novel persistent index data structure designed primarily for scientific datasets. GSM splits and merges dynamically to minimize the query search space and maintains low query processing time while overcoming the index storage overhead. MOSIQS is implemented on top of PMDK. We evaluate the proposed approach on many-core server with an array of real PM devices. Experimental results show that MOSIQS gains a 100% write performance improvement and executes multi-attribute queries efficiently with 2.7× less index storage overhead offering significant potential to speed up scientific computing applications.

Highlights

Large-scale scientific applications, including simulations, experiments, and observations, generate tens of petabytes of data objects and are forecasted to grow even further [1]–[3]
To enable efficient multi-attribute metadata search and querying for scientific data objects resident in Persistent Memory (PM) pool, we introduce Group Split-and-Merge (GSM) index data structure (Section IV)
EXPERIMENTAL SETUP Testbed: We perform our experiments on a machine equipped with second-generation Intel Xeon scalable dualsocket 56-core processor with 1.5 TB Intel Optane DC 3D-XPoint PM, and 768 GB DRAM

Summary

Introduction

Large-scale scientific applications, including simulations, experiments, and observations, generate tens of petabytes of data objects and are forecasted to grow even further [1]–[3]. Many applications from various scientific domains tend to store experimental, simulation, and analytical data in domain-specific scientific data formats such as HDF5 [33], netCDF [34], FITS [36], Plot3D [37], and GRIB [38] in the underlying parallel file system. These scientific data formats are often referred to as self-described and self-contained, i.e., the metadata is stored alongside the data objects [26].

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

MOSIQS: Persistent Memory Object Storage With Metadata Indexing and Querying for Scientific Computing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Persistent Memory Object Storage and Indexing for Scientific Computing
Awais Khan ... Youngjae Kim
-
Awais Khan, et. al.Awais Khan ... Youngjae Kim
01 Nov 2020
01 Nov 2020

Memory Perfectionism is Associated with Persistent Memory Complaints after Concussion.
Edwina L Picon ... Daniela J Palombo
Archives of Clinical Neuropsychology | VOL. 37
Edwina L Picon, et. al.Edwina L Picon ... Daniela J Palombo
19 Apr 2022
Archives of Clinical Neuropsychology | VOL. 37

Delegated persist ordering
Aasheesh Kolli ... Jeff Rosen
-
Aasheesh Kolli, et. al.Aasheesh Kolli ... Jeff Rosen
01 Oct 2016
01 Oct 2016

Delegated persist ordering
...
-
, et. al. ...
15 Oct 2016
15 Oct 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MOSIQS: Persistent Memory Object Storage With Metadata Indexing and Querying for Scientific Computing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access