Storage Stack Research Articles

All-flash storage (AFS) systems have become an essential infrastructure component to support enterprise applications, where sub-millisecond latency and very high throughput are required. Nevertheless, the price per capacity ofsolid-state drives (SSDs) is relatively high, which has encouraged system architects to adoptdata reduction techniques, mainlydeduplication andcompression, in enterprise storage solutions. To provide higher reliability and performance, SSDs are typically grouped usingredundant array of independent disk (RAID) configurations. Data reduction on top of RAID arrays, however, adds I/O overheads and also complicates the I/O patterns redirected to the underlying backend SSDs, which invalidates the best-practice configurations used in AFS. Unfortunately, existing works on the performance of data reduction do not consider its interaction and I/O overheads with other enterprise storage components including SSD arrays and RAID controllers. In this paper, using a real setup with enterprise-grade components and based on the open-source data reduction module RedHat VDO, we reveal novel observations on the performance gap between the state-of-the-art and the optimal all-flash storage stack with integrated data reduction. We therefore explore the I/O patterns at the storage entry point and compare them with those at the disk subsystem. Our analysis shows a significant amount of I/O overheads for guaranteeing consistency and avoiding data loss through data journaling, frequent small-sized metadata updates, and duplicate content verification. We accompany these observations with cross-layer optimizations to enhance the performance of AFS, which range from deriving new optimal hardware RAID configurations up to introducing changes to the enterprise storage stack. By analyzing the characteristics of I/O types and their overheads, we propose three techniques: (a) application-aware lazy persistence, (b) a fast, read-only I/O cache for duplicate verification, and (c) disaggregation of block maps and data by offloading block maps to a very fast persistent memory device. By consolidating all proposed optimizations and implementing them in an enterprise AFS, we show 1.3× to 12.5× speedup over the baseline AFS with 90% data reduction, and from 7.8× up to 57× performance/cost improvement over an optimized AFS (with no data reduction) running applications ranging from 100% read-only to 100% write-only accesses.

Read full abstract

Diagnosing storage system failures is challenging even for professionals. One recent example is the “When Solid State Drives Are Not That Solid” incident occurred at Algolia data center, where Samsung SSDs were mistakenly blamed for failures caused by a Linux kernel bug. With the system complexity keeps increasing, diagnosing failures will likely become more difficult.To better understand real-world failures and the potential limitations of state-of-the-art tools, we first conduct an empirical study on 277 user-reported storage failures in this paper. We characterize the issues along multiple dimensions (e.g., time to resolve, kernel components involved), which provides a quantitative measurement of the challenge in practice. Moreover, we analyze a set of the storage issues in depth and derive a benchmark suite called BugBenchk. The benchmark suite includes the necessary workloads and software environments to reproduce 9 storage failures, covers 4 different file systems and the block I/O layer of the storage stack, and enables realistic evaluation of diverse kernel-level tools for debugging.To demonstrate the usage, we apply BugBenchk to study two representative tools for debugging. We focus on measuring the observations that the tools enable developers to make (i.e., observability), and derive concrete metrics to measure the observability qualitatively and quantitatively. Our measurement demonstrates the different design tradeoffs in terms of debugging information and overhead. More importantly, we observe that both tools may behave abnormally when applied to diagnose a few tricky cases. Also, we find that neither tool can provide low-level information on how the persistent storage states are changed, which is essential for understanding storage failures. To address the limitation, we develop lightweight extensions to enable such functionality in both tools. We hope that BugBenchk and the enabled measurements will inspire follow-up research in benchmarking and tool support and help address the challenge of failure diagnosis in general.

Read full abstract

Storage Stack Research Articles

Related Topics

Articles published on Storage Stack

Check-QZP: A Lightweight Checkpoint Mechanism for Deep Learning Frameworks

An Evaluation of DAOS for Simulation and Deep Learning HPCWorkloads

Role of Correlation among Physical Factors in Probabilistic Simulation of Emissions of Volatile Organic Compounds from Floating Storage and Offloading Vent Stack

Index Shipping for Efficient Replication in LSM Key-Value Stores with Hybrid KV Placement

Caiti: I/O transit caching for persistent memory-based block device

Disaggregating RocksDB: A Production Experience

Energy, Volume and Cost Analyses of High Temperature Seasonal Thermal Storage for Plus Energy House

A Simulation of a Sustainable Plus-Energy House in Poland Equipped with a Photovoltaic Powered Seasonal Thermal Storage System

Dotori: A Key-Value SSD Based KV Store

储能堆供电充电桩的研究

Coordinated Control of Hybrid DERs Enabled Grid-Interactive Residential PCM With Hybrid Bus Layout

An Enterprise-Grade Open-Source Data Reduction Architecture for All-Flash Storage Systems

An Enterprise-Grade Open-Source Data Reduction Architecture for All-Flash Storage Systems

Concurrent and Robust End-to-End Data Integrity Verification Scheme for Flash-Based Storage Devices

Benchmarking for Observability: The Case of Diagnosing Storage Failures

Detailed black-box monitoring of distributed systems

A survey of operating system support for persistent memory

Recycling of Critical Raw Materials from Hydrogen Chemical Storage Stacks (PEMWE), Membrane Electrode Assemblies (MEA) and Electrocatalysts

Design of Phase‐Change Memory Using Apertureless Scanning Near‐Field Optical Microscopy in the Near‐Infrared Region

A Novel Global Key-Value Storage System Based on Kinetic Drives

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Storage Stack Research Articles

Related Topics

Articles published on Storage Stack

Check-QZP: A Lightweight Checkpoint Mechanism for Deep Learning Frameworks

An Evaluation of DAOS for Simulation and Deep Learning HPCWorkloads

Role of Correlation among Physical Factors in Probabilistic Simulation of Emissions of Volatile Organic Compounds from Floating Storage and Offloading Vent Stack

Index Shipping for Efficient Replication in LSM Key-Value Stores with Hybrid KV Placement

Caiti: I/O transit caching for persistent memory-based block device

Disaggregating RocksDB: A Production Experience

Energy, Volume and Cost Analyses of High Temperature Seasonal Thermal Storage for Plus Energy House

A Simulation of a Sustainable Plus-Energy House in Poland Equipped with a Photovoltaic Powered Seasonal Thermal Storage System

Dotori: A Key-Value SSD Based KV Store

储能堆供电充电桩的研究

Coordinated Control of Hybrid DERs Enabled Grid-Interactive Residential PCM With Hybrid Bus Layout

An Enterprise-Grade Open-Source Data Reduction Architecture for All-Flash Storage Systems

An Enterprise-Grade Open-Source Data Reduction Architecture for All-Flash Storage Systems

Concurrent and Robust End-to-End Data Integrity Verification Scheme for Flash-Based Storage Devices

Benchmarking for Observability: The Case of Diagnosing Storage Failures

Detailed black-box monitoring of distributed systems

A survey of operating system support for persistent memory

Recycling of Critical Raw Materials from Hydrogen Chemical Storage Stacks (PEMWE), Membrane Electrode Assemblies (MEA) and Electrocatalysts

Design of Phase‐Change Memory Using Apertureless Scanning Near‐Field Optical Microscopy in the Near‐Infrared Region

A Novel Global Key-Value Storage System Based on Kinetic Drives