In-memory Database Systems Research Articles

Recent in-memory database systems leverage advanced hardware features like RDMA to provide transaction processing at millions of transactions per second. Distributed transaction processing systems can scale to even higher rates, especially for partitionable workloads. Unfortunately, it is challenging to sustain such high rates during live reconfiguration of partitions. In this article, we observe that state-of-the-art approaches would cause notable performance disruption under fast transaction processing. To this end, this article presents DrTM+B, a live reconfiguration approach that seamlessly repartitions data with little performance disruption to running transactions. DrTM+B uses a pre-copy-based mechanism to avoid excessive data transfer by leveraging common properties in recent transactional systems. DrTM+B's reconfiguration plans reduce data movement by preferring existing data replicas, while copying data from multiple replicas asynchronously and in parallel. It further reuses the log forwarding mechanism in primary-backup replication to seamlessly track and forward dirty database tuples and avoids iterative copying costs. To commit a reconfiguration plan in a transactional-safe way, DrTM+B designs a cooperative commit protocol for synchronization of data and state among replicas. To boost the performance during data migration, DrTM+B combines the pre-copy and post-copy schemes to propose a hybrid copy scheme. The live reconfiguration approach can also coexist with fault-tolerance mechanisms of primary-backup replication to provide high availability. Evaluation on a working system based on DrTM+R with 3-way replication using typical OLTP workloads like TPC-C and SmallBank shows that DrTM+B incurs only very small performance degradation during live reconfiguration and provides high availability. Both the reconfiguration time and the downtime are also minimal.

Read full abstract

Modern in-memory database systems are facing the need of efficiently supporting mixed workloads of OLTP and OLAP. A conventional approach to this requirement is to rely on ETL-style, application-driven data replication between two very different OLTP and OLAP systems, sacrificing real-time reporting on operational data. An alternative approach is to run OLTP and OLAP workloads in a single machine, which eventually limits the maximum scalability. In order to tackle this challenging problem, we propose a novel database replication architecture called HANA Asynchronous Parallel Table Replication (ATR). ATR supports OLTP workloads in one primary machine, while it supports heavy OLAP workloads in replicas. Here, row store formats can be used for OLTP transactions at the primary, while column store formats are used for OLAP analytical queries at the replicas. ATR is designed to support elastic scalability of OLAP query performance, while it minimizes the overhead for transaction processing at the primary and minimizes CPU consumption for replayed transactions at the replicas. ATR employs a novel optimistic lock-free parallel log replay scheme which exploits characteristics of multi-version concurrency control (MVCC) to enable real-time reporting by minimizing the propagation delay between the primary and replicas. It supports adaptive query routing depending on its predefined acceptable staleness range. Through extensive experiments with a concrete implementation available in a commercial product, we demonstrate that ATR achieves sub-second visibility delay even for update-intensive workloads, providing scalable OLAP performance without notable overhead to the primary. In addition, with extension of ATR to eager parallel replication, we demonstrate how the parallel log replay and its log-less replica recovery mechanisms improve run-time transaction performance under eager replication.

Read full abstract

In-memory Database Systems Research Articles

Related Topics

Articles published on In-memory Database Systems

LeanStore: A High-Performance Storage Engine for NVMe SSDs

DrTM+B: Replication-Driven Live Reconfiguration for Fast and General Distributed Transaction Processing

Budget-Conscious Fine-Grained Configuration Optimization for Spatio-Temporal Applications

Memory-optimized multi-version concurrency control for disk-based database systems

Index checkpoints for instant recovery in in-memory database systems

B2-Tree: Page-Based String Indexing in Concurrent Environments

Application-Oriented Data Migration to Accelerate In-Memory Database on Hybrid Memory.

Robust and budget-constrained encoding configurations for in-memory database systems

Replicated layout for in-memory database systems

Micro-architectural analysis of in-memory OLTP: Revisited

Main Memory Database Recovery

In-memory database acceleration on FPGAs: a survey

Evaluating the Vector Supercomputer SX-Aurora TSUBASA as a Co-Processor for In-Memory Database Systems

In-memory for the masses

Efficient compute node-local replication mechanisms for NVRAM-centric data structures

From a Comprehensive Experimental Survey to a Cost-based Selection Strategy for Lightweight Integer Compression Algorithms

Adaptive optimistic concurrency control for heterogeneous workloads

Anomaly Detection Using Data Mining Methods in IT Systems: A Decision Support Application

Parallel replication across formats for scaling out mixed OLTP/OLAP workloads in main-memory databases

Parallel replication across formats in SAP HANA for scaling out mixed OLTP/OLAP workloads

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

In-memory Database Systems Research Articles

Related Topics

Articles published on In-memory Database Systems

LeanStore: A High-Performance Storage Engine for NVMe SSDs

DrTM+B: Replication-Driven Live Reconfiguration for Fast and General Distributed Transaction Processing

Budget-Conscious Fine-Grained Configuration Optimization for Spatio-Temporal Applications

Memory-optimized multi-version concurrency control for disk-based database systems

Index checkpoints for instant recovery in in-memory database systems

B2-Tree: Page-Based String Indexing in Concurrent Environments

Application-Oriented Data Migration to Accelerate In-Memory Database on Hybrid Memory.

Robust and budget-constrained encoding configurations for in-memory database systems

Replicated layout for in-memory database systems

Micro-architectural analysis of in-memory OLTP: Revisited

Main Memory Database Recovery

In-memory database acceleration on FPGAs: a survey

Evaluating the Vector Supercomputer SX-Aurora TSUBASA as a Co-Processor for In-Memory Database Systems

In-memory for the masses

Efficient compute node-local replication mechanisms for NVRAM-centric data structures

From a Comprehensive Experimental Survey to a Cost-based Selection Strategy for Lightweight Integer Compression Algorithms

Adaptive optimistic concurrency control for heterogeneous workloads

Anomaly Detection Using Data Mining Methods in IT Systems: A Decision Support Application

Parallel replication across formats for scaling out mixed OLTP/OLAP workloads in main-memory databases

Parallel replication across formats in SAP HANA for scaling out mixed OLTP/OLAP workloads