Database Tuples Research Articles

Recent in-memory database systems leverage advanced hardware features like RDMA to provide transaction processing at millions of transactions per second. Distributed transaction processing systems can scale to even higher rates, especially for partitionable workloads. Unfortunately, it is challenging to sustain such high rates during live reconfiguration of partitions. In this article, we observe that state-of-the-art approaches would cause notable performance disruption under fast transaction processing. To address this, the article presents DrTM+B, a live reconfiguration approach that seamlessly repartitions data with little performance disruption to running transactions. DrTM+B uses a pre-copy-based mechanism and avoids excessive data transfer by leveraging common properties of recent transactional systems. DrTM+B's reconfiguration plans reduce data movement by preferring existing data replicas, while copying data from multiple replicas asynchronously and in parallel. It further reuses the log forwarding mechanism of primary-backup replication to seamlessly track and forward dirty database tuples, which avoids the cost of iterative copying. To commit a reconfiguration plan in a transactionally safe way, DrTM+B designs a cooperative commit protocol that synchronizes data and state among replicas. To boost performance during data migration, DrTM+B combines the pre-copy and post-copy schemes into a hybrid copy scheme. The live reconfiguration approach can also coexist with the fault-tolerance mechanisms of primary-backup replication to provide high availability. Evaluation on a working system based on DrTM+R with 3-way replication, using typical OLTP workloads like TPC-C and SmallBank, shows that DrTM+B incurs only very small performance degradation during live reconfiguration and provides high availability. Both the reconfiguration time and the downtime are also minimal.
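The idea of tracking dirty tuples through a forwarded log, instead of re-copying them in repeated rounds, can be illustrated with a toy sketch. This is not DrTM+B's implementation: the function name `migrate_with_log_forwarding`, the dictionary-based "partitions", and the single-threaded interleaving are all assumptions made purely for illustration.

```python
from typing import Dict, Hashable, List, Tuple

Write = Tuple[Hashable, object]


def migrate_with_log_forwarding(
    source: Dict[Hashable, object],
    writes_during_copy: List[Write],
) -> Dict[Hashable, object]:
    """Toy pre-copy migration (hypothetical helper, not the paper's code).

    1. Bulk-copy a snapshot of the source partition.
    2. Writes that arrive during the copy are applied to the source and
       appended to a log, standing in for primary-backup log forwarding.
    3. The target replays the log once, so dirty tuples are not re-copied
       in repeated rounds.
    """
    # Phase 1: bulk copy of the snapshot taken at migration start.
    target = dict(source)

    # Transactions keep running on the source; each write is also logged.
    forwarded_log: List[Write] = []
    for key, value in writes_during_copy:
        source[key] = value
        forwarded_log.append((key, value))

    # Phase 2: replay the forwarded log on the target in order.
    for key, value in forwarded_log:
        target[key] = value

    return target


if __name__ == "__main__":
    src = {"a": 1, "b": 2}
    dst = migrate_with_log_forwarding(src, [("a", 10), ("c", 3)])
    print(dst == src)
```

In this sketch the target converges to the source state after a single log replay; a real system would additionally need the commit protocol described in the abstract to switch ownership safely.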

When faced with a database containing millions of tuples, a user may be interested only in a (typically much) smaller representative subset. Recently, a query called the regret minimization query was proposed to create such a subset for users. Specifically, this query finds a set of tuples that minimizes the user's regret (measured by how far the user's favorite tuple in the selected set is from their favorite tuple in the whole database). The regret minimization query was shown to be very useful in bridging the best of two existing well-known queries, top-k queries and skyline queries: like top-k queries, the total number of tuples returned by this new query is controllable, and like skyline queries, this new query does not require a user to specify any preference function. Thus, it has attracted a lot of attention from researchers in the database community, and various methods have been proposed for regret minimization. However, despite the abundant research effort, there is no systematic comparison among the existing methods. This paper surveys this interesting and evolving research topic by broadly reviewing and comparing the state-of-the-art methods for regret minimization. Moreover, we study different variants of the regret minimization query that have garnered considerable attention in recent years and present some interesting problems that have not yet been addressed in the literature. We implemented 12 state-of-the-art methods for obtaining regret minimization sets, published in top-tier venues such as SIGMOD and VLDB from 2010 to 2018, and present an experimental comparison under various parameter settings on both synthetic and real datasets. Our evaluation shows that the optimal choice of method for regret minimization depends on the application demands. This paper provides an empirical guideline for making such a decision.
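The regret measure described above can be made concrete with a minimal sketch. It assumes linear utility functions over non-negative attributes and normalizes the regret as a ratio; both are common choices but are assumptions here, since the abstract does not fix them. The function names (`utility`, `regret_ratio`, `max_regret_ratio`) are hypothetical, and the maximum is taken over a finite sample of weight vectors rather than over all possible preferences.

```python
from typing import Iterable, Sequence

Point = Sequence[float]


def utility(weights: Point, point: Point) -> float:
    """Linear utility of one tuple under a weight vector (assumed model)."""
    return sum(w * x for w, x in zip(weights, point))


def regret_ratio(database: Iterable[Point],
                 selected: Iterable[Point],
                 weights: Point) -> float:
    """Relative utility lost by choosing from `selected` instead of `database`."""
    best_in_db = max(utility(weights, p) for p in database)
    best_in_sel = max(utility(weights, p) for p in selected)
    if best_in_db <= 0:
        return 0.0  # no positive utility to lose
    return (best_in_db - best_in_sel) / best_in_db


def max_regret_ratio(database: Sequence[Point],
                     selected: Sequence[Point],
                     weight_samples: Iterable[Point]) -> float:
    """Worst regret ratio over a finite sample of user preferences."""
    return max(regret_ratio(database, selected, w) for w in weight_samples)


if __name__ == "__main__":
    db = [(1.0, 0.0), (0.0, 1.0), (0.6, 0.6)]
    subset = [(0.6, 0.6)]
    users = [(1.0, 0.0), (0.0, 1.0), (0.5, 0.5)]
    print(max_regret_ratio(db, subset, users))
```

A regret minimization query would then search for the subset of a given size that makes this worst-case value as small as possible; the sketch only evaluates one candidate subset.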

Articles published on Database Tuples

Research on blind reversible database watermarking algorithm based on dual embedding strategy

Database Repairing with Soft Functional Dependencies

BopSkyline: Boosting privacy-preserving skyline query service in the cloud

How Large Language Models Will Disrupt Data Management

A Graph-Based Blocking Approach for Entity Matching Using Contrastively Learned Embeddings

DrTM+B: Replication-Driven Live Reconfiguration for Fast and General Distributed Transaction Processing

Process Approach and Construction of the Database for Non-Core Asset Management in Credit Organizations

Query Games in Databases

A Robust and Reversible Watermarking Algorithm for a Relational Database Based on Continuous Columns in Histogram

IDR Privacy Protection Based on Database Digital Watermarking

An experimental survey of regret minimization query and variants: bridging the best worlds between top-k query and skyline query

A reversible database watermarking method with low distortion

An Ameliorated Methodology for Ranking the Tuple

PrefDB: Supporting Preferences as First-Class Citizens in Relational Databases

Indeterministic Handling of Uncertain Decisions in Deduplication

Matching dependencies: semantics and query answering

Finding Top-k Answers in Keyword Search over Relational Databases Using Tuple Units

Heuristic algorithm for interpretation of multi-valued attributes in similarity-based fuzzy relational databases

Privacy-Preserving Tuple Matching in Distributed Databases

Path oracles for spatial networks
