Query Dictionary for Frequent Non-Indexed Queries in HTAP Databases

Sucharitha Shetty, Srikanth Prabhu, B Dinesh Rao

doi:10.1109/access.2022.3153350

Abstract

The increasing demand for simultaneous transaction processing and data analysis, whether for decision making or forecasting, has created a need for faster and better Hybrid Transactional/Analytical Processing (HTAP). This paper focuses on speeding up Online Analytical Processing (OLAP) operations in an HTAP environment where analytical queries are mainly repetitive and contain non-indexed keys in their predicates. Zone maps and materialized views are popular approaches adopted by larger databases to address this issue. However, they are absent from in-memory databases because of space constraints. Instead, in-memory databases load the cache with result pages of frequently accessed queries. As the number of such queries grows, the cache can fill up and the system's overhead rises. This paper presents Query_Dictionary, a hybrid storage solution that leverages the capabilities of SQLite by retaining less information about repetitive queries in the cache and efficiently accommodating data newly updated by the end user. The solution stores page-level query metadata for larger result sets and row-level information for smaller result sets. Query_Dictionary is demonstrated on three types of representative queries on non-indexed attributes: single-table, binary-join, and transactional queries. In these comparisons, the proposed method performs better than SQLite.

Highlights

In the modern computing world, Hybrid Transactional/Analytical Processing (HTAP) generally adopts in-memory techniques [1]
A binary search using the indexed key is performed on the index table to fetch the rowid
The obtained rowid is used as a key to perform a binary search on the original table
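The two-step lookup described in the highlights can be sketched in a few lines of Python. This is a minimal illustration, not SQLite's B-tree code: sorted lists stand in for the index table and the original table, and the names `index_keys`, `index_rowids`, `table_rowids`, `table_rows`, `find`, and `lookup_by_name` are hypothetical.

```python
from bisect import bisect_left

# Toy "original table": rows kept in rowid order (assumed layout for illustration).
table_rowids = [1, 2, 3, 4, 5]
table_rows = [("dave", 25), ("alice", 30), ("erin", 37), ("bob", 25), ("carol", 41)]

# Toy "index table" on the name column: indexed keys in sorted order,
# each paired with the rowid of the row it points to.
index_keys = ["alice", "bob", "carol", "dave", "erin"]
index_rowids = [2, 4, 5, 1, 3]

def find(sorted_keys, key):
    """Binary search: return the position of key in sorted_keys, or None."""
    i = bisect_left(sorted_keys, key)
    return i if i < len(sorted_keys) and sorted_keys[i] == key else None

def lookup_by_name(name):
    """Step 1: search the index for the rowid. Step 2: search the table by rowid."""
    i = find(index_keys, name)
    if i is None:
        return None
    rowid = index_rowids[i]
    j = find(table_rowids, rowid)
    return None if j is None else table_rows[j]

print(lookup_by_name("carol"))
```

Each indexed lookup therefore costs two binary searches. A query on a non-indexed attribute cannot take this path and has to scan the table instead, which is the case the paper targets.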

Summary

INTRODUCTION

In the modern computing world, HTAP generally adopts in-memory techniques [1]. Storing the entire database structure in memory is the signature characteristic of in-memory databases. A recent study of modern in-memory database systems shows that index lookup can contribute up to 94% of query execution time [3] [4].

OUR CONTRIBUTIONS

This paper introduces Query_Dictionary, a hybrid storage solution that accelerates repetitive non-indexed queries with a minimal memory footprint. It processes most requests in the cache by prefetching only the significant pages, which simplifies data processing.

When supplemented with a priori user knowledge about query workloads and datasets, data skipping approaches can improve scan performance considerably compared with traditional indexing algorithms. They minimize the number of filtered records, reducing space and maintenance costs. Lazy maintenance strategies have been found to reduce maintenance costs [16].
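The hybrid idea stated in the abstract (row-level information for smaller result sets, page-level metadata for larger ones) can be illustrated with a small Python sketch. It is not the paper's data structure: the class and field names, the `row_level_limit` threshold, and the fixed `rows_per_page` mapping from rowid to page are all assumptions made for illustration, and real SQLite pages do not hold a fixed number of rows.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Entry:
    granularity: str      # "row" or "page"
    locations: frozenset  # rowids or page numbers, depending on granularity

class QueryDictionary:
    """Toy cache keyed by query text (hypothetical design, not the paper's)."""

    def __init__(self, rows_per_page, row_level_limit):
        self.rows_per_page = rows_per_page      # assumed fixed page capacity
        self.row_level_limit = row_level_limit  # assumed size threshold
        self._entries = {}

    def remember(self, query_text, matching_rowids):
        rowids = frozenset(matching_rowids)
        if len(rowids) <= self.row_level_limit:
            # Small result set: keep the exact rowids.
            entry = Entry("row", rowids)
        else:
            # Large result set: keep only the pages that contain matches.
            pages = frozenset((r - 1) // self.rows_per_page for r in rowids)
            entry = Entry("page", pages)
        self._entries[query_text] = entry
        return entry

    def lookup(self, query_text):
        return self._entries.get(query_text)

qd = QueryDictionary(rows_per_page=10, row_level_limit=4)
small = qd.remember("SELECT * FROM t WHERE city = 'Oslo'", [3, 57])
large = qd.remember("SELECT * FROM t WHERE qty > 0", range(1, 31))
print(small.granularity, sorted(small.locations))
print(large.granularity, sorted(large.locations))
```

In this toy model the large result set of 30 rowids collapses to three page numbers, which shows why page-level metadata keeps the cached footprint small when many rows match.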

BACKGROUND

8: Fetch Records using B-tree index search

12 Integer 550 4 0

OpenRead 0 9 0

OpenRead 0 495 0

23: Modify the VDBE code to specify page

OpenRead 0 8 0

OpenRead 0 x 0

30: Remove the unwanted page numbers from the
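The `OpenRead` fragments above appear to be rows from a SQLite VDBE program listing, where the second operand (P2) is the root page of the table being opened. As a rough way to inspect such a listing, the sketch below uses Python's standard `sqlite3` module and SQLite's `EXPLAIN` prefix. The `orders` table, its data, and the `vdbe_program` helper are hypothetical, and the paper's modification of the VDBE code is not reproduced here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount INTEGER)")
conn.executemany("INSERT INTO orders (amount) VALUES (?)",
                 [(a,) for a in (120, 550, 75)])

def vdbe_program(connection, sql):
    """Return the VDBE listing for sql as (addr, opcode, p1, p2, p3) tuples."""
    return [row[:5] for row in connection.execute("EXPLAIN " + sql)]

# amount has no index, so SQLite scans the table through a single read cursor.
program = vdbe_program(conn, "SELECT id FROM orders WHERE amount = 550")
open_reads = [row for row in program if row[1] == "OpenRead"]

# The table's root page as recorded in the schema table.
root_page = conn.execute(
    "SELECT rootpage FROM sqlite_master WHERE name = 'orders'"
).fetchone()[0]

for addr, opcode, p1, p2, p3 in program:
    print(addr, opcode, p1, p2, p3)
```

The exact opcodes in the listing vary between SQLite versions, so only the `OpenRead` row and its P2 operand are relied on here.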

ENHANCEMENTS

EXPERIMENTAL EVALUATION

HARDWARE AND SOFTWARE SETUP

EXPERIMENT 1

EXPERIMENT 2

EXPERIMENT 3

DISCUSSION

Findings

CONCLUSION

Journal: IEEE access : practical innovations, open solutions
Publication Date: Jan 1, 2022
License type: CC BY 4.0

