HI-Sky: Hash Index-Based Skyline Query Processing

Jong-Hyeok Choi, Aziz Nasridinov, Fei Hao

doi:10.3390/app10051708


Open Access

https://doi.org/10.3390/app10051708


Abstract

The skyline query has recently attracted a considerable amount of research interest in several fields. The query conducts computations using the domination test, where “domination” means that a data point does not have a worse value than others in any dimension, and has a better value in at least one dimension. Therefore, the skyline query can be used to construct efficient queries based on data from a variety of fields. However, when the number of dimensions or the amount of data increases, naïve skyline queries lead to a degradation in overall performance owing to the higher cost of comparisons among data. Several methods using index structures have been proposed to solve this problem but have not improved the performance of skyline queries because their indices are heavily influenced by the dimensionality and data amount. Therefore, in this study, we propose HI-Sky, a method that can perform quick skyline computations by using the hash index to overcome the above shortcomings. HI-Sky effectively manages data through the hash index and significantly improves performance by effectively eliminating unnecessary data comparisons when computing the skyline. We provide the theoretical background for HI-Sky and verify its improvement in skyline query performance through comparisons with prevalent methods.
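The domination test described in the abstract can be illustrated with a minimal sketch. It assumes that smaller values are better in every dimension (the abstract does not fix a direction), and the names `dominates` and `naive_skyline` are hypothetical, chosen only for this illustration. The quadratic loop is the naïve baseline whose comparison cost the abstract refers to, not HI-Sky itself.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(p: Sequence[float], q: Sequence[float]) -> bool:
    """Return True if p dominates q, assuming smaller values are better.

    p must be no worse than q in every dimension and strictly
    better in at least one dimension.
    """
    no_worse = all(a <= b for a, b in zip(p, q))
    strictly_better = any(a < b for a, b in zip(p, q))
    return no_worse and strictly_better


def naive_skyline(points: List[Point]) -> List[Point]:
    """Keep every point that no other point dominates.

    Each point is tested against all points, so the number of
    dominance tests grows quadratically with the data size.
    """
    return [p for p in points if not any(dominates(q, p) for q in points)]
```

For example, with the points `(1, 4)`, `(2, 2)`, `(4, 1)`, `(3, 3)` and `(4, 4)`, the point `(2, 2)` dominates both `(3, 3)` and `(4, 4)`, so only the first three points remain in the skyline.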

Highlights

The skyline query [1] returns data points that are not dominated by other data points in a given database
Data point B may not dominate other data points C and D, as these data do not satisfy the conditions of Lemma 1, i.e., neither point is in a partition with smaller corresponding dimensions than the other point given the order of the partition. Lemma 1 shows that hash index-based skyline (HI-Sky) can prune the data space using GLAD and can remove a large amount of data early, which significantly reduces the number of dominance tests
We proposed a hash index structure and a special hash key for a skyline query
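The idea of pruning whole partitions before running dominance tests can be sketched generically. The code below is not the paper's hash key, GLAD, or Lemma 1, none of which are defined on this page; it is a plain uniform-grid illustration under stated assumptions: values lie in [0, 1], smaller is better, each dimension is split into `k` equal cells, and a Python `dict` stands in for the hash index. All function names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, ...]
Cell = Tuple[int, ...]


def cell_of(p: Sequence[float], k: int) -> Cell:
    """Map a point in [0, 1]^d to its grid cell, used as the hash key."""
    return tuple(min(int(v * k), k - 1) for v in p)


def build_index(points: List[Point], k: int) -> Dict[Cell, List[Point]]:
    """Group points by grid cell in a dictionary."""
    index: Dict[Cell, List[Point]] = defaultdict(list)
    for p in points:
        index[cell_of(p, k)].append(p)
    return index


def cell_dominates(a: Cell, b: Cell) -> bool:
    """True if cell a is strictly smaller than cell b in every dimension.

    In that case every point in a dominates every point in b.
    """
    return all(x < y for x, y in zip(a, b))


def prune_cells(index: Dict[Cell, List[Point]]) -> Dict[Cell, List[Point]]:
    """Drop every non-empty cell that another non-empty cell dominates."""
    cells = list(index)
    return {c: pts for c, pts in index.items()
            if not any(cell_dominates(o, c) for o in cells)}


def dominates(p: Sequence[float], q: Sequence[float]) -> bool:
    """Point-level dominance test, assuming smaller values are better."""
    return (all(a <= b for a, b in zip(p, q))
            and any(a < b for a, b in zip(p, q)))


def skyline_with_pruning(points: List[Point], k: int = 4) -> List[Point]:
    """Prune dominated cells first, then test only the surviving points."""
    survivors = [p for pts in prune_cells(build_index(points, k)).values()
                 for p in pts]
    return [p for p in survivors
            if not any(dominates(q, p) for q in survivors)]
```

Because a pruned cell is dominated as a whole, none of its points can be in the skyline, so the result matches the naïve computation while the point-level dominance tests run only on the survivors.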

Summary

Introduction

The skyline query [1] returns data points that are not dominated by other data points in a given database. Research interest has tended toward index structure-based methods due to their efficiency in handling large amounts of data. BBS and Z-SKY are not suitable for skyline computation when the data frequently changes, as these methods require a large number of resources to maintain the indices [14,15]. We propose hash index-based skyline (HI-Sky), a skyline query method. The results of our experiments demonstrate that HI-Sky generates indices faster than other methods and performs faster skyline query processing at higher dimensions by effectively reducing the number of dominance tests.

Related Study

Traditional Skyline Computation

Index Based Skyline Computation

Parallel and Distributed Skyline Computation

Hash Index-Based Skyline Query Processing

Hash Index for Skyline

Background

Given d-dimensional space Sorder

Data Space Pruning Step

Skyline Computation Step

Skyline Computation using HI-Sky

Performance Evaluation

Experimental Environment

Comparison of Changes in np

Comparison of Indexing Time

Indexing

Comparison of Skyline Computation Time

Comparison

Comparisons Using Real-World Dataset

Conclusions


Journal: Applied Sciences
Publication Date: Mar 2, 2020
Citations: 4
License type: CC BY 4.0
