IDSA: An Efficient Algorithm for Skyline Queries Computation on Dynamic and Incomplete Data With Changing States

Yonis Gulzar, Sharyar Wani, Arjumand Bano Soomo, Hamidah Ibrahim, Sherzod Turaev, Ali A Alwan, Yasir Hamid

doi:10.1109/access.2021.3072775

Abstract

Skyline queries are widely used as an effective query tool in many contemporary database applications. A skyline query retrieves the non-dominated tuples in the database, which are known as skylines. In most database applications, the contents of the database are dynamic because of the continuous changes made to it. Typically, these changes occur through data manipulation operations (INSERT and/or UPDATE). Performing these operations invalidates the skylines that were derived before the changes were made. Furthermore, incomplete data has become a frequent phenomenon in recent database applications. Data incompleteness poses several challenges for skyline queries, such as the loss of the transitivity property of the skyline technique and cyclic dominance between tuples. Reapplying the skyline technique to the entire updated incomplete database to determine the new skylines is unwise because of the exhaustive pairwise comparisons it requires. This paper therefore proposes an approach named Incomplete Dynamic Skyline Algorithm (IDSA), which aims to determine the skylines on dynamic and incomplete databases. Two optimization techniques are incorporated in IDSA: pruning and selecting superior local skylines. The pruning process exploits the skylines derived before the INSERT/UPDATE operation to identify the new skylines. Selecting superior local skylines helps eliminate the remaining non-skylines from further processing. Together, these two techniques greatly reduce the number of domination tests by avoiding recomputation of the skylines over the entire updated database. Extensive experiments on both real and synthetic datasets demonstrate that IDSA outperforms the existing solutions in terms of the number of domination tests and the processing time of the skyline operation.
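The loss of transitivity mentioned in the abstract can be made concrete with a small sketch. It is not taken from the paper: it assumes that missing values are represented as `None`, that larger values are preferred, and that two tuples are compared only on the dimensions where both values are known (a definition commonly used for incomplete data). The names `dominates` and `naive_skyline` and the three sample tuples are hypothetical.

```python
from typing import List, Optional, Sequence

Row = Sequence[Optional[float]]  # None marks a missing value (assumption)


def dominates(p: Row, q: Row) -> bool:
    """Return True if p dominates q on the dimensions known in both.

    Assumes larger values are better. Tuples that share no known
    dimension are treated as incomparable.
    """
    common = [(x, y) for x, y in zip(p, q) if x is not None and y is not None]
    if not common:
        return False
    return all(x >= y for x, y in common) and any(x > y for x, y in common)


def naive_skyline(rows: List[Row]) -> List[Row]:
    """Keep every tuple that no other tuple dominates (exhaustive pairwise test)."""
    return [p for p in rows if not any(dominates(q, p) for q in rows if q is not p)]


# Hypothetical tuples: each pair shares exactly one known dimension.
a = (2, 1, None)
b = (1, None, 2)
c = (None, 2, 1)

print(dominates(a, b), dominates(b, c), dominates(c, a))  # a > b > c > a
print(dominates(a, c))  # transitivity would require this to be True
print(naive_skyline([a, b, c]))
```

Here `a` dominates `b`, `b` dominates `c`, and `c` dominates `a`, so the dominance relation is cyclic and every tuple is dominated by some other tuple. This is why a dominated tuple cannot simply be discarded early on incomplete data, as it can on complete data.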

Highlights

Traditional queries are inflexible: they either return the data that strictly satisfy the conditions of the submitted query or return no result at all.
Among the most notable variations of the skyline technique designed for databases with complete data are Divide-and-Conquer (D&C), Block Nested-Loop (BNL) [14], Bitmap and Index [15], Sort Filter Skyline (SFS) [16], Branch and Bound Skyline (BBS) [19], Linear Elimination Sort Skyline (LESS) [17], Sort and Limit Skyline algorithm (SaLSa) [18], Nearest Neighbor (NN) [20], ZSearch [21], and OSPS [22]. The assumption of data completeness ensures that all tuples are comparable with each other, so the pairwise comparisons are straightforward and identify the skyline results.
In this paper, a new skyline solution called Incomplete Dynamic Skyline Algorithm (IDSA) is proposed. It retrieves the skylines over a dynamic and incomplete database whose state has changed because of an insert operation performed on the initial incomplete database.
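For contrast with the incomplete case, the complete-data assumption in the second highlight can be illustrated with a minimal in-memory sketch in the style of Block Nested-Loop. It is a simplification, not the published BNL algorithm (which manages a bounded window and temporary files); it assumes that larger values are preferred, and the names `dominates` and `bnl_skyline` and the sample points are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, ...]


def dominates(p: Point, q: Point) -> bool:
    """p dominates q if it is no worse everywhere and strictly better somewhere."""
    return all(x >= y for x, y in zip(p, q)) and any(x > y for x, y in zip(p, q))


def bnl_skyline(points: List[Point]) -> List[Point]:
    """Single pass that keeps a window of candidate skylines."""
    window: List[Point] = []
    for p in points:
        # Transitivity on complete data makes it safe to drop p for good
        # as soon as one window entry dominates it.
        if any(dominates(w, p) for w in window):
            continue
        # p survives, so evict every candidate that p dominates.
        window = [w for w in window if not dominates(p, w)]
        window.append(p)
    return window


points = [(1, 5), (3, 3), (2, 2), (5, 1), (4, 4)]
print(bnl_skyline(points))  # [(1, 5), (5, 1), (4, 4)]
```

The early discard inside the loop is exactly the step that stops being safe once values are missing, because dominance is then no longer transitive.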

Summary

INTRODUCTION

Traditional queries are inflexible: they either return the data that strictly satisfy the conditions of the submitted query or return no result at all. Based on the most recent information in the bar database, the bar b9, which was reported as a skyline before the insert operation, is now dominated by the newly inserted bar b13 on the rating dimension. This indicates that b9 is no longer a valid skyline and should be removed from the skyline result. It is unwise and impractical to apply the skyline technique directly to the entire database after changes are made in order to compute the new skylines, because not all tuples are affected by the insert operation.

● The problem of processing skyline queries in an incomplete and dynamic database, where values of certain dimensions of tuples are missing and the contents of the database are frequently updated through data manipulation operations (insert and update), has been highlighted.

The conclusion is described in the final section, Section VI.
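The b9/b13 situation described in the introduction can be sketched as a check of the previously reported skylines against a newly inserted tuple. This is only an illustration of why an insert can invalidate an old skyline, not the IDSA algorithm: the attribute values are invented, larger values are assumed to be better, `None` is assumed to mark a missing value, and the helper `invalidated_by_insert` is hypothetical. Because dominance is not transitive on incomplete data, a full solution would also have to re-examine tuples that were previously dominated only by a removed skyline, which this sketch does not do.

```python
from typing import Dict, List, Optional, Sequence

Row = Sequence[Optional[float]]  # None marks a missing value (assumption)


def dominates(p: Row, q: Row) -> bool:
    """p dominates q on the dimensions known in both (larger is better)."""
    common = [(x, y) for x, y in zip(p, q) if x is not None and y is not None]
    if not common:
        return False
    return all(x >= y for x, y in common) and any(x > y for x, y in common)


def invalidated_by_insert(skyline: Dict[str, Row], new_row: Row) -> List[str]:
    """Names of previously reported skylines that the inserted tuple dominates."""
    return [name for name, row in skyline.items() if dominates(new_row, row)]


# Invented values; the first dimension plays the role of the rating.
old_skyline = {"b9": (4.0, None, 3.0), "b4": (3.0, 5.0, None)}
b13 = (4.5, 2.0, None)

print(invalidated_by_insert(old_skyline, b13))  # ['b9']
```

Only the old skylines are compared with the new tuple here, which reflects the observation that not every tuple is affected by an insert.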

RELATED WORK

IDSA ALGORITHM

RESULTS AND DISCUSSION

EFFECT OF DATASET SIZE

CONCLUSION


Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 44	License type: CC BY 4.0


Similar Papers

Efficient Computation of Skyline Queries Over a Dynamic and Incomplete Database
Ghazaleh Babanejad Dehaki ... Fatimah Sidi
IEEE Access | VOL. 8
01 Jan 2020

IDENTIFYING SKYLINES IN CLOUD DATABASES WITH INCOMPLETE DATA
Yonis Gulzar ... Imad Fakhri Al Shaikhli
Journal of Information and Communication Technology | VOL. 18
01 Jan 2018

A Framework for Identifying Skylines over Incomplete Data
Ali A Alwan ... Nur Izura Udzir
01 Dec 2014

A Framework for Evaluating Skyline Queries over Incomplete Data
Yonis Gulzar ... Syed Idrees Mairaj Alvi
Procedia Computer Science | VOL. 94
01 Jan 2015
