Modeling and Computing Probabilistic Skyline on Incomplete Data

Kaiqi Zhang,Zhipeng Cai,Hong Gao,Xixian Han,Jianzhong Li

doi:10.1109/tkde.2019.2904967

Kaiqi Zhang, Zhipeng Cai + Show 3 more

Open Access

https://doi.org/10.1109/tkde.2019.2904967

Copy DOI

Abstract

The skyline query is important in the database community. In recent years, the researches on incomplete data have been increasingly considered, especially for the skyline query. However, the existing skyline definition on incomplete data cannot provide users with valuable references. In this paper, we propose a novel skyline definition utilizing probabilistic model on incomplete data where each point has a probability to be in the skyline. In particular, it returns K points with the highest skyline probabilities. In addition, we propose incomplete models and estimate probability density functions of missing values on independent, correlated, and anti-correlated distributions, respectively. Meanwhile, it is a big challenge to compute probabilistic skyline on incomplete data. We propose three efficient algorithms SPISkyline, SPCSkyline, and SPASkyline for probabilistic skyline computation on incomplete data complying with independent, correlated, and anti-correlated distributions, respectively. They employ pruning strategy, optimization of the process of probability computation, and sorting technique to improve the efficiency of probabilistic skyline computation on incomplete data. Our experimental results demonstrate that our proposed concept of probabilistic skyline is an effective method to tackle skyline query on incomplete data and our algorithms are tens of times faster than the naive algorithm on both synthetic and real datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Jul 1, 2020
Citations: 51	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Modeling and Computing Probabilistic Skyline on Incomplete Data

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Similar Papers

Probabilistic Skyline on Incomplete Data
Kaiqi Zhang ... Hong Gao
-
Kaiqi Zhang, et. al.Kaiqi Zhang ... Hong Gao
06 Nov 2017
06 Nov 2017

Efficient computation for probabilistic skyline over uncertain preferences
Arun K Pujari ... Vineet Padmanabhan
Information Sciences | VOL. 324
Arun K Pujari, et. al.Arun K Pujari ... Vineet Padmanabhan
27 Jun 2015
Information Sciences | VOL. 324

IDENTIFYING SKYLINES IN CLOUD DATABASES WITH INCOMPLETE DATA
Yonis Gulzar ... Imad Fakhri Al Shaikhli
Journal of Information and Communication Technology | VOL. 18
Yonis Gulzar, et. al.Yonis Gulzar ... Imad Fakhri Al Shaikhli
01 Jan 2018
Journal of Information and Communication Technology | VOL. 18

Answering skyline queries on probabilistic data using the dominance of probabilistic skyline tuples
Trieu Minh Nhut Le ... Zhen He
Information Sciences | VOL. 340-341
Trieu Minh Nhut Le, et. al.Trieu Minh Nhut Le ... Zhen He
11 Jan 2016
Information Sciences | VOL. 340-341

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Modeling and Computing Probabilistic Skyline on Incomplete Data

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering