Computing Prominent Skyline on Massive Data

Xiaolong Wan,Xixian Han,Jinbao Wang

doi:10.1007/s41019-024-00259-6

Abstract

AbstractIn many practical applications, skyline query is an important operation to return the pareto optimal tuples, which provides a candidate set for the optimum. On massive data, skyline often reports too many results, the users will be overwhelmed and be difficult to find the desired information easily. This paper devises P-skyline to reduce the size of the returned results. Given the approximation factor, P-skyline only generates the prominent skyline results by the definition of p-dominance. To the best of our knowledge, this paper is the first work to study P-skyline problem. This paper first proposes a baseline algorithm, which requires one full table scan to compute the results. It is found that baseline algorithm incurs a relatively high execution cost on massive data. Then, PSTP algorithm is proposed, which consists of two stages: candidate acquisition and refinement. On the presorted table, PSTP utilizes selective retrieval and selective checking to process P-skyline with much lower I/O cost and computation cost. The extensive experimental results, conducted on synthetic and real-life data sets, show that PSTP can compute P-skyline on massive data efficiently.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Computing Prominent Skyline on Massive Data

Abstract

Talk to us

Similar Papers

More From: Data Science and Engineering

Lead the way for us

Journal: Data Science and Engineering	Publication Date: Dec 9, 2024
License type: CC BY 4.0

Similar Papers

Skyline Preference Query Based on Massive and Incomplete Dataset
Yan Wang ... Baoyan Song
IEEE Access | VOL. 5
Yan Wang, et. al.Yan Wang ... Baoyan Song
01 Jan 2017
IEEE Access | VOL. 5

Efficient Skyline Computation on Massive Incomplete Data
Jingxuan He ... Xixian Han
Data Science and Engineering | VOL. 7
Jingxuan He, et. al.Jingxuan He ... Xixian Han
03 Apr 2022
Data Science and Engineering | VOL. 7

Efficient computation of G-Skyline groups on massive data
Xixian Han ... Hong Gao
Information Sciences | VOL. 587
Xixian Han, et. al.Xixian Han ... Hong Gao
18 Dec 2021
Information Sciences | VOL. 587

Efficient Discovery of Functional Dependencies on Massive Data
Xiaolong Wan ... Jianzhong Li
IEEE Transactions on Knowledge and Data Engineering | VOL. 36
Xiaolong Wan, et. al.Xiaolong Wan ... Jianzhong Li
01 Jan 2024
IEEE Transactions on Knowledge and Data Engineering | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Computing Prominent Skyline on Massive Data

Abstract

Talk to us

Similar Papers

More From: Data Science and Engineering