Distributed threshold querying of general functions by a difference of monotonic representation

Guy Sagy,Izchak Sharfman,Daniel Keren,Assaf Schuster

doi:10.14778/1921071.1921072

Abstract

The goal of athreshold queryis to detect all objects whose score exceeds a given threshold. This type of query is used in many settings, such as data mining, event triggering, and top-kselection. Often, threshold queries are performed overdistributed data. Given database relations that are distributed over many nodes, an object's score is computed by aggregating the value of each attribute, applying a given scoring function over the aggregation, and thresholding the function's value. However, joining all the distributed relations to a central database might incur prohibitive overheads in bandwidth, CPU, and storage accesses. Efficient algorithms required to reduce these costs exist only for monotonic aggregation threshold queries and certain specific scoring functions.We present a novel approach for efficiently performing general distributed threshold queries. To the best of our knowledge, this is the first solution to the problem of performing such queries with general scoring functions. We first present a solution for monotonic functions, and then introduce a technique to solve for other functions by representing them as a difference of monotonic functions. Experiments with real-world data demonstrate the method's effectiveness in achieving low communication and access costs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Distributed threshold querying of general functions by a difference of monotonic representation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment

Lead the way for us

Journal: Proceedings of the VLDB Endowment	Publication Date: Nov 1, 2010
Citations: 55

Similar Papers

Novel Parameterized Score Functions on Interval-Valued Intuitionistic Fuzzy Sets With Three Fuzziness Measure Indexes and Their Application
Fangwei Zhang ... Guan Ke Liew
IEEE Access | VOL. 7
Fangwei Zhang, et. al.Fangwei Zhang ... Guan Ke Liew
01 Jan 2019
IEEE Access | VOL. 7

SEMI-SUPERVISED THRESHOLD QUERIES ON PHARMACOGENOMICS TIME SEQUENCES
J Assfalg ... P Kunath
-
J Assfalg, et. al.J Assfalg ... P Kunath
01 Dec 2005
01 Dec 2005

Knowledge-Based Scoring Functions in Drug Design. 1. Developing a Target-Specific Method for Kinase−Ligand Interactions
Mengzhu Xue ... Hualiang Jiang
Journal of Chemical Information and Modeling | VOL. 50
Mengzhu Xue, et. al.Mengzhu Xue ... Hualiang Jiang
03 Aug 2010
Journal of Chemical Information and Modeling | VOL. 50

MinSearch: An Efficient Algorithm for Similarity Search under Edit Distance
Haoyu Zhang ... Qin Zhang
-
Haoyu Zhang, et. al.Haoyu Zhang ... Qin Zhang
20 Aug 2020
20 Aug 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Distributed threshold querying of general functions by a difference of monotonic representation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment