Submodularity of Distributed Join Computation.

Rundong Li,Xinyan Deng,Mirek Riedewald

doi:10.1145/3183713.3183728

Abstract

We study distributed equi-join computation in the presence of join-attribute skew, which causes load imbalance. Skew can be addressed by more fine-grained partitioning, at the cost of input duplication. For random load assignment, e.g., using a hash function, fine-grained partitioning creates a tradeoff between load expectation and variance. We show that minimizing load variance subject to a constraint on expectation is a monotone submodular maximization problem with Knapsack constraints, hence admitting provably near-optimal greedy solutions. In contrast to previous work on formal optimality guarantees, we can prove this result also for self-joins and more general load functions defined as weighted sum of input and output. We further demonstrate through experiments that this theoretical result leads to an effective algorithm for the problem of minimizing running time, even when load is assigned deterministically.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings. ACM-SIGMOD International Conference on Management of Data	Publication Date: May 27, 2018
Citations: 9	License type: mit

R Discovery Prime

R Discovery Prime

Submodularity of Distributed Join Computation.

Abstract

Talk to us

Similar Papers

More From: Proceedings. ACM-SIGMOD International Conference on Management of Data

Lead the way for us

Similar Papers

Fast algorithms for maximizing submodular functions
...
-
, et. al. ...
05 Jan 2014
05 Jan 2014

Fast algorithms for maximizing submodular functions
Ashwinkumar Badanidiyuru ... Jan Vondrák
-
Ashwinkumar Badanidiyuru, et. al.Ashwinkumar Badanidiyuru ... Jan Vondrák
18 Dec 2013
18 Dec 2013

Non-monotone submodular maximization under matroid and knapsack constraints
Jon Lee ... Maxim Sviridenko
-
Jon Lee, et. al.Jon Lee ... Maxim Sviridenko
31 May 2009
31 May 2009

Maximizing a Monotone Submodular Function with a Bounded Curvature under a Knapsack Constraint
Yuichi Yoshida
SIAM Journal on Discrete Mathematics | VOL. 33
Yuichi YoshidaYuichi Yoshida
01 Jan 2019
SIAM Journal on Discrete Mathematics | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Submodularity of Distributed Join Computation.

Abstract

Talk to us

Similar Papers

More From: Proceedings. ACM-SIGMOD International Conference on Management of Data