Efficiently extendible mappings for balanced data distribution

D M Choy,L Stockmeyer,R Fagin

doi:10.1007/bf01940647

Abstract

In data storage applications, a large collection of consecutively numbered data “buckets” are often mapped to a relatively small collection of consecutively numbered storage “bins.” For example, in parallel database applications, buckets correspond to hash buckets of data and bins correspond to database nodes. In disk array applications, buckets correspond to logical tracks and bins correspond to physical disks in an array. Measures of the “goodness” of a mapping method include: One contribution of this paper is to give a new mapping method, theInterval-Round-Robin (IRR) method. The IRR method has optimal balance and relocation cost, and its time complexity and storage requirements compare favorably with known methods. Specifically, ifm is the number of times that the number of bins and/or buckets has increased, then the time complexity isO(logm) and the storage isO(m 2). Another contribution of the paper is to identify the concept of ahistory-independent mapping, meaning informally that the mapping does not “remember” the past history of expansions to the number of buckets and bins, but only the current number of buckets and bins. Thus, such mappings require very little information to be stored. Assuming that balance and relocation are optimal, we prove that history-independent mappings are possible if the number of buckets is fixed (so only the number of bins can increase), but not possible if the number of bins and buckets can both increase.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficiently extendible mappings for balanced data distribution

Abstract

Talk to us

Similar Papers

More From: Algorithmica

Lead the way for us

Journal: Algorithmica	Publication Date: Aug 1, 1996
Citations: 14

Similar Papers

An Efficient Approximation Scheme for Variable-Sized Bin Packing
Frank D Murgolo
SIAM Journal on Computing | VOL. 16
Frank D MurgoloFrank D Murgolo
01 Feb 1987
SIAM Journal on Computing | VOL. 16

On the optimal ordering of multiple-field tables
Paolo Ciaccia ... Dario Maio
Data & Knowledge Engineering | VOL. 14
Paolo Ciaccia, et. al.Paolo Ciaccia ... Dario Maio
01 Nov 1994
Data & Knowledge Engineering | VOL. 14

A Linear Time Side Match Vector Quantization Implementation
Kris Manohar ... The Duc Kieu
-
Kris Manohar, et. al.Kris Manohar ... The Duc Kieu
01 Feb 2020
01 Feb 2020

Ranged hash functions and the price of churn
...
-
, et. al. ...
20 Jan 2008
20 Jan 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficiently extendible mappings for balanced data distribution

Abstract

Talk to us

Similar Papers

More From: Algorithmica