A Gaussian limit process for optimal FIND algorithms

Henning Sulzbach,Michael Drmota,Ralph Neininger

doi:10.1214/ejp.v19-2933

Abstract

We consider versions of the FIND algorithm where the pivot element used is the median of a subset chosen uniformly at random from the data. For the median selection we assume that subsamples of size asymptotic to $c \cdot n^\alpha$ are chosen, where $0 < \alpha \leq \frac{1}{2}$, $c > 0$ and $n$ is the size of the data set to be split. We consider the complexity of FIND as a process in the rank to be selected and measured by the number of key comparisons required. After normalization we show weak convergence of the complexity to a centered Gaussian process as $n \to \infty$, which depends on $\alpha$. The proof relies on a contraction argument for probability distributions on càdlàg functions. We also identify the covariance function of the Gaussian limit process and discuss path and tail properties.

Highlights

After normalization we show weak convergence of the complexity to a centered Gaussian process as n → ∞, which depends only on α
The FIND algorithm is a selection algorithm, called Quickselect, to find an element of given rank in a set S of data, where the data set S is a subset of finite cardinality |S| of some ordered set
By induction we find that (Zn)n≥0 is a sequence of centered Gaussian processes

Summary

Introduction

The FIND algorithm is a selection algorithm, called Quickselect, to find an element of given rank in a set S of data, where the data set S is a subset of finite cardinality |S| of some ordered set. Martínez and Roura [35] give an average case analysis, where optimal choices for the tradeoff between better balanced sublists versus additional cost for the median selection are discussed Note that another idea to adapt the FIND algorithm is to not choose the median of a subsample but to choose an element that may depend on the rank searched for such that the sublist where the algorithm is recursively called may be small. A similar version of the Quicksort algorithm consists in choosing the pivot element in each step as a median of a random sub-sample of size k = k(n) ∼ cnα with n the size of the list to be split We conjecture that such a Quicksort algorithm admits a Gaussian limiting distribution for the normalized number of key comparisons. The second, Lemma 4.3, is needed in the study of the path variation of the limit process Z

Construction

Characterization of the limit process

Analysis of the Quickselect process

Preliminaries

Further properties of the limit process

The supremum of the limit process

Variation of paths

Binary topology and path continuity

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronic Journal of Probability	Publication Date: Jan 1, 2014
Citations: 38	License type: cc-by

R Discovery Prime

R Discovery Prime

A Gaussian limit process for optimal FIND algorithms

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronic Journal of Probability

Lead the way for us

Similar Papers

Ruin probability for Gaussian integrated processes
Krzysztof Dȩbicki
Stochastic Processes and their Applications | VOL. 98
Krzysztof DȩbickiKrzysztof Dȩbicki
27 Nov 2001
Stochastic Processes and their Applications | VOL. 98

Finite Sample Approximations of Exact and Entropic Wasserstein Distances Between Covariance Operators and Gaussian Processes
Hà Quang Minh
SIAM/ASA Journal on Uncertainty Quantification | VOL. 10
Hà Quang MinhHà Quang Minh
10 Jan 2022
SIAM/ASA Journal on Uncertainty Quantification | VOL. 10

Maximum and High Level Excursion of a Gaussian Process with Stationary Increments
Simeon M Berman
The Annals of Mathematical Statistics | VOL. 43
Simeon M BermanSimeon M Berman
01 Aug 1972
The Annals of Mathematical Statistics | VOL. 43

Large Deviations for Quadratic Functionals of Gaussian Processes
Włodzimierz Bryc ... Amir Dembo
Journal of Theoretical Probability | VOL. 10
Włodzimierz Bryc, et. al.Włodzimierz Bryc ... Amir Dembo
01 Jan 1997
Journal of Theoretical Probability | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Gaussian limit process for optimal FIND algorithms

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronic Journal of Probability