Bregman Clustering for Separable Instances

Marcel R Ackermann,Johannes Blömer

doi:10.1007/978-3-642-13731-0_21

Abstract

The Bregman k-median problem is defined as follows. Given a Bregman divergence Dφ and a finite set $P \subseteq {\mathbb R}^d$ of size n, our goal is to find a set C of size k such that the sum of errors cost(P,C)=∑p∈P min c∈C Dφ(p,c) is minimized. The Bregman k-median problem plays an important role in many applications, e.g., information theory, statistics, text classification, and speech processing. We study a generalization of the kmeans++ seeding of Arthur and Vassilvitskii (SODA '07). We prove for an almost arbitrary Bregman divergence that if the input set consists of k well separated clusters, then with probability $2^{-{\mathcal O}(k)}$ this seeding step alone finds an ${\mathcal O}(1)$-approximate solution. Thereby, we generalize an earlier result of Ostrovsky et al. (FOCS '06) from the case of the Euclidean k-means problem to the Bregman k-median problem. Additionally, this result leads to a constant factor approximation algorithm for the Bregman k-median problem using at most $2^{{\mathcal O}(k)}n$ arithmetic operations, including evaluations of Bregman divergence Dφ.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bregman Clustering for Separable Instances

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Coresets and approximate clustering for Bregman divergences
...
-
, et. al. ...
04 Jan 2009
04 Jan 2009

Nonparametric Bayesian estimation on the exponentiated inverse Weibull distribution with record values
Jung In Seo ... Yongku Kim
Journal of the Korean Data and Information Science Society | VOL. 25
Jung In Seo, et. al.Jung In Seo ... Yongku Kim
31 May 2014
Journal of the Korean Data and Information Science Society | VOL. 25

An Approach to Semantic Text Similarity Computing
Imen Akermi ... Rim Faiz
-
Imen Akermi, et. al.Imen Akermi ... Rim Faiz
01 Jan 2014
01 Jan 2014

Global error estimation for linear ordinary differential equations and their numerical optimal solutions
...
SCIENTIA SINICA Mathematica | VOL. 51
, et. al. ...
22 Dec 2020
SCIENTIA SINICA Mathematica | VOL. 51

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bregman Clustering for Separable Instances

Abstract

Talk to us

Similar Papers