Optimal algorithms for approximate clustering

Tomás Feder, Daniel Greene

doi:10.1145/62212.62255

Abstract

In a clustering problem, the aim is to partition a given set of n points in d-dimensional space into k groups, called clusters, so that points within each cluster are near each other. Two objective functions frequently used to measure the performance of a clustering algorithm are, for any L_q metric, (a) the maximum distance between pairs of points in the same cluster, and (b) the maximum distance between points in each cluster and a chosen cluster center; we refer to either measure as the cluster size. We show that one cannot approximate the optimal cluster size for a fixed number of clusters within a factor close to 2 in polynomial time, for two or more dimensions, unless P=NP. We also present an algorithm that achieves this factor of 2 in time O(n log k), and show that this running time is optimal in the algebraic decision tree model. For a fixed cluster size, on the other hand, we give a polynomial time approximation scheme that estimates the optimal number of clusters under the second measure of cluster size within factors arbitrarily close to 1. Our approach is extended to provide approximation algorithms for the restricted centers, suppliers, and weighted suppliers problems that run in optimal O(n log k) time and achieve optimal or nearly optimal approximation bounds.
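
To make the factor-of-2 guarantee under measure (b) concrete, here is a minimal sketch of farthest-first traversal, a well-known greedy heuristic that also stays within a factor of 2 of the optimal cluster size. It is not the O(n log k) algorithm of the paper: this version runs in O(nk) time, assumes the Euclidean (L_2) metric, and restricts centers to input points. The name `farthest_first_centers` is hypothetical and chosen only for this illustration.

```python
import math


def farthest_first_centers(points, k):
    """Pick up to k centers greedily; return (centers, cluster size).

    Illustrative O(n*k) sketch under the Euclidean metric, not the
    O(n log k) method described in the abstract.
    """
    if not points or k <= 0:
        return [], 0.0

    # Start from an arbitrary point (here: the first one).
    centers = [points[0]]
    # dist[i] is the distance from points[i] to its nearest chosen center.
    dist = [math.dist(p, centers[0]) for p in points]

    while len(centers) < min(k, len(points)):
        # The next center is the point farthest from all current centers.
        i = max(range(len(points)), key=dist.__getitem__)
        if dist[i] == 0.0:
            break  # every point already coincides with some center
        centers.append(points[i])
        # Update each point's distance to its nearest center.
        dist = [min(d, math.dist(p, points[i])) for d, p in zip(dist, points)]

    # Measure (b): the largest distance from any point to its center.
    return centers, max(dist)
```

For example, on the four collinear points (0, 0), (1, 0), (10, 0), (11, 0) with k = 2, the sketch picks one center from each well-separated pair, so every point lies within distance 1 of its center.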

