Sketch-and-solve approaches to k-means clustering by semidefinite programming

Charles Clum,Dustin G Mixon,Kaiying O’Hare,Soledad Villar

doi:10.1093/imaiai/iaae016


Journal: Information and Inference

Publication Date: Jul 1, 2024



Abstract

We study a sketch-and-solve approach to speed up the Peng–Wei semidefinite relaxation of $k$-means clustering. When the data are appropriately separated, we identify the $k$-means optimal clustering. Otherwise, our approach provides a high-confidence lower bound on the optimal $k$-means value. This lower bound is data-driven; it makes no assumption about the data or how they are generated. We provide code and an extensive set of numerical experiments in which we use this approach to certify approximate optimality of clustering solutions obtained by $k$-means++.
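
For orientation, the Peng–Wei relaxation mentioned above is usually stated as follows. This is a sketch in notation assumed here rather than taken from the paper: $x_1, \dots, x_n$ are the data points, $D$ is the matrix of squared distances $D_{ij} = \|x_i - x_j\|^2$, and $X$ is the $n \times n$ matrix variable. The paper's own normalization may differ.

$$
\begin{aligned}
\text{minimize}\quad & \tfrac{1}{2}\operatorname{tr}(DX) \\
\text{subject to}\quad & \operatorname{tr}(X) = k, \quad X\mathbf{1} = \mathbf{1}, \quad X \geq 0 \ \text{(entrywise)}, \quad X \succeq 0.
\end{aligned}
$$

Any clustering $C_1, \dots, C_k$ yields the feasible point $X = \sum_{t=1}^{k} \frac{1}{|C_t|} \mathbf{1}_{C_t} \mathbf{1}_{C_t}^\top$, whose objective value equals the $k$-means value of that clustering. The optimal value of the relaxation is therefore a lower bound on the optimal $k$-means value, which is why a relaxation value can certify that a given clustering is approximately optimal.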
