Performance of Johnson--Lindenstrauss Transform for $k$-Means and $k$-Medians Clustering

Konstantin Makarychev,Ilya Razenshteyn,Yury Makarychev

doi:10.1137/20m1330701

Performance of Johnson--Lindenstrauss Transform for $k$-Means and $k$-Medians Clustering

Konstantin Makarychev, Ilya Razenshteyn + Show 1 more

Open Access

https://doi.org/10.1137/20m1330701

Copy DOI

Journal: SIAM Journal on Computing	Publication Date: Mar 14, 2022
Citations: 2

#Dimensional Subspace #Dimension Reduction + Show 2 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Consider an instance of Euclidean $k$-means or $k$-medians clustering. We show that the cost of the optimal solution is preserved up to a factor of $(1+\varepsilon)$ under a projection onto a random $O(\log(k / \varepsilon) / \varepsilon^2)$-dimensional subspace. Further, the cost of every clustering is preserved within $(1+\varepsilon)$. More generally, our result applies to any dimension reduction map satisfying a mild sub-Gaussian-tail condition. Our bound on the dimension is nearly optimal. Additionally, our result applies to Euclidean $k$-clustering with the distances raised to the $p$-th power for any constant $p$. For $k$-means, our result resolves an open problem posed by Cohen, Elder, Musco, Musco, and Persu (STOC 2015); for $k$-medians, it answers a question raised by Kannan.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: SIAM Journal on Computing

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.