A Theoretical Analysis of Noisy Sparse Subspace Clustering on Dimensionality-Reduced Data

Yu-Xiang Wang,Yining Wang,Aarti Singh

doi:10.1109/tit.2018.2879912

Yu-Xiang Wang, Yining Wang + Show 1 more

Open Access

https://doi.org/10.1109/tit.2018.2879912

Copy DOI

Journal: IEEE Transactions on Information Theory	Publication Date: Feb 1, 2019
Citations: 70	License type: publisher-specific, author manuscript

Affiliation: Carnegie Mellon University

Abstract

Subspace clustering is the problem of partitioning unlabeled data points into a number of clusters so that data points within one cluster lie approximately on a low-dimensional linear subspace. In many practical scenarios, the dimensionality of data points to be clustered is compressed due to the constraints of measurement, computation, or privacy. In this paper, we study the theoretical properties of a popular subspace clustering algorithm named sparse subspace clustering (SSC) and establish formal success conditions of SSC on dimensionality-reduced data. Our analysis applies to the most general fully deterministic model, where both underlying subspaces and data points within each subspace are deterministically positioned, and also a wide range of dimensionality reduction techniques (e.g., Gaussian random projection, uniform subsampling, and sketching) that fall into a subspace embedding framework. Finally, we apply our analysis to a differentially private SSC algorithm and established both privacy and utility guarantees of the proposed method.

Full Text