On a two-truths phenomenon in spectral graph clustering

Carey E Priebe,Youngser Park,Joshua T Vogelstein,John M Conroy,Vince Lyzinski,Minh Tang,Avanti Athreya,Joshua Cape,Eric Bridgeford

doi:10.1073/pnas.1814462116

Abstract

Clustering is concerned with coherently grouping observations without any explicit concept of true groupings. Spectral graph clustering-clustering the vertices of a graph based on their spectral embedding-is commonly approached via K-means (or, more generally, Gaussian mixture model) clustering composed with either Laplacian spectral embedding (LSE) or adjacency spectral embedding (ASE). Recent theoretical results provide deeper understanding of the problem and solutions and lead us to a "two-truths" LSE vs. ASE spectral graph clustering phenomenon convincingly illustrated here via a diffusion MRI connectome dataset: The different embedding methods yield different clustering results, with LSE capturing left hemisphere/right hemisphere affinity structure and ASE capturing gray matter/white matter core-periphery structure.

Highlights

Clustering is concerned with coherently grouping observations without any explicit concept of true groupings
Spectral graph clustering—clustering the vertices of a graph based on their spectral embedding—is commonly approached via K-means clustering composed with either Laplacian spectral embedding (LSE) or adjacency spectral embedding (ASE)
Our interest is to compare and contrast the two spectral embedding methods for clustering into two clusters. We demonstrate that this synthetic case exhibits the two-truths phenomenon both theoretically and in simulation—the {LG,LW,RG,RW} a priori projection of our composite connectome yields a four-block two-truths stochastic block model (SBM)

Summary

Introduction

Clustering is concerned with coherently grouping observations without any explicit concept of true groupings. It is often the case that practitioners cluster the vertices of a graph—say, via K -means clustering composed with Laplacian spectral embedding—and pronounce the method as having performed either well or poorly based on whether the resulting clusters correspond well or poorly with some known or preconceived notion of “correct” clustering. Such a procedure may be used to compare two clustering methods and to pronounce that one works better (on the particular data under consideration). With respect to graph clustering, ref. 1 shows that there can be no algorithm that is optimal for all possible community detection tasks (Fig. 1)

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the National Academy of Sciences	Publication Date: Mar 8, 2019
Citations: 61	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

On a two-truths phenomenon in spectral graph clustering

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences

Lead the way for us

Similar Papers

Gaussian Mixture Model Clustering with Incomplete Data
Yi Zhang ... Miaomiao Li
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 17
Yi Zhang, et. al.Yi Zhang ... Miaomiao Li
31 Jan 2021
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 17

Structured graph optimization for joint spectral embedding and clustering
Xiaojun Yang ... Liang Lin
Neurocomputing | VOL. 503
Xiaojun Yang, et. al.Xiaojun Yang ... Liang Lin
30 Jun 2022
Neurocomputing | VOL. 503

GMM clustering for heating load patterns in-depth identification and prediction model accuracy improvement of district heating system
Yakai Lu ... Hejia Zhang
Energy and Buildings | VOL. 190
Yakai Lu, et. al.Yakai Lu ... Hejia Zhang
18 Feb 2019
Energy and Buildings | VOL. 190

GMM clustering for in-depth food accessibility pattern exploration and prediction model of food demand behavior
Rahul Srinivas Sucharitha ... Seokcheon Lee
Socio-Economic Planning Sciences | VOL. 83
Rahul Srinivas Sucharitha, et. al.Rahul Srinivas Sucharitha ... Seokcheon Lee
16 Jun 2022
Socio-Economic Planning Sciences | VOL. 83

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On a two-truths phenomenon in spectral graph clustering

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences