Bipartite isoperimetric graph partitioning for data co-clustering

Manjeet Rege,Farshad Fotouhi,Ming Dong

doi:10.1007/s10618-008-0091-4

Abstract

Data co-clustering refers to the problem of simultaneous clustering of two data types. Typically, the data is stored in a contingency or co-occurrence matrix C where rows and columns of the matrix represent the data types to be co-clustered. An entry C ij of the matrix signifies the relation between the data type represented by row i and column j. Co-clustering is the problem of deriving sub-matrices from the larger data matrix by simultaneously clustering rows and columns of the data matrix. In this paper, we present a novel graph theoretic approach to data co-clustering. The two data types are modeled as the two sets of vertices of a weighted bipartite graph. We then propose Isoperimetric Co-clustering Algorithm (ICA)--a new method for partitioning the bipartite graph. ICA requires a simple solution to a sparse system of linear equations instead of the eigenvalue or SVD problem in the popular spectral co-clustering approach. Our theoretical analysis and extensive experiments performed on publicly available datasets demonstrate the advantages of ICA over other approaches in terms of the quality, efficiency and stability in partitioning the bipartite graph.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bipartite isoperimetric graph partitioning for data co-clustering

Abstract

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery

Lead the way for us

Journal: Data Mining and Knowledge Discovery	Publication Date: Feb 9, 2008
Citations: 60

Similar Papers

Co-clustering Documents and Words Using Bipartite Isoperimetric Graph Partitioning
Manjeet Rege ... Farshad Fotouhi
-
Manjeet Rege, et. al.Manjeet Rege ... Farshad Fotouhi
01 Dec 2006
01 Dec 2006

Co-clustering documents and words using bipartite spectral graph partitioning
Inderjit S Dhillon
-
Inderjit S DhillonInderjit S Dhillon
26 Aug 2001
26 Aug 2001

A Bipartite Graph Partition-Based Coclustering Approach With Graph Nonnegative Matrix Factorization for Large Hyperspectral Images
Nan Huang ... Yang Xu
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60
Nan Huang, et. al.Nan Huang ... Yang Xu
01 Jan 2021
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60

Fast Flexible Bipartite Graph Model for Co-Clustering
Wei Chen ... Zhiguo Long
IEEE Transactions on Knowledge and Data Engineering | VOL. -
Wei Chen, et. al.Wei Chen ... Zhiguo Long
01 Jan 2021
IEEE Transactions on Knowledge and Data Engineering | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bipartite isoperimetric graph partitioning for data co-clustering

Abstract

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery