Co-Clustering Under the Maximum Norm

Laurent Bulteau,Vincent Froese,Rolf Niedermeier,Sepp Hartung

doi:10.1007/978-3-319-13075-0_24

Abstract

AbstractCo-clustering, that is, partitioning a matrix into “homogeneous” submatrices, has many applications ranging from bioinformatics to election analysis. Many interesting variants of co-clustering are NP-hard. We focus on the basic variant of co-clustering where the homogeneity of a submatrix is defined in terms of minimizing the maximum distance between two entries. In this context, we spot several NP-hard as well as a number of relevant polynomial-time solvable special cases, thus charting the border of tractability for this challenging data clustering problem. For instance, we provide polynomial-time solvability when having to partition the rows and columns into two subsets each (meaning that one obtains four submatrices). When partitioning rows and columns into three subsets each, however, we encounter NP-hardness even for input matrices containing only values from \(\{ 0,1,2\}\).KeywordsInput MatrixBoolean FormulaCluster BoundaryColumn BlockGeneralize Maximum EntropyThese keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Full Text