Co-Clustering via Information-Theoretic Markov Aggregation

Clemens Blochl,Bernhard C Geiger,Rana Ali Amjad

doi:10.1109/tkde.2018.2846252

Abstract

We present an information-theoretic cost function for co-clustering, i.e., for simultaneous clustering of two sets based on similarities between their elements. By constructing a simple random walk on the corresponding bipartite graph, our cost function is derived from a recently proposed generalized framework for information-theoretic Markov chain aggregation. The goal of our cost function is to minimize relevant information loss, hence it connects to the information bottleneck formalism. Moreover, via the connection to Markov aggregation, our cost function is not ad hoc, but inherits its justification from the operational qualities associated with the corresponding Markov aggregation problem. We furthermore show that, for appropriate parameter settings, our cost function is identical to well-known approaches from the literature, such as Information-Theoretic Co-Clustering of Dhillon et al. Hence, understanding the influence of this parameter admits a deeper understanding of the relationship between previously proposed information-theoretic cost functions. We highlight some strengths and weaknesses of the cost function for different parameters. We also illustrate the performance of our cost function, optimized with a simple sequential heuristic, on several synthetic and real-world data sets, including the Newsgroup20 and the MovieLens100k data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Co-Clustering via Information-Theoretic Markov Aggregation

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Apr 1, 2019
Citations: 33

Similar Papers

An extension of statistical decision theory with information theoretic cost functions to decision fusion: Part II
Michael B Hurley
Information Fusion | VOL. 6
Michael B HurleyMichael B Hurley
01 Jun 2005
Information Fusion | VOL. 6

Information-Theoretic Interactive Sensing and Inference for Autonomous Systems
Christopher Robbiano ... Mahmood R Azimi-Sadjadi
IEEE Transactions on Signal Processing | VOL. 69
Christopher Robbiano, et. al.Christopher Robbiano ... Mahmood R Azimi-Sadjadi
01 Jan 2020
IEEE Transactions on Signal Processing | VOL. 69

Kernel width adaptation in information theoretic cost functions
Abhishek Singh ... Jose C Principe
-
Abhishek Singh, et. al.Abhishek Singh ... Jose C Principe
01 Mar 2010
01 Mar 2010

Instance reduction for supervised learning using input-output clustering method
Anusorn Yodjaiphet ... Nipon Theera-Umpon
Journal of Central South University | VOL. 22
Anusorn Yodjaiphet, et. al.Anusorn Yodjaiphet ... Nipon Theera-Umpon
01 Dec 2015
Journal of Central South University | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Co-Clustering via Information-Theoretic Markov Aggregation

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering