Information-based clustering.

Noam Slonim,Gurinder Singh Atwal,William Bialek,Gašper Tkačik

doi:10.1073/pnas.0507432102

Noam Slonim, Gurinder Singh Atwal + Show 2 more

Open Access

https://doi.org/10.1073/pnas.0507432102

Copy DOI

Abstract

In an age of increasingly large data sets, investigators in many different disciplines have turned to clustering as a tool for data analysis and exploration. Existing clustering methods, however, typically depend on several nontrivial assumptions about the structure of data. Here, we reformulate the clustering problem from an information theoretic perspective that avoids many of these assumptions. In particular, our formulation obviates the need for defining a cluster "prototype," does not require an a priori similarity metric, is invariant to changes in the representation of the data, and naturally captures nonlinear relations. We apply this approach to different domains and find that it consistently produces clusters that are more coherent than those extracted by existing algorithms. Finally, our approach provides a way of clustering based on collective notions of similarity rather than the traditional pairwise measures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Information-based clustering.

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences

Lead the way for us

Journal: Proceedings of the National Academy of Sciences	Publication Date: Dec 13, 2005
Citations: 225

Similar Papers

A Novel Information-Theoretic Approach for Variable Clustering and Predictive Modeling Using Dirichlet Process Mixtures
Yun Chen ... Hui Yang
Scientific Reports | VOL. 6
Yun Chen, et. al.Yun Chen ... Hui Yang
01 Dec 2016
Scientific Reports | VOL. 6

Uncovering hierarchical structure in data using the growing hierarchical self-organizing map
Michael Dittenbach ... Dieter Merkl
Neurocomputing | VOL. 48
Michael Dittenbach, et. al.Michael Dittenbach ... Dieter Merkl
29 Aug 2002
Neurocomputing | VOL. 48

Visualization and Exploration of Complex Scientific Data
-
Turkish Online Journal of Qualitative Inquiry | VOL. -
--
01 Jan 2023
Turkish Online Journal of Qualitative Inquiry | VOL. -

SAGES consensus recommendations on surgical video data use, structure, and exploration (for research in artificial intelligence, clinical quality improvement, and surgical education)
Jennifer A Eckhoff ... Nicolas Padoy
Surgical Endoscopy | VOL. 37
Jennifer A Eckhoff, et. al.Jennifer A Eckhoff ... Nicolas Padoy
29 Jul 2023
Surgical Endoscopy | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Information-based clustering.

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences