Graph-based consensus clustering for class discovery from gene expression data

Zhiwen Yu, Hongqiang Wang, Hau-San Wong

doi:10.1093/bioinformatics/btm463

Abstract

Consensus clustering, also known as cluster ensemble, is an important technique for microarray data analysis and is particularly useful for class discovery from microarray data. Compared with traditional clustering algorithms, consensus clustering approaches can integrate multiple partitions from different cluster solutions to improve the robustness, stability, scalability and parallelization of the clustering algorithms. By consensus clustering, one can discover the underlying classes of the samples in gene expression data. In addition to exploring a graph-based consensus clustering (GCC) algorithm to estimate the underlying classes of the samples in microarray data, we also design a new validation index to determine the number of classes in microarray data. To our knowledge, this is the first application of GCC to class discovery for microarray data. Given a prespecified maximum number of classes (denoted as K_max in this article), our algorithm can discover the true number of classes for the samples in microarray data according to a new cluster validation index called the Modified Rand Index. Experiments on gene expression data indicate that our new algorithm can (i) outperform most of the existing algorithms, (ii) identify the number of classes correctly in real cancer datasets, and (iii) discover the classes of samples with biological meaning. Matlab source code for the GCC algorithm is available upon request from Zhiwen Yu.
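The general idea of integrating multiple partitions into one consensus partition can be illustrated with a small sketch. This is not the authors' GCC algorithm, whose details the abstract does not give. It assumes a simple co-association approach: count how often each pair of samples is clustered together, treat strongly co-clustered pairs as graph edges, and read the consensus classes off the connected components. The function names, the 0.5 threshold and the toy partitions are all hypothetical, and `rand_index` computes the ordinary Rand index, not the Modified Rand Index proposed in the paper.

```python
from itertools import combinations


def co_association(partitions):
    """Fraction of partitions in which each pair of samples shares a cluster."""
    n = len(partitions[0])
    counts = [[0] * n for _ in range(n)]
    for labels in partitions:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    return [[c / len(partitions) for c in row] for row in counts]


def consensus_by_components(coassoc, threshold=0.5):
    """Assumed graph step: connect samples whose co-association exceeds
    the threshold and label each connected component as one class."""
    n = len(coassoc)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and coassoc[i][j] > threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


def rand_index(a, b):
    """Ordinary Rand index: share of sample pairs on which two partitions
    agree (both together or both apart). Not the paper's modified index."""
    pairs = list(combinations(range(len(a)), 2))
    agree = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in pairs)
    return agree / len(pairs)


# Three toy partitions of six samples (made-up data, label names arbitrary).
partitions = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],  # same grouping as the first, labels swapped
    [0, 0, 1, 1, 2, 2],  # a noisier three-cluster solution
]

consensus = consensus_by_components(co_association(partitions))
print(consensus)
for p in partitions:
    print(rand_index(consensus, p))
```

In this toy case the two agreeing partitions outvote the noisier one, so the consensus recovers two classes. A full method would also need a rule for choosing the number of classes, which is the role the abstract assigns to the Modified Rand Index.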

Journal: Bioinformatics
Publication Date: Sep 14, 2007
Citations: 186

Similar Papers

Semi-supervised consensus clustering for gene expression data analysis.
Yunli Wang ... Youlian Pan
BioData Mining | VOL. 7 | 08 May 2014

A formal concept analysis approach to consensus clustering of multi-experiment expression data.
Anna Hristoskova ... Veselka Boeva
BMC Bioinformatics | VOL. 15 | 19 May 2014

SC2ATmd: a tool for integration of the figure of merit with cluster analysis for gene expression data
Amy L Olex ... Jacquelyn S Fetrow
Bioinformatics | VOL. 27 | 03 Mar 2011

A clustering ensemble framework based on selection of fuzzy weighted clusters in a locally adaptive clustering algorithm
Hamid Parvin ... Behrouz Minaei-Bidgoli
Pattern Analysis and Applications | VOL. 18 | 22 Jan 2014
