Rough Based Symmetrical Clustering for Gene Expression Profile Analysis.

Anasua Sarkar,Ujjwal Maulik

doi:10.1109/tnb.2015.2421323

Abstract

Identification of coexpressed genes is the central goal in microarray gene expression data analysis. Point symmetry-based clustering is an important unsupervised learning technique for recognizing symmetrical convex or non-convex shaped clusters. To enable fast automatic clustering of large microarray data, in this article, a distributed time-efficient scalable parallel rough set based hybrid approach for point symmetry-based clustering algorithm has been proposed. A natural basis for analyzing gene expression data using the symmetry-based algorithm, is to group together genes with similar symmetrical patterns of expression. Rough-set theory helps in faster convergence and initial automatic optimal classification, thereby solving the problem of unknown knowledge of number of clusters in microarray data. This new parallel implementation with K-means algorithm also satisfies the linear speedup in timing on large microarray datasets. This proposed algorithm is compared with another parallel symmetry-based K-means and parallel version of existing K-means over four artificial and benchmark microarray datasets. We also have experimented over three skewed cancer gene expression datasets. The statistical analysis are also performed to establish the significance of this new implementation. The biological relevance of the clustering solutions are also analyzed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Rough Based Symmetrical Clustering for Gene Expression Profile Analysis.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on nanobioscience

Lead the way for us

Journal: IEEE transactions on nanobioscience	Publication Date: Apr 29, 2015
Citations: 22

Similar Papers

Gene microarray data analysis using parallel point-symmetry-based clustering.
Anasua Sarkar ... Ujjwal Maulik
International journal of data mining and bioinformatics | VOL. 11
Anasua Sarkar, et. al.Anasua Sarkar ... Ujjwal Maulik
25 Aug 2010
International journal of data mining and bioinformatics | VOL. 11

Parallel Point Symmetry Based Clustering for Gene Microarray Data
Anasua Sarkar ... Ujjwal Maulik
-
Anasua Sarkar, et. al.Anasua Sarkar ... Ujjwal Maulik
01 Feb 2009
01 Feb 2009

GRID distribution supports clustering validation of large mixed microarray data sets
Angelica Tulipano ... Leonardo Angelini
EMBnet.journal | VOL. 17
Angelica Tulipano, et. al.Angelica Tulipano ... Leonardo Angelini
12 May 2011
EMBnet.journal | VOL. 17

Combining Hadoop and GPU to preprocess large Affymetrix microarray data
Sufeng Niu ... Pradip Srimani
-
Sufeng Niu, et. al.Sufeng Niu ... Pradip Srimani
01 Oct 2014
01 Oct 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rough Based Symmetrical Clustering for Gene Expression Profile Analysis.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on nanobioscience