Comparison of Clustering Approaches for Gene Expression Data

Anton Borg, Niklas Lavesson, Veselka Boeva

doi:10.3233/978-1-61499-330-8-55

Abstract

Clustering algorithms have been used to divide genes into groups according to the degree of their expression similarity. Such a grouping may suggest that the respective genes are correlated and/or co-regulated, and subsequently indicates that the genes could possibly share a common biological role. In this paper, four clustering algorithms are investigated: k-means, cut-clustering, spectral and expectation-maximization. The algorithms are benchmarked against each other. The performance of the four clustering algorithms is studied on time series expression data using Dynamic Time Warping distance in order to measure similarity between gene expression profiles. Four different cluster validation measures are used to evaluate the clustering algorithms: Connectivity and Silhouette Index for estimating the quality of clusters, Jaccard Index for evaluating the stability of a cluster method and Rand Index for assessing the accuracy. The obtained results are analyzed by Friedman's test and the Nemenyi post-hoc test. K-means is demonstrated to be significantly better than the spectral clustering algorithm under the Silhouette and Rand validation indices.

Keywords. gene expression data, graph-based clustering algorithm, minimum cut clustering, partitioning algorithm, dynamic time warping
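
As a rough illustration of the Dynamic Time Warping distance mentioned in the abstract, the following is a minimal sketch of the classic dynamic-programming formulation. The abstract does not state the local cost or any warping-window constraint the authors used, so the absolute-difference cost, the unconstrained warping path, and the function name `dtw_distance` are assumptions made here for illustration only.

```python
from typing import Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Unconstrained DTW distance between two expression profiles.

    Assumption: local cost is the absolute difference between points;
    the paper may use a different cost or a warping window.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # extend the cheapest of the three admissible predecessor alignments
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


if __name__ == "__main__":
    # Two hypothetical profiles with the same shape but different timing
    print(dtw_distance([1.0, 2.0, 3.0], [1.0, 2.0, 2.0, 3.0]))
```

A pairwise matrix of such distances between gene expression profiles could then be passed to any clustering method that accepts precomputed dissimilarities.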
