Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering

Dikla Dotan-Cohen, Simon Kasif, Avraham A Melkman

doi:10.1093/bioinformatics/btp327

Open Access

https://doi.org/10.1093/bioinformatics/btp327

Journal: Bioinformatics
Publication Date: Jun 3, 2009
Citations: 50
License type: CC BY-NC 2.0 UK

Affiliation: Boston Children's Hospital, Boston University

Abstract

Motivation: There is a growing interest in improving the cluster analysis of expression data by incorporating into it prior knowledge, such as the Gene Ontology (GO) annotations of genes, in order to improve the biological relevance of the clusters that are subjected to subsequent scrutiny. The structure of the GO is another source of background knowledge that can be exploited through the use of semantic similarity.

Results: We propose here a novel algorithm that integrates semantic similarities (derived from the ontology structure) into the procedure of deriving clusters from the dendrogram constructed during expression-based hierarchical clustering. Our approach can handle the multiple annotations, from different levels of the GO hierarchy, which most genes have. Moreover, it treats annotated and unannotated genes in a uniform manner. Consequently, the clusters obtained by our algorithm are characterized by significantly enriched annotations. In both cross-validation tests and when using an external index such as protein–protein interactions, our algorithm performs better than previous approaches. When applied to human cancer expression data, our algorithm identifies, among others, clusters of genes related to immune response and glucose metabolism. These clusters are also supported by protein–protein interaction data.

Contact: dotna@cs.bgu.ac.il

Supplementary information: Supplementary data are available at Bioinformatics online.
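
The abstract describes deriving clusters from an expression-based dendrogram with the help of GO semantic similarity, but it does not spell out the procedure. The following is only a toy sketch of that general idea, not the authors' algorithm: it walks a dendrogram from the root and keeps a subtree as one cluster when the mean pairwise semantic similarity of its annotated genes reaches a threshold. The nested-tuple tree encoding, the `sim` lookup table, the gene names, the similarity values, and the threshold are all hypothetical and chosen for illustration.

```python
from itertools import combinations

# Assumption: a dendrogram node is a gene name (leaf) or a (left, right) tuple.
# Assumption: `sim` maps an unordered gene pair to a semantic similarity in [0, 1];
# pairs involving an unannotated gene are simply absent from the table.


def leaves(node):
    """Return the gene names under a dendrogram node, left to right."""
    if isinstance(node, str):
        return [node]
    left, right = node
    return leaves(left) + leaves(right)


def mean_similarity(genes, sim):
    """Mean similarity over the gene pairs that have a score, or None if none do."""
    scores = [sim[frozenset(pair)] for pair in combinations(genes, 2)
              if frozenset(pair) in sim]
    return sum(scores) / len(scores) if scores else None


def cut(node, sim, threshold):
    """Descend from the root; keep a subtree whole once it is coherent enough."""
    genes = leaves(node)
    score = mean_similarity(genes, sim)
    if isinstance(node, str) or (score is not None and score >= threshold):
        return [genes]
    left, right = node
    return cut(left, sim, threshold) + cut(right, sim, threshold)


# Hypothetical example: "u1" has no annotation, so it appears in no scored pair.
tree = ((("g1", "g2"), "u1"), ("g3", "g4"))
sim = {
    frozenset(("g1", "g2")): 0.9,
    frozenset(("g3", "g4")): 0.8,
    frozenset(("g1", "g3")): 0.1,
    frozenset(("g1", "g4")): 0.2,
    frozenset(("g2", "g3")): 0.1,
    frozenset(("g2", "g4")): 0.2,
}

clusters = cut(tree, sim, threshold=0.6)
print(clusters)  # [['g1', 'g2', 'u1'], ['g3', 'g4']]
```

In this sketch the unannotated gene `u1` stays with the expression neighbors it was merged with, because only scored pairs influence the decision to cut. How the published method actually handles multiple annotations and unannotated genes is described in the paper itself, not here.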
