A Novel Measure for Semantic Similarity Computation of Gene Ontology Terms Using Weighted Aggregation of Information Contents

Amir Lakizadeh,Saeed Jalili

doi:10.5812/zjrms.12041

Abstract

Background: Gene ontology (GO) is a well-structured knowledge of biological terms that describes roles of genes and their products in a standardized and organized controlled vocabulary format. Over the last decade, many measures are developed to exploit GO advantages to determine semantic similarities between biological entities. Using GO ontologies, there are some constraints that existing GO-based semantic similarity measures try to address them. For instance, (1) edges in a GO graph, do not indicate uniform distances and also have different densities, and (2) ignoring term levels in an ontology makes “shallow annotation” drawback, i.e., two terms with a certain distance near the root of GO graph have equal semantic similarity with two terms with the same distance but far from the root. Methods: Here, we present wAIC, a two-stage hybrid semantic similarity measure using weighted aggregation of information contents. In wAIC, the impact of each common ancestor on semantic similarity value is determined according to the location of the ancestor in the ontology graph. wAIC, also, filters (from annotating term set) terms that are in upper levels of the graph ontology to reduce shallow annotation constraints. Results: Experimental results confirm that the proposed measure is more consistent with major related constraints, such that, wAIC semantic similarity values have more correlation with both sequence similarity values and gene expression based similarity values than state-of-the-art semantic similarity measures. Conclusions: WAIC show using a weighted aggregation of common ancestors is completely consistent with the human perception and can improve accuracy of gene similarity measurement.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Measure for Semantic Similarity Computation of Gene Ontology Terms Using Weighted Aggregation of Information Contents

Abstract

Talk to us

Similar Papers

More From: Hepatitis Monthly

Lead the way for us

Journal: Hepatitis Monthly	Publication Date: Aug 31, 2017
License type: cc-by-nc

Similar Papers

Exploring information from the topology beneath the Gene Ontology terms to improve semantic similarity measures
Shu-Bo Zhang ... Jian-Huang Lai
Gene | VOL. 586
Shu-Bo Zhang, et. al.Shu-Bo Zhang ... Jian-Huang Lai
12 Apr 2016
Gene | VOL. 586

IntelliGO: a new vector-based semantic similarity measure including annotation origin.
Sidahmed Benabderrahmane ... Olivier Poch
BMC Bioinformatics | VOL. 11
Sidahmed Benabderrahmane, et. al.Sidahmed Benabderrahmane ... Olivier Poch
01 Dec 2010
BMC Bioinformatics | VOL. 11

A new hybrid semantic similarity measure using information content and topological features of the Gene Ontology graph
Pritha Dutta ... Mahantapas Kundu
-
Pritha Dutta, et. al.Pritha Dutta ... Mahantapas Kundu
01 Jan 2017
01 Jan 2017

An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology
Shobhit Jain ... Gary D Bader
BMC Bioinformatics | VOL. 11
Shobhit Jain, et. al.Shobhit Jain ... Gary D Bader
15 Nov 2010
BMC Bioinformatics | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Measure for Semantic Similarity Computation of Gene Ontology Terms Using Weighted Aggregation of Information Contents

Abstract

Talk to us

Similar Papers

More From: Hepatitis Monthly