Annotating proteins by mining protein interaction networks

Gultekin Ozsoyoglu,Jiong Yang,Mustafa Kirac

doi:10.1093/bioinformatics/btl221

Gultekin Ozsoyoglu, Jiong Yang + Show 1 more

Open Access

https://doi.org/10.1093/bioinformatics/btl221

Copy DOI

Journal: Bioinformatics	Publication Date: Jul 15, 2006
Citations: 31	License type: implied-oa

Affiliation: Case Western Reserve University

Abstract

In general, most accurate gene/protein annotations are provided by curators. Despite having lesser evidence strengths, it is inevitable to use computational methods for fast and a priori discovery of protein function annotations. This paper considers the problem of assigning Gene Ontology (GO) annotations to partially annotated or newly discovered proteins. We present a data mining technique that computes the probabilistic relationships between GO annotations of proteins on protein-protein interaction data, and assigns highly correlated GO terms of annotated proteins to non-annotated proteins in the target set. In comparison with other techniques, probabilistic suffix tree and correlation mining techniques produce the highest prediction accuracy of 81% precision with the recall at 45%. Code is available upon request. Results and used materials are available online at http://kirac.case.edu/PROTAN.

Full Text