Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information

James W Cooper, Aaron Kershenbaum

doi:10.1186/1471-2105-6-143

Abstract

Background: The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest.

Results: This paper reports a scalable method for the discovery of protein-protein interactions in Medline abstracts, using a combination of text analytics, statistical and graphical analysis, and a set of easily implemented rules. Applying these techniques to 12,300 abstracts yielded a precision of 0.61 and a recall of 0.97 (f = 0.74); when two-hop and three-hop relations discovered by graphical analysis were also allowed, the precision was 0.74 (f = 0.83).

Conclusion: This combination of linguistic and statistical approaches appears to provide the highest precision and recall thus far reported in detecting protein-protein relations using text analytic approaches.
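The f values quoted in the abstract can be related to the precision and recall figures with a short calculation. The sketch below assumes that f is the balanced F-measure (the harmonic mean of precision and recall), which the abstract does not state explicitly, and that recall stays at 0.97 when two-hop and three-hop relations are allowed. The function name `f_measure` is illustrative and does not come from the paper.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure: the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Rounded figures quoted in the abstract.
direct = f_measure(0.61, 0.97)

# Assumption: recall is unchanged (0.97) once multi-hop relations are allowed.
with_hops = f_measure(0.74, 0.97)

print(f"direct relations only: F = {direct:.3f}")
print(f"with two- and three-hop relations: F = {with_hops:.3f}")
```

Computed from the rounded inputs, the two values come out at roughly 0.75 and 0.84, within about 0.01 of the reported 0.74 and 0.83. The small gap is presumably because the reported f values were derived from unrounded precision and recall.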

Highlights

The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest.
There are a number of tabulations of protein-protein interactions, such as that provided by the Munich Institute for Protein Sequence (MIPS); these tabulations are of necessity incomplete.
We have been developing a group of biology-specific computational annotators that work in conjunction with our group's text analytic software, for the discovery of protein-protein relations in text.

Summary

Introduction

The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest. While the experimental study of protein-protein interactions remains the most important way of obtaining these data, the number of such interactions reported in the literature is substantial and growing rapidly. There are a number of tabulations of these interactions, such as that provided by the Munich Institute for Protein Sequence (MIPS); these tabulations are of necessity incomplete. To address this problem, we have been developing a group of biology-specific computational annotators that work in conjunction with our group's text analytic software, for the discovery of protein-protein relations in text. We undertook a study that uses a combination of computational linguistics, statistics and domain-specific rules to detect protein-protein interactions in a set of Medline abstracts. A scalable, robust system for protein interaction discovery provides a major information tool for molecular biologists.

Journal: BMC Bioinformatics
Publication Date: Jan 1, 2005
Citations: 39
License type: CC BY 2.0
