Abstract
Automatic extracting protein–protein interaction information from biomedical literature can help to build protein relation network, predict protein function and design new drugs. This paper presents a protein–protein interaction extraction system BioPPIExtractor for biomedical literature. This system applies Conditional Random Fields model to tag protein names in biomedical text, then uses a link grammar parser to identify the syntactic roles in sentences and at last extracts complete interactions by analyzing the matching contents of syntactic roles and their linguistically significant combinations. Experimental evaluations with two other state of the art extraction systems indicate that BioPPIExtractor system achieves better performance.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have