Training the Hidden Vector State Model from Un-annotated Corpus

Deyu Zhou,Chee Keong Kwoh,Yulan He

doi:10.1007/978-3-540-72586-2_54

Abstract

Since most knowledge about protein-protein interactions still hides in biological publications, there is an increasing focus on automatically extracting information from the vast amount of biological literature. Existing approaches can be broadly categorized as rule-based or statistically-based. Rule-based approaches require heavy manual effort. On the other hand, statistically-based approaches require large-scale, richly annotated corpora in order to reliably estimate model parameters. This is normally difficult to obtain in practical applications. We have proposed a hidden vector state (HVS) model for protein-protein interactions extraction. The HVS model is an extension of the basic discrete Markov model in which context is encoded as a stack-oriented state vector. State transitions are factored into a stack shift operation similar to those of a push-down automaton followed by the push of a new preterminal category label. In this paper, we propose a novel approach based on the k-nearest-neighbors classifier to automatically train the HVS model from un-annotated data. Experimental results show the improved performance over the baseline system with the HVS model trained from a small amount of the annotated data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Training the Hidden Vector State Model from Un-annotated Corpus

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Semi-supervised Learning of the Hidden Vector State Model for Protein-Protein Interactions Extraction
Deyu Zhou ... Yulan He
-
Deyu Zhou, et. al.Deyu Zhou ... Yulan He
01 Jan 2007
01 Jan 2007

Semi-supervised learning of the hidden vector state model for extracting protein–protein interactions
Deyu Zhou ... Chee Keong Kwoh
Artificial Intelligence In Medicine | VOL. 41
Deyu Zhou, et. al.Deyu Zhou ... Chee Keong Kwoh
17 Aug 2007
Artificial Intelligence In Medicine | VOL. 41

Ontology-Based Protein-Protein Interactions Extraction from Literature Using the Hidden Vector State Model
Yulan He ... Keiichi Nakata
-
Yulan He, et. al.Yulan He ... Keiichi Nakata
01 Dec 2008
01 Dec 2008

Semantic processing using the Hidden Vector State model
Yulan He ... Steve Young
Computer Speech & Language | VOL. 19
Yulan He, et. al.Yulan He ... Steve Young
23 Mar 2004
Computer Speech & Language | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Training the Hidden Vector State Model from Un-annotated Corpus

Abstract

Talk to us

Similar Papers