Finding Out Biological Terms from Texts with CRFs for Reinforcement Learning

Zhao Hui Wang,Wei Huang

doi:10.4028/www.scientific.net/amm.198-199.1345

Abstract

The rapid growth of biological texts promotes the study of text mining which focuses on mining biological knowledge in various unstructured documents. Meanwhile, most biological text mining efforts are based on identifying biological terms such as gene and protein names. Therefore, how to identify biological terms effectively from text has become one of the important problems in bioinformatics. Conditional random fields (CRFs), an important machine learning algorithm, are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of observation and label pairs. Here we use CRFs in a class of temporal learning algorithms, reinforcement learning. Consequently the labels are actions that update the environment and affect the next observation. As a result, from the view of reinforcement learning, CRFs provide a way to model joint actions in a decentralized Markov decision process, which define how agents can communicate with each other to choose the optimal joint action. We use GENIA corpus to carry on training and testing the proposed approach. The result showed the system could find out biological terms from texts effectively. We get average precision rate=90.8%, average recall rate=90.6%, and average F1 rate=90.6% on six classes of biological terms. The results are pretty better than many other biological named entity recognition systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Finding Out Biological Terms from Texts with CRFs for Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Applied Mechanics and Materials

Lead the way for us

Similar Papers

Extracting Terms from Texts with Conditional Random Fields
Xun Lu ... Yixuan Li
-
Xun Lu, et. al.Xun Lu ... Yixuan Li
01 Jan 2015
01 Jan 2015

Graphical Models: Modeling, Optimization, and Hilbert Space Embedding

-

29 Jul 2010
29 Jul 2010

Conditional random fields for multi-agent reinforcement learning
Xinhua Zhang ... Douglas Aberdeen
-
Xinhua Zhang, et. al.Xinhua Zhang ... Douglas Aberdeen
20 Jun 2007
20 Jun 2007

Combined SVM-CRFs for Biological Named Entity Recognition with Maximal Bidirectional Squeezing
Fei Zhu ... Bairong Shen
PLoS ONE | VOL. 7
Fei Zhu, et. al.Fei Zhu ... Bairong Shen
26 Jun 2012
PLoS ONE | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Finding Out Biological Terms from Texts with CRFs for Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Applied Mechanics and Materials