Named Entity Recognition From Biomedical Texts Using a Fusion Attention-Based BiLSTM-CRF

Hao Wei,Mingyuan Gao,Wen Qu,Mingyu Lu,Fei Chen,Ai Zhou,Chunli Wang

doi:10.1109/access.2019.2920734

Hao Wei, Mingyuan Gao + Show 5 more

Open Access

https://doi.org/10.1109/access.2019.2920734

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 73	License type: cc-by-nc-nd

Affiliation: Dalian Maritime University

Abstract

Biomedical named entity recognition (BNER) is the basis of biomedical text mining and one of the core sub-tasks of information extraction. Previous BNER models based on conventional machine learning rely on time-consuming feature engineering. Though most neural network methods improve the problems with automatic learning, they cannot pay attention to the significant areas when capturing features. In this paper, we propose an attention-based BiLSTM-CRF model. First, this model adopts a bidirectional long short-term memory network (BiLSTM) to obtain more complete context information. At the same time, the attention mechanism is proposed to improve the vector representations in BiLSTM. We design different attention weight redistribution methods and fuse them. It effectively prevents the significant information loss when extracting features. Finally, combining BiLSTM with conditional random field (CRF) layer effectively solves the problems of the inability to handle the strong dependence of tags in the sequence. With the simple architecture, our model achieves a reasonable performance on the JNLPBA corpus. It obtains an F1-score of 73.50. Our model can enhance the ability of the neural network to extract significant information and does not rely on any feature engineering, with only general pre-training word vectors. It makes our model have high portability and extendibility.

Full Text