Multitask learning for biomedical named entity recognition with cross-sharing structure

Xi Wang,Ke Xu,Jiagao Lyu,Li Dong

doi:10.1186/s12859-019-3000-5

Abstract

BackgroundBiomedical named entity recognition (BioNER) is a fundamental and essential task for biomedical literature mining, which affects the performance of downstream tasks. Most BioNER models rely on domain-specific features or hand-crafted rules, but extracting features from massive data requires much time and human efforts. To solve this, neural network models are used to automatically learn features. Recently, multi-task learning has been applied successfully to neural network models of biomedical literature mining. For BioNER models, using multi-task learning makes use of features from multiple datasets and improves the performance of models.ResultsIn experiments, we compared our proposed model with other multi-task models and found our model outperformed the others on datasets of gene, protein, disease categories. We also tested the performance of different dataset pairs to find out the best partners of datasets. Besides, we explored and analyzed the influence of different entity types by using sub-datasets. When dataset size was reduced, our model still produced positive results.ConclusionWe propose a novel multi-task model for BioNER with the cross-sharing structure to improve the performance of multi-task models. The cross-sharing structure in our model makes use of features from both datasets in the training procedure. Detailed analysis about best partners of datasets and influence between entity categories can provide guidance of choosing proper dataset pairs for multi-task training. Our implementation is available at https://github.com/JogleLew/bioner-cross-sharing.

Highlights

Biomedical named entity recognition (BioNER) is a fundamental and essential task for biomedical literature mining, which affects the performance of downstream tasks
All the multi-task models in the paper are implemented based on this single-task model; we choose it as our baseline model
Multi-task model with cross-sharing structure (CS-multi-task models (MTM)) we introduce our multi-task model with cross-sharing structure

Summary

Introduction

Biomedical named entity recognition (BioNER) is a fundamental and essential task for biomedical literature mining, which affects the performance of downstream tasks. Multi-task learning has been applied successfully to neural network models of biomedical literature mining. For BioNER models, using multi-task learning makes use of features from multiple datasets and improves the performance of models. Biomedical named entity recognition (BioNER) aims at use hand-crafted rules [9] and domain-specific features annotating named entity mentions with their entity types [10], such as orthographic features, morphological fea-. (e.g., genes, proteins [1], and diseases [2]) in the input tures [11,12,13,14] The drawback of these neural network biomedical text. BioNER models provide useful information for down- mance; features used in one BioNER model may not stream tasks of biomedical literature mining, such as work well in another. Most previous Random Field (BiLSTM-CRF) [15], and other models systems treat the task as a sequence labeling problem. have extra character-level CNN [16, 17] or character-level

Methods

Results

Discussion

Conclusion