Abstract

The lack of human annotations has been one of the main obstacles for neural named entity recognition in low-resource domains. To address this problem, there have been many efforts on automatically generating silver annotations according to domain-specific dictionaries. However, the information of domain dictionaries is usually limited, and the generated annotations may be noisy which poses significant challenges on learning effective models. In this work, we try to alleviate these issues by introducing a dictionary-guided graph attention model. First, domain-specific dictionaries are utilized to extract entity mention candidates by a graph matching algorithm, which can capture word patterns of domain entities. Furthermore, a word-mention interactive graph is leveraged to integrate the semantic and boundary information of entities into their context. We evaluated our model on the biomedical-domain datasets of recognizing chemical and disease entities, namely BC5CDR and NCBI disease corpora. The results show that our model outperforms several state-of-the-art models with different methodologies, such as feature-based models (e.g., BANNER), ensemble models (e.g., CollaboNet), multi-task learning models (e.g., MTM-CW), dictionary-based models (e.g., AutoNER). Moreover, the performance of our model is also comparable with BioBERT that owns huge parameters and needs large-scale pre-training.

Highlights

  • Named entity recognition (NER) is a task that extracts entity mentions from texts and classifies them into predefined types, such as person and location in the general domain [1], [2], and disease and chemical in the biomedical domain [3], [4]

  • NER is a fundamental task in natural language process (NLP) and bioinformatics, which is crucial for many downstream applications including relation extraction [5] and event extraction [6]

  • We evaluate our approach on two NER datasets, namely the BC5CDR and NCBI disease corpora [15], [16]

Read more

Summary

A Graph Attention Model for Dictionary-Guided Named Entity Recognition

This work was supported in part by the Major Project of the National Social Science Foundation of China under Grant 11&ZD189, in part by the National Key Research and Development Program of China under Grant 2017FC1200500, in part by the Major Project of Ministry of Education under Grant 18JZD015, in part by the National Natural Science Foundation of China under Grant 61772378, in part by the Natural Science Foundation Project of Hubei under Grant 2018CFB690, and in part by the Doctoral Initiation Foundation of Hubei University of Science and Technology under Grant 2016-19XB012.

INTRODUCTION
RELATED WORK
EXPERIMENTS
BASELINES
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.