Abstract

Named entity recognition (NER) is a common task in Natural Language Processing (NLP). To this end, we propose a novel approach based on pre-training model to complete the sequence labeling tasks by learning the large-scale real-world data from Brazilian legal documents. Especially, combining iterated dilated convolution[1] (IDCNN) and Bi-LSTM, we develop the scalable sequence labeling model named Sequence Tagging Model (STM) and extensive experiments validate the effectiveness of STM for NER tasks. Furthermore, compared with the IDCNN-CRF model, the experimental results show that the STM is better and the F1 score is 93.23%, which provides an important basis for NER tasks.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.