Abstract
The Internet is rich in information related to the financial field. The financial entity information text containing new internet vocabulary has a certain impact on the results of existing recognition algorithms. How to solve the problems of new vocabulary and polysemy is a problem to be solved in the current field. This paper proposes an ERNIE-Doc-BiLSTM-CRF named entity recognition model based on the pretrained language model. Compared with the traditional model, the ERNIE-Doc pretrained language model constructs a unique word vector from the word vector and combines the location coding, which solves polysemy problem well. The intensive skimming mechanism realizes the long text processing well and captures the context information effectively. The experimental results show that the accuracy of this model is 86.72%, the recall rate is 83.39%, and the F1 value is 85.02%, which is 13.36% higher than other models; the recall rate is increased by 13.05%, and the F1 value is increased by 13.21%.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.