Abstract

Named entity recognition (also known as entity recognition, entity segmentation and entity extraction) is a sub task of information extraction. It aims to locate and classify named entities in text into predefined categories, such as people, organization, location, time expression, etc. Compared with English, there are more unsolved problems in Chinese named entity recognition. Named entities in English have obvious formal signs, that is, the first letter of every word in entities should be capitalized, and entity boundary recognition is relatively easy. Compared with English, the task of Chinese named entity recognition is more complex, and the recognition of entity boundary is more difficult. In this paper, we propose a named entity method by adding the word position, which embeds the word position of each word into the word vector, in order to better recognize the boundary of Chinese named entity. The experimental results show that the F1 value of the named entity recognition method proposed in this paper increases by about 1%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call