Abstract

In order to identify the basic data dictionary of polar expedition information portal relating to web texts, the first character searching and the attribute matching algorithms are proposed through analyzing text structure and characteristics of the basic data dictionary, and a method to look for basic data is provided based on these algorithms, which divides the whole basic data dictionary into two parts, i.e., first character filtering and first character associated filtering, thus this paper establishes a two-layer filtering model. It is used to identify the first character matching in the text and determine the association degree between different phrases or sentences in the basic data dictionary through attribute matching algorithm. The experimental results show that compared with traditional direct search algorithm, this first character searching and attribute matching algorithm could not only look for gazetteer more finely from various levels, but also effectively improve the accuracy of looking for basic data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call