Abstract

To identify entries of the basic data dictionary of a polar expedition information portal in web texts, a first-character search algorithm and an attribute matching algorithm are proposed, based on an analysis of the text structure and characteristics of the basic data dictionary. A method for finding basic data is built on these algorithms. It divides the whole basic data dictionary into two parts, namely first-character filtering and first-character-associated filtering, and thereby establishes a two-layer filtering model. The model identifies first-character matches in the text and uses the attribute matching algorithm to determine the degree of association between different phrases or sentences in the basic data dictionary. The experimental results show that, compared with a traditional direct search algorithm, the combined first-character search and attribute matching algorithm not only finds gazetteer entries at a finer granularity across multiple levels, but also effectively improves the accuracy of basic data lookup.
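The abstract does not give the algorithms in detail, so the following is only a minimal sketch of how a two-layer filter of this kind might look, not the paper's implementation. It makes several assumptions: the dictionary maps each term to a set of attributes, the first layer indexes terms by their first character and scans the text for matches, and the second layer scores the association between two terms as the Jaccard overlap of their attribute sets. The names `DICTIONARY`, `build_first_char_index`, `first_char_search`, and `association_degree`, and all sample entries, are hypothetical.

```python
from collections import defaultdict

# Hypothetical dictionary: term -> attribute set (illustrative data only).
DICTIONARY = {
    "Zhongshan Station": {"station", "Antarctica"},
    "Great Wall Station": {"station", "Antarctica"},
    "Xuelong": {"vessel"},
}


def build_first_char_index(terms):
    """Layer 1 setup: group dictionary terms by their first character."""
    index = defaultdict(list)
    for term in terms:
        index[term[0]].append(term)
    # Try longer terms first so the longest match wins at each position.
    for bucket in index.values():
        bucket.sort(key=len, reverse=True)
    return index


def first_char_search(text, index):
    """Layer 1: scan the text and only compare terms whose first
    character equals the character at the current position."""
    hits = []
    i = 0
    while i < len(text):
        for term in index.get(text[i], ()):
            if text.startswith(term, i):
                hits.append((i, term))
                i += len(term)  # skip past the matched term
                break
        else:
            i += 1  # no term starts here; move to the next character
    return hits


def association_degree(a, b, dictionary):
    """Layer 2 (assumed measure): Jaccard overlap of two terms' attributes."""
    attrs_a, attrs_b = dictionary[a], dictionary[b]
    union = attrs_a | attrs_b
    return len(attrs_a & attrs_b) / len(union) if union else 0.0


index = build_first_char_index(DICTIONARY)
hits = first_char_search("Xuelong sailed to Zhongshan Station.", index)
```

Under these assumptions, a direct search compares every dictionary term at every text position, while the first-character index limits comparisons to the terms that can actually start at that position. The attribute score can then be used to rank or group the matched terms.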
