Abstract

At present, there is relatively little research on ancient Chinese texts in the field of digital humanities, and ancient Chinese information processing urgently needs new algorithms. To realize the word similarity calculation of pre-Qin classics, a total of 25 pre-Qin classics were first mapped into a language network. Based on local relative entropy, we proposed an improved weighted network node similarity calculation method (LREW). This method judges the similarity based on the local network characteristics of the nodes, and the degree of the nodes and the weight information of the edges between the nodes are considered. We used the relative entropy to calculate the difference in the amount of information between different nodes. After experimental comparison, compared with the existing LRE and RE algorithms based on relative entropy, the proposed LREW method can achieve the best results in calculating the similarity between words in the pre-Qin classics. Compared with CN, Jaccard, Salton, and CDSim algorithms based on common neighbor nodes, although the accuracy of LREW is low, the comprehensiveness of the similar word recognition is high, which can ensure that potential similar nodes in the network will not be missed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.