Abstract

With the rapid development of network technology and the popularization of electronic documents, Chinese text automatic proofreading technology has attracted increasing attention. Automatic proofreading of semantic errors in Chinese text is a key and difficult point in the field of Chinese information processing. Aiming at this problem, we propose a semantic error proofreading method that contains dependency parsing and statistical theory, and construct a two-layer semantic knowledge base to assist error detection and error correction. The two-layer semantic knowledge base includes (1) knowledge base of word collocations containing structured information of sentences extracted from a large-scale corpus; (2) knowledge base of sememe collocations obtained by sememe mapping through HowNet. On this basis, cubic association ratio and degree of polymerization are introduced to evaluate the proofreading results to reduce false positives and improve the accuracy of error correction opinions. The experiment result shows that our method will be of great use for the construction of semantic proofreading knowledge base and semantic error automatic proofreading methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call