Abstract

Language models adopted by most existing error detection and correction approaches of Chinese text are N-Gram models of characters, words or POS tags. Their deficiencies are that only the local language constraint is employed and there is no language model unification process. A feature-based automatic error detection and correction approach is presented. It uses both local language features and wide-scope semantic features. Winnow is adopted in the learning step. In experiment, this method achieved an error detection recall rate of 85%, precise rate of 41% and error correction rate of 51%. It shows that the approach performs better than the existing approaches based on N-Gram models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.