Abstract

In many applications, Chinese information is very often provided in the form of phonetic symbol sequences, and it is desired to decode such sequences into the corresponding Chinese character sequences (sentences) as the output. Phonetic input of Chinese characters into computers is a typical example. The problem is due primarily to the high degree of ambiguities caused by the large number of homonyms in Mandarin Chinese. In this paper, Markov models for Mandarin Chinese are developed to solve effectively the above decoding problem, and an efficient algorithm suitable for parallel processing based on dynamic programming is further proposed to search fully the solution space of exponential size in polynomial time. Extensive experiments were performed and the results show that appropriate models with proper training conditions can effectively solve the above problem, and the techniques developed here are also suitable for real-time applications.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.