Abstract

This paper proposes a method to extract rules for anaphora resolution of Japanese zero pronouns from aligned sentence pairs. The method focuses on the characteristics of Japanese and English in which both the language families and the distribution of zero pronouns are very different. In this method, zero pronouns in the Japanese sentence and the English translation equivalents of their antecedents are extracted from Japanese and English aligned sentence pairs. Then resolution rules for Japanese zero pronouns are automatically extracted using the pairs of Japanese zero pronouns and translation equivalents of their antecedents in English and equivalent word/phrase pairs which were extracted from the aligned sentence pairs, based on the syntactic and semantic structure of the Japanese sentence. This method was implemented in the Japanese-to-English machine translation system, ALT-J/E. The evaluation showed that, for 371 zero pronouns with deictic reference in a sentence set for the evaluation of Japanese-to-English machine translation systems, the rules which were created automatically from Japanese and English aligned sentence pairs correctly resolved 99.2% of zero pronouns in a window test and 87.6% of zero pronouns in a blind test.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call