Recognizing noisy romanized Japanese words in learner English

Ryo Nagata,Yukiko Yabuta,Hiromi Sugimoto,Jun-Ichi Kakegawa

doi:10.3115/1631836.1631840

Recognizing noisy romanized Japanese words in learner English

Ryo Nagata, Yukiko Yabuta + Show 2 more

Open Access

https://doi.org/10.3115/1631836.1631840

Copy DOI

Publication Date: Jan 1, 2008

Citations: 3

Affiliation: Konan University, Hyogo University of Teacher Education

#Small Set Of Rules #Learner English + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper describes a method for recognizing romanized Japanese words in learner English. They become noise and problematic in a variety of tasks including Part-Of-Speech tagging, spell checking, and error detection because they are mostly unknown words. A problem one encounters when recognizing romanized Japanese words in learner English is that the spelling rules of romanized Japanese words are often violated by spelling errors. To address the problem, the described method uses a clustering algorithm reinforced by a small set of rules. Experiments show that it achieves an F-measure of 0.879 and outperforms other methods. They also show that it only requires the target text and a fair size of English word list.

Full Text