Transliteration Using a Network of Phoneme Chunks

In-Ho Kang,Gil Chang Kim

doi:10.1142/s0219427905001183

Abstract

In this paper, we present methods of transliteration and back-transliteration. In Korean technical documents and web documents, many English words and Japanese words are transliterated into Korean words. These transliterated words are usually technical terms and proper nouns, so it is hard to find them in a dictionary. Therefore an automatic transliteration system is needed. Previous transliteration models restrict an information length to two or three letters per letter. However, most transliteration phenomena cannot be explained with a single standard rule especially in Korean. Various rules such as the origin of a word and profession of users are applied to each transliteration. The restriction of information length may lose the discriminative information of each transliteration rule. In this paper, we propose the methods that find similar words which have the longest overlap with an input word. To find similar words without the loss of each transliteration rule, phoneme chunks that do not have a length limit are used. By merging phoneme chunks, an input word is transliterated. With our proposed method, we could get 86% character accuracy and 53% word accuracy in an English-to-Korean transliteration test.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Transliteration Using a Network of Phoneme Chunks

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Processing of Languages

Lead the way for us

Journal: International Journal of Computer Processing of Languages	Publication Date: Mar 1, 2005
Citations: 5

Similar Papers

English-to-Korean transliteration using multiple unbounded overlapping phoneme chunks
In-Ho Kang ... Gilchang Kim
-
In-Ho Kang, et. al.In-Ho Kang ... Gilchang Kim
01 Jan 1999
01 Jan 1999

Manual response set in a Stroop-like task involving categorization of English and Japanese words indicates a common semantic representation.
Satoko Ikeda
Perceptual and Motor Skills | VOL. 87
Satoko IkedaSatoko Ikeda
01 Oct 1998
Perceptual and Motor Skills | VOL. 87

Hybrid English words in Korean and Japanese: a strange brew or an asset for global English?
Jieun Kiaer ... Anna Bordilovskaya
Asian Englishes | VOL. 19
Jieun Kiaer, et. al.Jieun Kiaer ... Anna Bordilovskaya
07 Feb 2017
Asian Englishes | VOL. 19

Neural substrates of passively listening to Japanese and English words, nonsense words by Japanese subjects: an fMRI study
Chang Cai ... Jinglong Wu
-
Chang Cai, et. al.Chang Cai ... Jinglong Wu
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Transliteration Using a Network of Phoneme Chunks

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Processing of Languages